Подписаться
Tetsuro Morimura
Tetsuro Morimura
CyberAgent, Inc.
Подтвержден адрес электронной почты в домене cyberagent.co.jp
Название
Процитировано
Процитировано
Год
Nonparametric return distribution approximation for reinforcement learning
T Morimura, M Sugiyama, H Kashima, H Hachiya, T Tanaka
Proceedings of the 27th International Conference on Machine Learning (ICML …, 2010
2862010
Parametric return density estimation for reinforcement learning
T Morimura, M Sugiyama, H Kashima, H Hachiya, T Tanaka
arXiv preprint arXiv:1203.3497, 2012
1462012
Map matching with hidden Markov model on sampled road network
R Raymond, T Morimura, T Osogami, N Hirosue
Proceedings of the 21st international conference on pattern recognition …, 2012
842012
これからの強化学習
牧野, 澁谷, 長史, 白川, 浅田
(No Title), 2016
522016
Ibm mega traffic simulator
T Osogami, T Imamichi, H Mizuta, T Morimura, R Raymond, T Suzumura, ...
IBM Res., Tokyo, Japan, IBM Res. Rep. RT0896, 2012
452012
City-wide traffic flow estimation from a limited number of low-quality cameras
T Idé, T Katsuki, T Morimura, R Morris
IEEE Transactions on Intelligent Transportation Systems 18 (4), 950-959, 2016
442016
Utilizing the natural gradient in temporal difference reinforcement learning with eligibility traces
T Morimura, E Uchibe, K Doya
International Symposium on Information Geometry and Its Applications, 256-263, 2005
432005
Solving inverse problem of Markov chain with partial observations
T Morimura, T Osogami, T Idé
Advances in neural information processing systems 26, 2013
392013
Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning
T Morimura, E Uchibe, J Yoshimoto, J Peters, K Doya
Neural computation 22 (2), 342-376, 2010
352010
Assistance generation
T Katsuki, T Morimura
US Patent 10,878,337, 2020
282020
Updating policy parameters under Markov decision process system environment
T Morimura, T Osogami, T Shirai
US Patent 8,818,925, 2014
242014
A generalized natural actor-critic algorithm
T Morimura, E Uchibe, J Yoshimoto, K Doya
Advances in neural information processing systems 22, 2009
222009
強化学習
森村哲郎
講談社, 2019
202019
A new natural policy gradient by stationary distribution metric
T Morimura, E Uchibe, J Yoshimoto, K Doya
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2008
192008
Cooperative neural network reinforcement learning
S Dasgupta, T Morimura, T Osogami
US Patent App. 15/647,543, 2019
182019
Adaptive step-size policy gradients with average reward metric
T Matsubara, T Morimura, J Morimoto
Proceedings of 2nd Asian Conference on Machine Learning, 285-298, 2010
152010
A consistent method for graph based anomaly localization
S Hara, T Morimura, T Takahashi, H Yanagisawa, T Suzuki
Artificial intelligence and statistics, 333-341, 2015
132015
Determining optimal action in consideration of risk
T Morimura, T Osogami
US Patent 8,639,556, 2014
132014
Statistical origin-destination generation with multiple sources
T Morimura, S Kato
Proceedings of the 21st International Conference on Pattern Recognition …, 2012
132012
Identification of antibiotic clarithromycin binding peptide displayed by T7 phage particles
T Morimura, N Noda, Y Kato, T Watanabe, T Saitoh, T Yamazaki, ...
The Journal of Antibiotics 59 (10), 625-632, 2006
122006
В данный момент система не может выполнить эту операцию. Повторите попытку позднее.
Статьи 1–20