Mayank Daswani
Mayank Daswani
Подтвержден адрес электронной почты в домене google.com
Название
Процитировано
Процитировано
Год
Self-modification of policy and utility function in rational agents
T Everitt, D Filan, M Daswani, M Hutter
International Conference on Artificial General Intelligence, 1-11, 2016
252016
Feature reinforcement learning: state of the art
M Daswani, P Sunehag, M Hutter
Sequential decision-making with big data: papers from the AAAI-14 workshop, 2014
122014
A definition of happiness for reinforcement learning agents
M Daswani, J Leike
International Conference on Artificial General Intelligence, 231-240, 2015
72015
Q-learning for history-based reinforcement learning
M Daswani, P Sunehag, M Hutter
MIT Press, 2013
72013
Reinforcement learning with value advice
M Daswani, P Sunehag, M Hutter
Asian Conference on Machine Learning, 299-314, 2015
62015
Feature Reinforcement Learning using Looping Suffix Trees
M Daswani, P Sunehag, M Hutter
JMLR Workshop and Conference Proceedings : EWRL 2012 24, 11-24, 2012
62012
Generic Reinforcement Learning Beyond Small MDPs
M Daswani
The Australian National University, 2015
2015
В данный момент система не может выполнить эту операцию. Повторите попытку позднее.
Статьи 1–7