Self-modification of policy and utility function in rational agents | T Everitt, D Filan, M Daswani, M Hutter | International Conference on Artificial General Intelligence, 1-11, 2016 | 25 | 2016 |
Feature reinforcement learning: state of the art | M Daswani, P Sunehag, M Hutter | Sequential decision-making with big data: papers from the AAAI-14 workshop, 2014 | 12 | 2014 |
A definition of happiness for reinforcement learning agents | M Daswani, J Leike | International Conference on Artificial General Intelligence, 231-240, 2015 | 7 | 2015 |
Q-learning for history-based reinforcement learning | M Daswani, P Sunehag, M Hutter | MIT Press, 2013 | 7 | 2013 |
Reinforcement learning with value advice | M Daswani, P Sunehag, M Hutter | Asian Conference on Machine Learning, 299-314, 2015 | 6 | 2015 |
Feature Reinforcement Learning using Looping Suffix Trees | M Daswani, P Sunehag, M Hutter | JMLR Workshop and Conference Proceedings: EWRL 2012 24, 11-24, 2012 | 6 | 2012 |
Generic Reinforcement Learning Beyond Small MDPs | M Daswani | The Australian National University, 2015 | | 2015 |