Learning to act using real-time dynamic programming AG Barto, SJ Bradtke, SP Singh Artificial intelligence 72 (1-2), 81-138, 1995 | 1648 | 1995 |
Linear least-squares algorithms for temporal difference learning SJ Bradtke, AG Barto Machine learning 22 (1), 33-57, 1996 | 993 | 1996 |
Adaptive linear quadratic control using policy iteration SJ Bradtke, BE Ydstie, AG Barto Proceedings of 1994 American Control Conference-ACC'94 3, 3475-3479, 1994 | 501 | 1994 |
Reinforcement learning methods for continuous-time Markov decision problems SJ Bradtke, MO Duff Advances in Neural Information Processing Systems 7 7, 393-400, 1995 | 498 | 1995 |
Real-time learning and control using asynchronous dynamic programming AG Barto, SJ Bradtke, SP Singh University of Massachusetts at Amherst, Department of Computer and …, 1991 | 236 | 1991 |
Reinforcement learning applied to linear quadratic regulation S Bradtke Advances in neural information processing systems 5, 1992 | 215 | 1992 |
Incremental dynamic programming for on-line adaptive optimal control SJ Bradtke University of Massachusetts at Amherst, 1994 | 71 | 1994 |
Some Experiments with Case-Based Search. S Bradtke, WG Lehnert AAAI, 133-138, 1988 | 38 | 1988 |
Learning to Solve Stochastic Optimal Path Problems Using Real-Time Dynamic Programming AG Barto, SJ Bradtke The Proceedings of the Seventh Yale Workshop on Adaptive and Learning …, 1992 | 2 | 1992 |