Learning from demonstration in the wild F Behbahani, K Shiarlis, X Chen, V Kurin, S Kasewa, C Stirbu, J Gomes, ... 2019 International Conference on Robotics and Automation (ICRA), 775-781, 2019 | 68 | 2019 |
Fast efficient hyperparameter tuning for policy gradient methods S Paul, V Kurin, S Whiteson Advances in Neural Information Processing Systems 32, 2019 | 66* | 2019 |
Alternating optimisation and quadrature for robust control S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, M Osborne, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 28* | 2018 |
Hierarchical model-based imitation learning for planning in autonomous driving E Bronstein, M Palatucci, D Notz, B White, A Kuefler, Y Lu, S Paul, ... 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022 | 22 | 2022 |
Fingerprint policy optimisation for robust reinforcement learning S Paul, MA Osborne, S Whiteson International Conference on Machine Learning, 5082-5091, 2019 | 18 | 2019 |
Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula E Bronstein, S Srinivasan, S Paul, A Sinha, M O’Kelly, P Nikdel, ... Conference on Robot Learning, 188-198, 2023 | 5 | 2023 |
Contextual policy optimisation S Paul, MA Osborne, S Whiteson CoRR, vol. abs/1805.10662, 2018 | 3 | 2018 |
Robust reinforcement learning with Bayesian optimisation and quadrature S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, MA Osborne, ... The Journal of Machine Learning Research 21 (1), 6020-6050, 2020 | | 2020 |
Towards robust reinforcement learning S Paul University of Oxford, 2020 | | 2020 |