Follow
Supratik Paul
Supratik Paul
Waymo
Verified email at google.com
Title
Cited by
Cited by
Year
Learning from demonstration in the wild
F Behbahani, K Shiarlis, X Chen, V Kurin, S Kasewa, C Stirbu, J Gomes, ...
2019 International Conference on Robotics and Automation (ICRA), 775-781, 2019
682019
Fast efficient hyperparameter tuning for policy gradient methods
S Paul, V Kurin, S Whiteson
Advances in Neural Information Processing Systems 32, 2019
66*2019
Alternating optimisation and quadrature for robust control
S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, M Osborne, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
28*2018
Hierarchical model-based imitation learning for planning in autonomous driving
E Bronstein, M Palatucci, D Notz, B White, A Kuefler, Y Lu, S Paul, ...
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
222022
Fingerprint policy optimisation for robust reinforcement learning
S Paul, MA Osborne, S Whiteson
International Conference on Machine Learning, 5082-5091, 2019
182019
Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
E Bronstein, S Srinivasan, S Paul, A Sinha, M O’Kelly, P Nikdel, ...
Conference on Robot Learning, 188-198, 2023
52023
Contextual policy optimisation
S Paul, MA Osborne, S Whiteson
CoRR, vol. abs/1805.10662, 2018
32018
Robust reinforcement learning with Bayesian optimisation and quadrature
S Paul, K Chatzilygeroudis, K Ciosek, JB Mouret, MA Osborne, ...
The Journal of Machine Learning Research 21 (1), 6020-6050, 2020
2020
Towards robust reinforcement learning
S Paul
University of Oxford, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–9