Cosmin Paduraru

Cited by

	All	Since 2019
Citations	2156	2069
h-index	16	15
i10-index	19	16

640

320

160

480

2017201820192020202120222023202412 19 40 94 253 405 633 640

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Cosmin Paduraru

DeepMind

Verified email at google.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ... Machine Learning 110 (9), 2419-2468, 2021	458*	2021
Safe exploration in continuous action spaces G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa arXiv preprint arXiv:1801.08757, 2018	452	2018
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	443	2023
Rl unplugged: A suite of benchmarks for offline reinforcement learning C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020	175*	2020
Hyperparameter selection for offline reinforcement learning TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ... arXiv preprint arXiv:2007.09055, 2020	142	2020
Faster sorting algorithms discovered using deep reinforcement learning DJ Mankowitz, A Michi, A Zhernov, M Gelmi, M Selvi, C Paduraru, ... Nature 618 (7964), 257-263, 2023	81	2023
Benchmarks for deep off-policy evaluation J Fu, M Norouzi, O Nachum, G Tucker, Z Wang, A Novikov, M Yang, ... arXiv preprint arXiv:2103.16596, 2021	70	2021
Autoregressive dynamics models for offline policy evaluation and optimization MR Zhang, TL Paine, O Nachum, C Paduraru, G Tucker, Z Wang, ... arXiv preprint arXiv:2104.13877, 2021	42	2021
Off-policy evaluation in Markov decision processes C Paduraru McGill University, 2013	37	2013
Responding to new information in a mining complex: Fast mechanisms using machine learning C Paduraru, R Dimitrakopoulos Mining Technology, 2019	28	2019
Off-policy learning with options and recognizers D Precup, C Paduraru, A Koop, RS Sutton, S Singh Advances in Neural Information Processing Systems 18, 2005	28	2005
Coptidice: Offline constrained reinforcement learning via stationary distribution correction estimation J Lee, C Paduraru, DJ Mankowitz, N Heess, D Precup, KE Kim, A Guez arXiv preprint arXiv:2204.08957, 2022	25	2022
Adaptive policies for short-term material flow optimization in a mining complex C Paduraru, R Dimitrakopoulos Mining Technology 127 (1), 56-63, 2018	25	2018
Active offline policy selection K Konyushova, Y Chen, T Paine, C Gulcehre, C Paduraru, DJ Mankowitz, ... Advances in Neural Information Processing Systems 34, 24631-24644, 2021	20	2021
Development and validation of a supervised machine learning radar Doppler spectra peak-finding algorithm H Kalesse, T Vogl, C Paduraru, E Luke Atmospheric Measurement Techniques 12 (8), 4591-4617, 2019	19	2019
Controlling commercial cooling systems using reinforcement learning J Luo, C Paduraru, O Voicu, Y Chervonyi, S Munns, J Li, C Qian, P Dutta, ... arXiv preprint arXiv:2211.07357, 2022	17	2022
Grounding Abstractions in Predictive State Representations. B Tanner, V Bulitko, A Koop, C Paduraru IJCAI, 1077-1082, 2007	14	2007
Transformers meet directed graphs S Geisler, Y Li, DJ Mankowitz, AT Cemgil, S Günnemann, C Paduraru International Conference on Machine Learning, 11144-11172, 2023	13	2023
An empirical analysis of off-policy learning in discrete mdps C Păduraru, D Precup, J Pineau, G Comănici European Workshop on Reinforcement Learning, 89-102, 2013	10	2013
Robust constrained reinforcement learning for continuous control with model misspecification DJ Mankowitz, DA Calian, R Jeong, C Paduraru, N Heess, S Dathathri, ... arXiv preprint arXiv:2010.10644, 2020	8	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by