Yu Bai

Cited by

	All	Since 2019
Citations	2545	2484
h-index	24	24
i10-index	40	40

740

370

185

555

2017201820192020202120222023202417 44 84 169 352 480 732 665

Public access

View all

16 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Song MeiAssistant Professor at UC BerkeleyVerified email at berkeley.edu
Huan WangSalesforce ResearchVerified email at yale.edu
Caiming XiongSalesforce ResearchVerified email at salesforce.com
Chi JinAssistant Professor, Princeton UniversityVerified email at princeton.edu
Yu-Xiang WangAssociate Professor @ UC San DiegoVerified email at ucsd.edu
Tiancheng YuTwo SigmaVerified email at mit.edu
Nan JiangAssistant Professor of Computer Science, UIUCVerified email at illinois.edu
Jason D. LeeAssociate Professor of Electrical Engineering and Computer Science, Princeton UniversityVerified email at princeton.edu
Tengyang XieAssistant Professor of Computer Science, University of Wisconsin-MadisonVerified email at cs.wisc.edu
Minshuo ChenPrinceton UniversityVerified email at princeton.edu
Fan ChenMassachusetts Institute of TechnologyVerified email at mit.edu
Andrea MontanariProfessor of Statistics and Mathematics, Stanford UniversityVerified email at stanford.edu
Qinghua LiuPrinceton UniversityVerified email at princeton.edu
Ming YinPrinceton UniversityVerified email at princeton.edu
Ziang SongStanford UniversityVerified email at stanford.edu
Sham M KakadeHarvard UniversityVerified email at seas.harvard.edu
Tuo ZhaoAssistant Professor, Georgia TechVerified email at gatech.edu
Edo Libertypinecone.ioVerified email at edoliberty.com
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton UniversityVerified email at princeton.edu
Tianyu GuoPh.D. student, UC BerkeleyVerified email at berkeley.edu

Yu Bai

OpenAI

Verified email at salesforce.com - Homepage

Machine Learning Statistics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The landscape of empirical risk for nonconvex losses S Mei, Y Bai, A Montanari The Annals of Statistics 46 (6A), 2747-2774, 2018	354	2018
Provable self-play algorithms for competitive reinforcement learning Y Bai, C Jin International conference on machine learning, 551-560, 2020	170	2020
Policy finetuning: Bridging sample-efficient offline and online reinforcement learning T Xie, N Jiang, H Wang, C Xiong, Y Bai Advances in neural information processing systems 34, 27395-27407, 2021	155	2021
Near-Optimal Reinforcement Learning with Self-Play Y Bai, C Jin, T Yu Advances in Neural Information Processing Systems, 2020, 2020	143	2020
A sharp analysis of model-based reinforcement learning with self-play Q Liu, T Yu, Y Bai, C Jin International Conference on Machine Learning, 7001-7010, 2021	140	2021
Beyond linearization: On quadratic and higher-order approximation of wide neural networks Y Bai, JD Lee International Conference on Learning Representations (ICLR) 2020, 2019	130	2019
Proxquant: Quantized neural networks via proximal operators Y Bai, YX Wang, E Liberty International Conference on Learning Representations (ICLR) 2019, 2018	120	2018
Provably Efficient Q-Learning with Low Switching Cost Y Bai, T Xie, N Jiang, YX Wang Advances in Neural Information Processing Systems, 2019, 2019	103	2019
Transformers as statisticians: Provable in-context learning with in-context algorithm selection Y Bai, F Chen, H Wang, C Xiong, S Mei Advances in neural information processing systems 36, 2024	101	2024
When can we learn general-sum Markov games with a large number of players sample-efficiently? Z Song, S Mei, Y Bai International Conference on Learning Representations (ICLR) 2022, 2021	96	2021
Near-optimal provable uniform convergence in offline policy evaluation for reinforcement learning M Yin, Y Bai, YX Wang International Conference on Artificial Intelligence and Statistics, 1567-1575, 2021	89*	2021
Approximability of discriminators implies diversity in GANs Y Bai, T Ma, A Risteski International Conference on Learning Representations (ICLR) 2019, 2018	85	2018
How important is the train-validation split in meta-learning? Y Bai, M Chen, P Zhou, T Zhao, J Lee, S Kakade, H Wang, C Xiong International Conference on Machine Learning, 543-553, 2021	77	2021
Sample-efficient learning of stackelberg equilibria in general-sum games Y Bai, C Jin, H Wang, C Xiong Advances in Neural Information Processing Systems 34, 25799-25811, 2021	71	2021
Near-optimal offline reinforcement learning via double variance reduction M Yin, Y Bai, YX Wang Advances in neural information processing systems 34, 7677-7688, 2021	66	2021
Subgradient descent learns orthogonal dictionaries Y Bai, Q Jiang, J Sun International Conference on Learning Representations (ICLR) 2019, 2018	59	2018
The role of coverage in online reinforcement learning T Xie, DJ Foster, Y Bai, N Jiang, SM Kakade arXiv preprint arXiv:2210.04157, 2022	55	2022
Towards understanding hierarchical learning: Benefits of neural representations M Chen, Y Bai, JD Lee, T Zhao, H Wang, C Xiong, R Socher Advances in Neural Information Processing Systems, 2020, 2020	48	2020
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification Y Bai, S Mei, H Wang, C Xiong International Conference on Machine Learning, 566-576, 2021	46	2021
Unified algorithms for rl with decision-estimation coefficients: No-regret, pac, and reward-free learning F Chen, S Mei, Y Bai arXiv preprint arXiv:2209.11745, 2022	34	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors