Behrooz Ghorbani

Cited by

	All	Since 2019
Citations	1155	1150
h-index	12	12
i10-index	14	14

360

180

270

20182019202020212022202320245 29 132 184 270 341 172

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Andrea MontanariProfessor of Statistics and Mathematics, Stanford UniversityVerified email at stanford.edu
Orhan FiratGoogle AIVerified email at google.com
Theodor MisiakiewiczResearch Assistant Professor, Toyota Technological Institute at ChicagoVerified email at ttic.edu
Song MeiAssistant Professor at UC BerkeleyVerified email at berkeley.edu
Justin GilmerGoogleVerified email at google.com
Ankush GargGoogleVerified email at google.com
Ying Xiao (肖盈)Twitter Cortex Applied ResearchVerified email at twitter.com
Xavier GarciaGoogleVerified email at google.com
Markus FreitagGoogleVerified email at google.com
Zachary NadoGoogle BrainVerified email at google.com
George E. DahlGoogle Inc.Verified email at google.com
Behnam NeyshaburSenior Staff Research Scientist, DeepMindVerified email at google.com
Shankar KrishnanGoogle ResearchVerified email at google.com
Colin CherryGoogle ResearchVerified email at google.com
Sneha KuduguntaGoogle DeepMindVerified email at google.com
Ankur BapnaSoftware Engineer, Google DeepmindVerified email at google.com
Patrick FernandesCarnegie Mellon University & Instituto Superior TécnicoVerified email at cs.cmu.edu
Biao ZhangGoogleVerified email at google.com
Ciprian ChelbaResearch Scientist, GoogleVerified email at google.com
Jeremy M CohenPhD student, Carnegie Mellon UniversityVerified email at andrew.cmu.edu

Behrooz Ghorbani

Researcher, OpenAI

Verified email at stanford.edu - Homepage

Foundation Models Science of Scaling Deep Learning Theory


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
An investigation into neural net optimization via hessian eigenvalue density B Ghorbani, S Krishnan, Y Xiao International Conference on Machine Learning, 2232-2241, 2019	287	2019
Linearized two-layers neural networks in high dimension B Ghorbani, S Mei, T Misiakiewicz, A Montanari The Annals of Statistics 49 (2), 1029-1054, 2021	227	2021
When do neural networks outperform kernel methods? B Ghorbani, S Mei, T Misiakiewicz, A Montanari Advances in Neural Information Processing Systems 33, 2020	176	2020
Limitations of lazy training of two-layers neural network B Ghorbani, S Mei, T Misiakiewicz, A Montanari Advances in Neural Information Processing Systems, 9111-9121, 2019	140	2019
Scaling Laws for Neural Machine Translation B Ghorbani, O Firat, M Freitag, A Bapna, M Krikun, X Garcia, C Chelba, ... arXiv preprint arXiv:2109.07740, 2021	68	2021
Do Current Multi-Task Optimization Methods in Deep Learning Even Help? D Xin, B Ghorbani, J Gilmer, A Garg, O Firat Advances in Neural Information Processing Systems 35, 13597-13609, 2022	39	2022
Adaptive Gradient Methods at the Edge of Stability JM Cohen, B Ghorbani, S Krishnan, N Agarwal, S Medapati, M Badura, ... arXiv preprint arXiv:2207.14484, 2022	37	2022
An instability in variational inference for topic models B Ghorbani, H Javadi, A Montanari International Conference on Machine Learning, 2221-2231, 2019	36	2019
Data Scaling Laws in NMT: The Effect of Noise and Architecture Y Bansal, B Ghorbani, A Garg, B Zhang, C Cherry, B Neyshabur, O Firat International Conference on Machine Learning, 1466-1482, 2022	31	2022
A Loss Curvature Perspective on Training Instability in Deep Learning J Gilmer, B Ghorbani, A Garg, S Kudugunta, B Neyshabur, D Cardoze, ... arXiv preprint arXiv:2110.04369, 2021	27	2021
A Loss Curvature Perspective on Training Instabilities of Deep Learning Models J Gilmer, B Ghorbani, A Garg, S Kudugunta, B Neyshabur, D Cardoze, ... International Conference on Learning Representations, 2021	25	2021
Scaling laws for multilingual neural machine translation P Fernandes, B Ghorbani, X Garcia, M Freitag, O Firat International Conference on Machine Learning, 10053-10071, 2023	14	2023
Epsilon Sampling Rocks: Investigating Sampling Strategies for\\Minimum Bayes Risk Decoding for Machine Translation M Freitag, B Ghorbani, P Fernandes arXiv preprint arXiv:2305.09860, 2023	12	2023
Examining scaling and transfer of language model architectures for machine translation B Zhang, B Ghorbani, A Bapna, Y Cheng, X Garcia, J Shen, O Firat International Conference on Machine Learning, 26176-26192, 2022	12	2022
Discussion of:“Nonparametric regression using deep neural networks with ReLU activation function” B Ghorbani, S Mei, T Misiakiewicz, A Montanari The Annals of Statistics 48 (4), 1898-1901, 2020	8	2020
Optimal Covariance Estimation for Condition Number Loss in the Spiked Model DL Donoho, B Ghorbani arXiv preprint arXiv:1810.07403, 2018	8	2018
Binarized Neural Machine Translation Y Zhang, A Garg, Y Cao, L Lew, B Ghorbani, Z Zhang, O Firat Advances in Neural Information Processing Systems 36, 2024	4	2024
The effect of network depth on the optimization landscape B Ghorbani, Y Xiao, S Krishnan ICML 2019 Workshop on Identifying and Understanding Deep Learning Phenomena, 2019	3	2019
A loss curvature perspective on training instability in deep learning J Gilmer, B Ghorbani, A Garg, SR Kudugunta, B Neyshabur, D Cardoze, ...	1	2022
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning D Choi, D Xin, H Dadkhahi, J Gilmer, A Garg, O Firat, CK Yeh, AM Dai, ... Thirty-seventh Conference on Neural Information Processing Systems, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors