Mostafa Dehghani
Research Scientist, Google DeepMind
Verified email at google.com
Title · Cited by · Year
An image is worth 16x16 words: Transformers for image recognition at scale
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2020
Cited by 34083 · 2020
ViViT: A video vision transformer
A Arnab*, M Dehghani*, G Heigold, C Sun, M Lučić, C Schmid
arXiv preprint arXiv:2103.15691, 2021
Cited by 1790 · 2021
Scaling instruction-finetuned language models
HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ...
Journal of Machine Learning Research 25 (70), 1-53, 2024
Cited by 1513 · 2024
Efficient Transformers: A Survey
Y Tay, M Dehghani, D Bahri, D Metzler
ACM Computing Surveys 55 (6), 1–28, 2022
Cited by 1057* · 2022
Universal Transformers
M Dehghani, S Gouws, O Vinyals, J Uszkoreit, Ł Kaiser
International Conference on Learning Representations (ICLR), 2019
Cited by 887 · 2019
PaLM 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
Cited by 753 · 2023
Long Range Arena: A Benchmark for Efficient Transformers
Y Tay*, M Dehghani*, S Abnar, Y Shen, D Bahri, P Pham, J Rao, L Yang, ...
arXiv preprint arXiv:2011.04006, 2020
Cited by 496 · 2020
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by 407 · 2023
Neural Ranking Models with Weak Supervision
M Dehghani, H Zamani, A Severyn, J Kamps, WB Croft
The 40th International ACM SIGIR Conference on Research and Development in …, 2017
Cited by 402 · 2017
MetNet: A neural weather model for precipitation forecasting
CK Sønderby, L Espeholt, J Heek, M Dehghani, A Oliver, T Salimans, ...
arXiv preprint arXiv:2003.12140, 2020
Cited by 288 · 2020
UL2: Unifying language learning paradigms
Y Tay, M Dehghani, VQ Tran, X Garcia, J Wei, X Wang, HW Chung, ...
arXiv preprint arXiv:2205.05131, 2022
Cited by 267 · 2022
Simple open-vocabulary object detection
M Minderer, A Gritsenko, A Stone, M Neumann, D Weissenborn, ...
European Conference on Computer Vision, 728-755, 2022
Cited by 264 · 2022
Scaling vision transformers to 22 billion parameters
M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ...
International Conference on Machine Learning, 7480-7512, 2023
Cited by 251 · 2023
Parameter-efficient multi-task fine-tuning for transformers via shared hypernetworks
RK Mahabadi, S Ruder, M Dehghani, J Henderson
arXiv preprint arXiv:2106.04489, 2021
Cited by 211 · 2021
From neural re-ranking to neural ranking: Learning a sparse representation for inverted indexing
H Zamani, M Dehghani, WB Croft, E Learned-Miller, J Kamps
Proceedings of the 27th ACM international conference on information and …, 2018
Cited by 171 · 2018
Transformer memory as a differentiable search index
Y Tay, V Tran, M Dehghani, J Ni, D Bahri, H Mehta, Z Qin, K Hui, Z Zhao, ...
Advances in Neural Information Processing Systems 35, 21831-21843, 2022
Cited by 146 · 2022
TokenLearner: Adaptive space-time tokenization for videos
M Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova
Advances in neural information processing systems 34, 12786-12797, 2021
Cited by 124 · 2021
Learning to Attend, Copy, and Generate for Session-Based Query Suggestion
M Dehghani, S Rothe, E Alfonseca, P Fleury
International Conference on Information and Knowledge Management (CIKM'17), 2017
Cited by 120 · 2017
Scale efficiently: Insights from pre-training and fine-tuning transformers
Y Tay, M Dehghani, J Rao, W Fedus, S Abnar, HW Chung, S Narang, ...
arXiv preprint arXiv:2109.10686, 2021
Cited by 104 · 2021
Exploring the limits of large scale pre-training
S Abnar, M Dehghani, B Neyshabur, H Sedghi
arXiv preprint arXiv:2110.02095, 2021
Cited by 102 · 2021
Articles 1–20