Max Ryabinin

Cited by

	All	Since 2019
Citations	1522	1521
h-index	11	11
i10-index	12	12

980

490

245

735

202120222023202421 77 971 446

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Alexander BorzunovOpenAIVerified email at openai.com
Michael DiskinLead Research Engineer, WB Tech & HSE UniversityVerified email at hse.ru
Tim DettmersUniversity of WashingtonVerified email at cs.washington.edu
Beidi ChenCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Gennady PekhimenkoUniversity of TorontoVerified email at cs.toronto.edu
Percy LiangAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Lianmin ZhengUC BerkeleyVerified email at berkeley.edu
Ying ShengPhD student of Stanford UniversityVerified email at stanford.edu
Ion StoicaProfessor of Computer Science, UC BerkeleyVerified email at cs.berkeley.edu
Binhang Yuan（袁彬航）Hong Kong University of Science and TechnologyVerified email at ust.hk
Christopher RéComputer Science, Stanford UniversityVerified email at cs.stanford.edu
Zhuohan LiUC BerkeleyVerified email at berkeley.edu
Ce ZhangTogether AIVerified email at together.xyz
Yacine JerniteResearch Scientist, HuggingFaceVerified email at cs.nyu.edu
Thomas WolfCo-founder at HuggingFaceVerified email at polytechnique.edu
Eduard GorbunovResearch Scientist, Mohamed bin Zayed University of Artificial IntelligenceVerified email at mbzuai.ac.ae
Younes BelkadaENS Paris SaclayVerified email at ens-paris-saclay.fr
Dmitry BaranchukYandex ResearchVerified email at yandex-team.ru
Colin RaffelUniversity of Toronto, Vector Institute and Hugging FaceVerified email at cs.toronto.edu
Andrey MalininStaff Research Scientist, Isomorphic LabsVerified email at isomorphiclabs.com

Max Ryabinin

Together AI

Verified email at together.ai - Homepage

deep learning natural language processing distributed training


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1153	2023
Flexgen: High-throughput generative inference of large language models with a single gpu Y Sheng, L Zheng, B Yuan, Z Li, M Ryabinin, B Chen, P Liang, C Ré, ... International Conference on Machine Learning, 31094-31116, 2023	112	2023
Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts M Ryabinin, A Gusev Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 3659–3672, 2020	41	2020
Distributed Deep Learning in Open Collaborations M Diskin, A Bukhtiyarov, M Ryabinin*, L Saulnier, Q Lhoest, A Sinitsin, ... Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021	40	2021
Petals: Collaborative inference and fine-tuning of large models A Borzunov, D Baranchuk, T Dettmers, M Ryabinin, Y Belkada, ... arXiv preprint arXiv:2209.01188, 2022	28	2022
Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices M Ryabinin, E Gorbunov, V Plokhotnyuk, G Pekhimenko Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021	26	2021
Scaling Ensemble Distribution Distillation to Many Classes With Proxy Targets M Ryabinin, A Malinin, M Gales Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021	21	2021
It's All in the Heads: Using Attention Heads as a Baseline for Cross-Lingual Transfer in Commonsense Reasoning A Tikhonov, M Ryabinin Findings of the ACL 2021, 3534–3546, 2021	21	2021
Secure Distributed Training at Scale E Gorbunov, A Borzunov, M Diskin, M Ryabinin International Conference on Machine Learning, 7679-7739, 2022	13	2022
Distributed methods with compressed communication for solving variational inequalities, with theoretical guarantees A Beznosikov, P Richtárik, M Diskin, M Ryabinin, A Gasnikov Advances in Neural Information Processing Systems 35, 14013-14029, 2022	12	2022
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient M Ryabinin, T Dettmers, M Diskin, A Borzunov arXiv preprint arXiv:2301.11913, 2023	11	2023
Embedding Words in Non-Vector Space with Unsupervised Graph Learning M Ryabinin, S Popov, L Prokhorenkova, E Voita Empirical Methods in Natural Language Processing (EMNLP 2020), 7317–7331, 2020	10	2020
RuCoLA: Russian corpus of linguistic acceptability V Mikhailov, T Shamardina, M Ryabinin, A Pestova, I Smurov, E Artemova arXiv preprint arXiv:2210.12814, 2022	9	2022
Training Transformers Together A Borzunov, M Ryabinin, T Dettmers, Q Lhoest, L Saulnier, M Diskin, ... Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track 176 …, 2022	9	2022
Distributed Inference and Fine-tuning of Large Language Models Over The Internet A Borzunov, M Ryabinin, A Chumachenko, D Baranchuk, T Dettmers, ... arXiv preprint arXiv:2312.08361, 2023	6	2023
Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics A Voronov, M Khoroshikh, A Babenko, M Ryabinin Advances in Neural Information Processing Systems 36, 2024	4*	2024
Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements A Voronov, L Wolf, M Ryabinin arXiv preprint arXiv:2401.06766, 2024	4	2024
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding Z Chen, A May, R Svirschevski, Y Huang, M Ryabinin, Z Jia, B Chen arXiv preprint arXiv:2402.12374, 2024	1	2024
Adaptive Prediction Time for Sequence Classification M Ryabinin, E Lobacheva	1	2018
The Hallucinations Leaderboard--An Open Effort to Measure Hallucinations in Large Language Models G Hong, AP Gema, R Saxena, X Du, P Nie, Y Zhao, L Perez-Beltrachini, ... arXiv preprint arXiv:2404.05904, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors