Minjia Zhang

Cited by

	All	Since 2019
Citations	3285	2874
h-index	24	21
i10-index	37	33

Citations per year

Year	Citations
2012	20
2013	24
2014	30
2015	68
2016	84
2017	94
2018	70
2019	121
2020	128
2021	185
2022	338
2023	1490
2024	609

Public access

Based on funding mandates: 16 articles available, 0 articles not available

Co-authors

He Yuxiong - Microsoft Research - Verified email at microsoft.com
Conglong Li - Senior Researcher at Microsoft, CMU Ph.D. - Verified email at microsoft.com
Michael D. Bond - Ohio State University - Verified email at cse.ohio-state.edu
Reza Yazdani Aminabadi - Microsoft Research - Verified email at microsoft.com
Zhewei Yao - Snowflake - Verified email at snowflake.com
Olatunji Ruwase - Microsoft Research - Verified email at microsoft.com
Swarnendu Biswas - Assistant Professor, IIT Kanpur - Verified email at cse.iitk.ac.in
Xiaoxia (Shirley) Wu 吴晓霞 - DeepSpeed Team @ Microsoft - Verified email at microsoft.com
Man Cao - Google, Ohio State University - Verified email at google.com
Dong Li - University of California, Merced - Verified email at ucmerced.edu
Jeff Rasley - Microsoft - Verified email at microsoft.com
Ammar Ahmad Awan - Microsoft - Verified email at osu.edu
Jie Ren - College of William & Mary - Verified email at wm.edu
Connor Holmes - Computer Science PhD Candidate, Colorado School of Mines - Verified email at mymail.mines.edu
Aritra Sengupta - Automated Reasoning Group, AWS - Verified email at cse.ohio-state.edu
Cheng Li - Microsoft - Verified email at microsoft.com
Di Wang - Microsoft - Verified email at microsoft.com
Milind Kulkarni - Associate Professor of Electrical and Computer Engineering, Purdue University - Verified email at purdue.edu
Jipeng Huang - Google - Verified email at cse.ohio-state.edu
Hari Subramoni - The Ohio State University - Verified email at cse.ohio-state.edu

Minjia Zhang

University of Illinois at Urbana-Champaign

Verified email at illinois.edu - Homepage

Parallelism, Distributed Systems, Machine Learning Systems, Natural Language Processing


Title	Cited by	Year
Bloom: A 176b-parameter open-access multilingual language model - T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1124	2023
Memcached design on high performance RDMA capable interconnects - J Jose, H Subramoni, M Luo, M Zhang, J Huang, M Wasi-ur-Rahman, ... - 2011 International Conference on Parallel Processing, 743-752, 2011	263	2011
Zero-offload: Democratizing billion-scale model training - J Ren, S Rajbhandari, RY Aminabadi, O Ruwase, S Yang, M Zhang, D Li, ... - 2021 USENIX Annual Technical Conference (USENIX ATC 21), 551-564, 2021	226	2021
Zeroquant: Efficient and affordable post-training quantization for large-scale transformers - Z Yao, R Yazdani Aminabadi, M Zhang, X Wu, C Li, Y He - Advances in Neural Information Processing Systems 35, 27168-27183, 2022	174	2022
Learning intrinsic sparse structures within long short-term memory - W Wen, Y He, S Rajbhandari, M Zhang, W Wang, F Liu, B Hu, Y Chen, ... - arXiv preprint arXiv:1709.05027, 2017	150	2017
Deepspeed-moe: Advancing mixture-of-experts inference and training to power next-generation ai scale - S Rajbhandari, C Li, Z Yao, M Zhang, RY Aminabadi, AA Awan, J Rasley, ... - International conference on machine learning, 18332-18346, 2022	139	2022
Deepspeed-inference: enabling efficient inference of transformer models at unprecedented scale - RY Aminabadi, S Rajbhandari, AA Awan, C Li, D Li, E Zheng, O Ruwase, ... - SC22: International Conference for High Performance Computing, Networking …, 2022	120	2022
DeepCPU: Serving RNN-based Deep Learning Models 10x Faster - M Zhang, S Rajbhandari, W Wang, Y He - 2018 USENIX Annual Technical Conference (USENIX ATC 18), 951-965, 2018	113	2018
Accelerating training of transformer-based language models with progressive layer dropping - M Zhang, Y He - Advances in Neural Information Processing Systems 33, 14011-14023, 2020	78	2020
Valor: Efficient, software-only region conflict exceptions - S Biswas, M Zhang, MD Bond, B Lucia - ACM SIGPLAN Notices 50 (10), 241-259, 2015	72	2015
Octet: Capturing and controlling cross-thread dependences efficiently - MD Bond, M Kulkarni, M Cao, M Zhang, M Fathi Salmi, S Biswas, ... - ACM SIGPLAN Notices 48 (10), 693-712, 2013	56	2013
Hm-ann: Efficient billion-point nearest neighbor search on heterogeneous memory - J Ren, M Zhang, D Li - Advances in Neural Information Processing Systems 33, 10672-10684, 2020	50	2020
Navigating with graph representations for fast and scalable decoding of neural language models - M Zhang, W Wang, X Liu, J Gao, Y He - Advances in neural information processing systems 31, 2018	45	2018
Hybrid static–dynamic analysis for statically bounded region serializability - A Sengupta, S Biswas, M Zhang, MD Bond, M Kulkarni - ACM SIGPLAN Notices 50 (4), 561-575, 2015	45	2015
Sentinel: Efficient tensor migration and allocation on heterogeneous memory systems for deep learning - J Ren, J Luo, K Wu, M Zhang, H Jeon, D Li - 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021	43	2021
Improving approximate nearest neighbor search through learned adaptive early termination - C Li, M Zhang, DG Andersen, Y He - Proceedings of the 2020 ACM SIGMOD International Conference on Management of …, 2020	42	2020
VirtCFT: A transparent VM-level fault-tolerant system for virtual clusters - M Zhang, H Jin, X Shi, S Wu - 2010 IEEE 16th International Conference on Parallel and Distributed Systems …, 2010	36	2010
Low-overhead software transactional memory with progress guarantees and strong semantics - M Zhang, J Huang, M Cao, MD Bond - Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of …, 2015	33*	2015
Lightweight data race detection for production runs - S Biswas, M Cao, M Zhang, MD Bond, BP Wood - Proceedings of the 26th International Conference on Compiler Construction, 11-21, 2017	31	2017
Grip: Multi-store capacity-optimized high-performance nearest neighbor search for vector search engine - M Zhang, Y He - Proceedings of the 28th ACM International Conference on Information and …, 2019	29	2019
