Phi-3 technical report: A highly capable language model locally on your phone M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ... arXiv preprint arXiv:2404.14219, 2024 | 541 | 2024 |
Deepspeed-chat: Easy, fast and affordable rlhf training of chatgpt-like models at all scales Z Yao, RY Aminabadi, O Ruwase, S Rajbhandari, X Wu, AA Awan, ... arXiv preprint arXiv:2308.01320, 2023 | 53 | 2023 |
Zero++: Extremely efficient collective communication for giant model training G Wang, H Qin, SA Jacobs, C Holmes, S Rajbhandari, O Ruwase, F Yan, ... arXiv preprint arXiv:2306.10209, 2023 | 36 | 2023 |
Swift machine learning model serving scheduling: a region based reinforcement learning approach H Qin, S Zawad, Y Zhou, L Yang, D Zhao, F Yan Proceedings of the International Conference for High Performance Computing …, 2019 | 33 | 2019 |
Deepspeed-fastgen: High-throughput text generation for llms via mii and deepspeed-inference C Holmes, M Tanaka, M Wyatt, AA Awan, J Rasley, S Rajbhandari, ... arXiv preprint arXiv:2401.08671, 2024 | 29 | 2024 |
The age of correlated features in supervised learning based forecasting MKC Shisher, H Qin, L Yang, F Yan, Y Sun IEEE INFOCOM 2021-IEEE Conference on Computer Communications Workshops …, 2021 | 21 | 2021 |
Reinforcement-learning-empowered MLaaS scheduling for serving intelligent internet of things H Qin, S Zawad, Y Zhou, S Padhi, L Yang, F Yan IEEE Internet of Things Journal 7 (7), 6325-6337, 2020 | 19 | 2020 |
Nemo: An open-source transformer-supercharged benchmark for fine-grained wildfire smoke detection A Yazdi, H Qin, CB Jordan, L Yang, F Yan Remote Sensing 14 (16), 3979, 2022 | 16 | 2022 |
ZeRO++: Extremely Efficient Collective Communication for Large Model Training G Wang, H Qin, SA Jacobs, X Wu, C Holmes, Z Yao, S Rajbhandari, ... The Twelfth International Conference on Learning Representations, 2024 | 6 | 2024 |
Deepspeed-visualchat: Multi-round multi-image interleave chat via multi-modal causal attention Z Yao, X Wu, C Li, M Zhang, H Qin, O Ruwase, AA Awan, S Rajbhandari, ... arXiv preprint arXiv:2309.14327, 2023 | 5 | 2023 |
Simigrad: Fine-grained adaptive batching for large scale training using gradient similarity measurement H Qin, S Rajbhandari, O Ruwase, F Yan, L Yang, Y He Advances in Neural Information Processing Systems 34, 20531-20544, 2021 | 5 | 2021 |
Enhancing Pavement Assessment with Dynamic Backcalculation: A Dynamic Finite Element Approach EY Hajj, R Skaff, PE Sebaaly, X Ma, H Qin, F Yan United States. Department of Transportation. Federal Aviation Administration …, 2024 | | 2024 |
Scalable and Efficient Machine Learning as a Service H Qin University of Nevada, Reno, 2022 | | 2022 |
The Age of Correlated Features in Supervised Learning based Forecasting M Kamran Chowdhury Shisher, H Qin, L Yang, F Yan, Y Sun arXiv e-prints, arXiv: 2103.00092, 2021 | | 2021 |