Follow
Jinghan Yao
Title
Cited by
Cited by
Year
Soft: Softmax-free transformer with linear complexity
J Lu, J Yao, J Zhang, X Zhu, H Xu, W Gao, C Xu, T Xiang, L Zhang
Advances in Neural Information Processing Systems 34, 21297-21309, 2021
1552021
SPRNet: single-pixel reconstruction for one-stage instance segmentation
J Yu, J Yao, J Zhang, Z Yu, D Tao
IEEE transactions on cybernetics 51 (4), 1731-1742, 2020
812020
A novel framework for efficient offloading of communication operations to bluefield smartnics
KK Suresh, B Michalowicz, B Ramesh, N Contini, J Yao, S Xu, A Shafi, ...
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
72023
MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators
CC Chen, K Shafie Khorassani, P Kousha, Q Zhou, J Yao, H Subramoni, ...
Proceedings of the SC'23 Workshops of The International Conference on High …, 2023
32023
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference
J Yao, Q Anthony, A Shafi, H Subramoni
38th IEEE International Parallel & Distributed Processing Symposium (IPDPS'24), 2024
22024
Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer
J Yao, SA Jacobs, M Tanaka, O Ruwase, A Shafi, H Subramoni, ...
arXiv preprint arXiv:2408.16978, 2024
2024
Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference
J Yao, N Alnaasan, T Chen, A Shafi, H Subramoni
In Proceeding of 30th IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE …, 2023
2023
Supplementary for SOFT: Softmax-free Transformer with Linear Complexity
J Lu, J Yao, J Zhang, X Zhu, H Xu, W Gao, C Xu, T Xiang, L Zhang
The system can't perform the operation now. Try again later.
Articles 1–8