IA-SpGEMM: An input-aware auto-tuning framework for parallel sparse matrix-matrix multiplication
Z Xie, G Tan, W Liu, N Sun
Proceedings of the ACM International Conference on Supercomputing, 94-105, 2019
Smart-PGSim: Using neural network to accelerate AC-OPF power grid simulation
W Dong, Z Xie, G Kestor, D Li
SC20: International Conference for High Performance Computing, Networking …, 2020
GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics
M Zvyagin, A Brace, K Hippe, Y Deng, B Zhang, CO Bohorquez, A Clyde, ...
The International Journal of High Performance Computing Applications 37 (6 …, 2023
Adaptive neural network-based approximation to accelerate Eulerian fluid simulation
W Dong, J Liu, Z Xie, D Li
Proceedings of the International Conference for High Performance Computing …, 2019
MD-HM: memoization-based molecular dynamics simulations on big memory system
Z Xie, W Dong, J Liu, I Peng, Y Ma, D Li
Proceedings of the ACM International Conference on Supercomputing, 215-226, 2021
Tahoe: tree structure-aware high performance inference engine for decision tree ensemble on GPU
Z Xie, W Dong, J Liu, H Liu, D Li
Proceedings of the Sixteenth European Conference on Computer Systems, 426-440, 2021
A pattern-based SpGEMM library for multi-core and many-core architectures
Z Xie, G Tan, W Liu, N Sun
IEEE Transactions on Parallel and Distributed Systems 33 (1), 159-175, 2021
Enabling energy-efficient DNN training on hybrid GPU-FPGA accelerators
X He, J Liu, Z Xie, H Chen, G Chen, W Zhang, D Li
Proceedings of the ACM International Conference on Supercomputing, 227-241, 2021
HPC-GPT: Integrating large language model for high-performance computing
X Ding, L Chen, M Emani, C Liao, PH Lin, T Vanderbruggen, Z Xie, ...
Proceedings of the SC'23 Workshops of The International Conference on High …, 2023
A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads
M Emani, Z Xie, S Raskar, V Sastry, W Arnold, B Wilson, R Thakur, ...
2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking …, 2022
Modeling traffic of big data platform for large scale datacenter networks
Z Xie, Z Cao, Z Wang, D Zang, E Shao, N Sun
2016 IEEE 22nd International Conference on Parallel and Distributed Systems …, 2016
TLB-pilot: Mitigating TLB Contention Attack on GPUs with Microarchitecture-Aware Scheduling
B Di, D Hu, Z Xie, J Sun, H Chen, J Ren, D Li
ACM Transactions on Architecture and Code Optimization (TACO) 19 (1), 1-23, 2021
Flame: A self-adaptive auto-labeling system for heterogeneous mobile processors
J Liu, J Liu, Z Xie, D Li
arXiv preprint arXiv:2003.01762, 2020
RIANN: Real-time incremental learning with approximate nearest neighbor on mobile devices
J Liu, Z Xie, D Nikolopoulos, D Li
2020 USENIX Conference on Operational Machine Learning (OpML 20), 2020
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators
M Emani, S Foreman, V Sastry, Z Xie, S Raskar, W Arnold, R Thakur, ...
arXiv preprint arXiv:2310.04607, 2023
A heterogeneous processing-in-memory approach to accelerate quantum chemistry simulation
Z Liu, Z Xie, W Dong, M Yuan, H You, D Li
Parallel Computing 116, 103017, 2023
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
G Verma, S Raskar, Z Xie, AM Malik, M Emani, B Chapman
arXiv preprint arXiv:2304.05430, 2023
Merchandiser: Data Placement on Heterogeneous Memory for Task-Parallel HPC Applications with Load-Balance Awareness
Z Xie, J Liu, J Li, D Li
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023
Throughput-oriented and Accuracy-aware DNN Training with BFloat16 on GPU
Z Xie, S Raskar, M Emani
2022 IEEE International Parallel and Distributed Processing Symposium …, 2022
LB-HM: load balance-aware data placement on heterogeneous memory for task-parallel HPC applications
Z Xie, J Liu, S Ma, J Li, D Li
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
