Xiuhong Li(李秀红)
Xiuhong Li(李秀红)
SenseTime & CUHK
Verified email at sensetime.com - Homepage
Title
Cited by
Cited by
Year
Enabling coordinated register allocation and thread-level parallelism optimization for GPUs
X Xie, Y Liang, X Li, Y Wu, G Sun, T Wang, D Fan
2015 48th Annual IEEE/ACM International Symposium on Microarchitecture …, 2015
662015
TGPA: tile-grained pipeline architecture for low latency CNN inference
X Wei, Y Liang, X Li, CH Yu, P Zhang, J Cong
Proceedings of the International Conference on Computer-Aided Design, 1-8, 2018
272018
A coordinated tiling and batching framework for efficient GEMM on GPUs
X Li, Y Liang, S Yan, L Jia, Y Li
Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019
162019
Efficient kernel management on GPUs
X Li, Y Liang
2016 Design, Automation & Test in Europe Conference & Exhibition (DATE), 85-90, 2016
152016
Performance-centric register file design for GPUs using racetrack memory
S Wang, Y Liang, C Zhang, X Xie, G Sun, Y Liu, Y Wang, X Li
2016 21st Asia and South Pacific Design Automation Conference (ASP-DAC), 25-30, 2016
152016
cuMBIR: An efficient framework for low-dose x-ray CT image reconstruction on GPUs
X Li, Y Liang, W Zhang, T Liu, H Li, G Luo, M Jiang
Proceedings of the 2018 International Conference on Supercomputing, 184-194, 2018
102018
Efficient kernel management on gpus
Y Liang, X Li
ACM Transactions on Embedded Computing Systems (TECS) 16 (4), 1-24, 2017
102017
CRAT: Enabling coordinated register allocation and thread-level parallelism optimization for GPUs
X Xie, Y Liang, X Li, Y Wu, G Sun, T Wang, D Fan
IEEE Transactions on Computers 67 (6), 890-897, 2017
52017
Exploring cache bypassing and partitioning for multi-tasking on GPUs
Y Liang, X Li, X Xie
2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 9-16, 2017
52017
CuLDA: solving large-scale LDA Problems on GPUs
X Xie, Y Liang, X Li, W Tan
Proceedings of the 28th International Symposium on High-Performance Parallel …, 2019
32019
CuLDA_CGS: solving large-scale LDA problems on GPUs
X Xie, Y Liang, X Li, W Tan
Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019
32019
Enabling efficient fast convolution algorithms on GPUs via MegaKernels
L Jia, Y Liang, X Li, L Lu, S Yan
IEEE Transactions on Computers 69 (7), 986-997, 2020
12020
The system can't perform the operation now. Try again later.
Articles 1–12