Timothy G. Rogers
Title
Cited by
Cited by
Year
Cache-conscious wavefront scheduling
TG Rogers, M O'Connor, TM Aamodt
2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 72-83, 2012
4782012
Divergence-aware warp scheduling
TG Rogers, M O'Connor, TM Aamodt
Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013
1572013
Characterizing and evaluating a key-value store application on heterogeneous CPU-GPU systems
TH Hetherington, TG Rogers, L Hsu, M O'Connor, TM Aamodt
2012 IEEE International Symposium on Performance Analysis of Systems …, 2012
1222012
GPGPU-Sim 3. x manual
TM Aamodt, WWL Fung, I Singh, A El-Shafiey, J Kwa, T Hetherington, ...
2012-08-08)[2013-08-08]. http:∥ gpgpu-sim. org/manual/index. php/GPGPU …, 2012
612012
A variable warp size architecture
TG Rogers, DR Johnson, M O'Connor, SW Keckler
ACM SIGARCH Computer Architecture News 43 (3S), 489-501, 2015
482015
Accel-Sim: An extensible simulation framework for validated GPU modeling
M Khairy, Z Shen, TM Aamodt, TG Rogers
2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020
37*2020
Pagoda: Fine-grained GPU resource virtualization for narrow tasks
TT Yeh, A Sabne, P Sakdhnagool, R Eigenmann, TG Rogers
ACM SIGPLAN Notices 52 (8), 221-234, 2017
332017
Lost in abstraction: Pitfalls of analyzing GPUs at the intermediate language level
A Gutierrez, BM Beckmann, A Dutu, J Gross, M LeBeane, J Kalamatianos, ...
2018 IEEE International Symposium on High Performance Computer Architecture …, 2018
322018
Analyzing machine learning workloads using a detailed GPU simulator
J Lew, DA Shah, S Pati, S Cattell, M Zhang, A Sandhupatla, C Ng, N Goli, ...
2019 IEEE International Symposium on Performance Analysis of Systems and …, 2019
262019
General-purpose graphics processor architectures
TM Aamodt, WWL Fung, TG Rogers
Synthesis Lectures on Computer Architecture 13 (2), 1-140, 2018
212018
Creating SIMD efficient code by transferring register state through common memory
TG Rogers, BM Beckmann, JM O'connor
US Patent 9,354,892, 2016
112016
A quantitative evaluation of contemporary gpu simulation methodology
A Jain, M Khairy, TG Rogers
Proceedings of the ACM on Measurement and Analysis of Computing Systems 2 (2 …, 2018
102018
Cache-conscious thread scheduling for massively multithreaded processors
TG Rogers, M O'Connor, TM Aamodt
IEEE Micro 33 (3), 78-85, 2013
72013
Dimensionality-aware redundant simt instruction elimination
TT Yeh, RN Green, TG Rogers
Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020
42020
Locality and scheduling in the massively multithreaded era
TG Rogers
University of British Columbia, 2015
42015
A detailed model for contemporary GPU memory systems
M Khairy, A Jain, TM Aamodt, TG Rogers
2019 IEEE International Symposium on Performance Analysis of Systems and …, 2019
32019
Learning your limit: managing massively multithreaded caches through scheduling
TG Rogers, M O'Connor, TM Aamodt
Communications of the ACM 57 (12), 91-98, 2014
32014
Locality-centric data and threadblock management for massive GPUs
M Khairy, V Nikiforov, D Nellans, TG Rogers
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
22020
Deterministic atomic buffering
YH Chou, C Ng, S Cattell, J Intan, MD Sinclair, J Devietti, TG Rogers, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
22020
SST_GPU: An Execution-Driven CUDA Kernel Scheduler and Streaming-Multiprocessor Compute Model.
M Khairy, M Zhang, R Green, SD Hammond, RJ Hoekstra, T Rogers, ...
Sandia National Lab.(SNL-NM), Albuquerque, NM (United States), 2019
22019
The system can't perform the operation now. Try again later.
Articles 1–20