Follow
Aravind Sukumaran-Rajam
Aravind Sukumaran-Rajam
Unknown affiliation
Verified email at meta.com
Title
Cited by
Cited by
Year
Adaptive sparse tiling for sparse matrix multiplication
C Hong, A Sukumaran-Rajam, I Nisa, K Singh, P Sadayappan
Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019
1482019
Register optimizations for stencils on GPUs
PS Rawat, F Rastello, A Sukumaran-Rajam, LN Pouchet, A Rountev, ...
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of …, 2018
622018
A code generator for high-performance tensor contractions on GPUs
J Kim, A Sukumaran-Rajam, V Thumma, S Krishnamoorthy, A Panyala, ...
2019 IEEE/ACM International Symposium on Code Generation and Optimization …, 2019
542019
Domain-specific optimization and generation of high-performance GPU code for stencil computations
PS Rawat, M Vaidya, A Sukumaran-Rajam, M Ravishankar, V Grover, ...
Proceedings of the IEEE 106 (11), 1902-1920, 2018
522018
Analytical characterization and design space exploration for optimization of CNNs
R Li, Y Xu, A Sukumaran-Rajam, A Rountev, P Sadayappan
Proceedings of the 26th ACM International Conference on Architectural …, 2021
502021
Load-balanced sparse mttkrp on gpus
I Nisa, J Li, A Sukumaran-Rajam, R Vuduc, P Sadayappan
2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019
482019
Efficient sparse-matrix multi-vector product on gpus
C Hong, A Sukumaran-Rajam, B Bandyopadhyay, J Kim, SE Kurt, I Nisa, ...
Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018
472018
MultiGraph: Efficient graph processing on GPUs
C Hong, A Sukumaran-Rajam, J Kim, P Sadayappan
2017 26th International Conference on Parallel Architectures and Compilation …, 2017
422017
Effective machine learning based format selection and performance modeling for SpMV on GPUs
I Nisa, C Siegel, AS Rajam, A Vishnu, P Sadayappan
2018 IEEE International Parallel and Distributed Processing Symposium …, 2018
412018
Sampled dense matrix multiplication for high-performance machine learning
I Nisa, A Sukumaran-Rajam, SE Kurt, C Hong, P Sadayappan
2018 IEEE 25th International Conference on High Performance Computing (HiPC …, 2018
392018
The polyhedral model of nonlinear loops
A Sukumaran-Rajam, P Clauss
ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-27, 2015
392015
An efficient mixed-mode representation of sparse tensors
I Nisa, J Li, A Sukumaran-Rajam, PS Rawat, S Krishnamoorthy, ...
Proceedings of the International Conference for High Performance Computing …, 2019
372019
Israt Nisa, Shivani Sabhlok, Ümit V. Çatalyürek, Srinivasan Parthasarathy, and P. Sadayappan. 2018. Efficient Sparse-Matrix Multi-Vector Product on GPUs
C Hong, A Sukumaran-Rajam, B Bandyopadhyay, J Kim, SE Kurt
Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018
362018
On optimizing complex stencils on GPUs
PS Rawat, M Vaidya, A Sukumaran-Rajam, A Rountev, LN Pouchet, ...
2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019
352019
Analytical cache modeling and tilesize optimization for tensor contractions
R Li, A Sukumaran-Rajam, R Veras, TM Low, F Rastello, A Rountev, ...
Proceedings of the International Conference for High Performance Computing …, 2019
342019
Parallel ccd++ on gpu for matrix factorization
I Nisa, A Sukumaran-Rajam, R Kunchum, P Sadayappan
Proceedings of the General Purpose GPUs, 73-83, 2017
252017
On improving performance of sparse matrix-matrix multiplication on gpus
R Kunchum, A Chaudhry, A Sukumaran-Rajam, Q Niu, I Nisa, ...
Proceedings of the International Conference on Supercomputing, 1-11, 2017
232017
APOLLO: Automatic speculative polyhedral loop optimizer
JMM Caamaño, A Sukumaran-Rajam, A Baloian, M Selva, P Clauss
IMPACT 2017-7th International Workshop on Polyhedral Compilation Techniques, 8, 2017
232017
Efficient tiled sparse matrix multiplication through matrix signatures
SE Kurt, A Sukumaran-Rajam, F Rastello, P Sadayyapan
SC20: International Conference for High Performance Computing, Networking …, 2020
222020
Optimizing tensor contractions in ccsd (t) for efficient execution on gpus
J Kim, A Sukumaran-Rajam, C Hong, A Panyala, RK Srivastava, ...
Proceedings of the 2018 International Conference on Supercomputing, 96-106, 2018
222018
The system can't perform the operation now. Try again later.
Articles 1–20