An overview of the BlueGene/L supercomputer NR Adiga, G Almási, GS Almasi, Y Aridor, R Barik, D Beece, R Bellofatto, ... SC'02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, 60-60, 2002 | 685 | 2002 |
Software transactional memory: Why is it only a research toy? C Cascaval, C Blundell, M Michael, HW Cain, P Wu, S Chiras, ... Communications of the ACM 51 (11), 40-46, 2008 | 430 | 2008 |
Implementation of a portable nested data-parallel language GE Blelloch, JC Hardwick, J Sipelstein, M Zagha, S Chatterjee Journal of parallel and distributed computing 21 (1), 4-14, 1994 | 336 | 1994 |
Exact analysis of the cache behavior of nested loops S Chatterjee, E Parker, PJ Hanlon, AR Lebeck ACM SIGPLAN Notices 36 (5), 286-297, 2001 | 243 | 2001 |
Recursive array layouts and fast parallel matrix multiplication S Chatterjee, AR Lebeck, PK Patnala, M Thottethodi Proceedings of the eleventh annual ACM symposium on Parallel algorithms and …, 1999 | 243 | 1999 |
Implementation of a portable nested data-parallel language GE Blelloch, JC Hardwick, S Chatterjee, J Sipelstein, M Zagha ACM Sigplan Notices 28 (7), 102-111, 1993 | 240 | 1993 |
Nonlinear array layouts for hierarchical memory systems S Chatterjee, VV Jain, AR Lebeck, S Mundhra, M Thottethodi Proceedings of the 13th international conference on Supercomputing, 444-453, 1999 | 216 | 1999 |
Automatic array alignment in data-parallel programs S Chatterjee, JR Gilbert, R Schreiber, SH Teng Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of …, 1993 | 198 | 1993 |
Scan primitives for vector computers S Chatterjee, GE Blelloch, M Zagha Carnegie Mellon University, 1990 | 158 | 1990 |
Generating local addresses and communication sets for data-parallel programs S Chatterjee, JR Gilbert, FJE Long, R Schreiber, SH Teng ACM SIGPLAN Notices 28 (7), 149-158, 1993 | 146 | 1993 |
Towards a theory of cache-efficient algorithms S Sen, S Chatterjee, N Dumir Journal of the ACM (JACM) 49 (6), 828-858, 2002 | 134 | 2002 |
Generating local addresses and communication sets for data-parallel programs S Chatterjee, JR Gilbert, FJE Long, R Schreiber, SH Teng Journal of Parallel and Distributed Computing 26 (1), 72-84, 1995 | 125 | 1995 |
Tuning Strassen's matrix multiplication for memory efficiency M Thottethodi, S Chatterjee, AR Lebeck SC'98: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing, 36-36, 1998 | 115 | 1998 |
VCODE: A data-parallel intermediate language GE Blelloch, S Chatterjee Proceedings Frontiers of Massively Parallel Computation, 471-480, 1990 | 97 | 1990 |
Cache-efficient matrix transposition S Chatterjee, S Sen Proceedings Sixth International Symposium on High-Performance Computer …, 2000 | 95 | 2000 |
Shared memory programming for large scale machines C Barton, C Cascaval, G Almási, Y Zheng, M Farreras, S Chatterjee, ... ACM SIGPLAN Notices 41 (6), 108-117, 2006 | 82 | 2006 |
Method for improving performance of executable code G Cascaval, S Chatterjee, E Duesterwald, A Kielstra, K Stoodley US Patent 7,954,094, 2011 | 81 | 2011 |
Computer architecture: Challenges and opportunities for the next decade T Agerwala, S Chatterjee IEEE Micro 25 (3), 58-69, 2005 | 78 | 2005 |
Cache-efficient multigrid algorithms S Sellappa, S Chatterjee The International Journal of High Performance Computing Applications 18 (1 …, 2004 | 70 | 2004 |
Unlocking the performance of the BlueGene/L supercomputer G Almasi, S Chatterjee, A Gara, J Gunnels, M Gupta, A Henning, ... SC'04: Proceedings of the 2004 ACM/IEEE conference on Supercomputing, 57-57, 2004 | 65 | 2004 |