John Pennycook
Title
Cited by
Year
Exploring SIMD for Molecular Dynamics, Using Intel Xeon Processors and Intel Xeon Phi Coprocessors
SJ Pennycook, CJ Hughes, M Smelyanskiy, SA Jarvis
IEEE International Parallel & Distributed Processing Symposium, 2013
Cited by: 198, Year: 2013
CosmoFlow: Using deep learning to learn the universe at scale
A Mathuriya, D Bard, P Mendygral, L Meadows, J Arnemann, L Shao, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
Cited by: 94, Year: 2018
Performance analysis of a hybrid MPI/CUDA implementation of the NAS-LU benchmark
SJ Pennycook, SD Hammond, SA Jarvis, GR Mudalige
ACM SIGMETRICS Performance Evaluation Review 38 (4), 23-29, 2011
Cited by: 88, Year: 2011
An investigation of the performance portability of OpenCL
SJ Pennycook, SD Hammond, SA Wright, JA Herdman, I Miller, SA Jarvis
Journal of Parallel and Distributed Computing 73 (11), 1439-1450, 2013
Cited by: 83, Year: 2013
Implications of a metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
Future Generation Computer Systems 92, 947-958, 2019
Cited by: 59, Year: 2019
A metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
arXiv preprint arXiv:1611.07409, 2016
Cited by: 58, Year: 2016
Parallel file system analysis through application I/O tracing
SA Wright, SD Hammond, SJ Pennycook, RF Bird, JA Herdman, I Miller, ...
The Computer Journal 56 (2), 141-155, 2013
Cited by: 35, Year: 2013
Effective performance portability
SL Harrell, J Kitson, R Bird, SJ Pennycook, J Sewall, D Jacobsen, ...
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
Cited by: 29, Year: 2018
On the acceleration of wavefront applications using distributed many-core architectures
SJ Pennycook, SD Hammond, GR Mudalige, SA Wright, SA Jarvis
The Computer Journal 55 (2), 138-153, 2012
Cited by: 29, Year: 2012
Developing performance-portable molecular dynamics kernels in OpenCL
SJ Pennycook, SA Jarvis
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
Cited by: 22, Year: 2012
Methods and apparatus for multi-load and multi-store vector instructions
L Meadows, A Duran, S Pennycook, J Sewall
US Patent App. 15/859,033, 2019
Cited by: 14, Year: 2019
LDPLFS: Improving I/O performance without application modification
SA Wright, SD Hammond, SJ Pennycook, I Miller, JA Herdman, SA Jarvis
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
Cited by: 13, Year: 2012
Evaluating the impact of proposed OpenMP 5.0 features on performance, portability and productivity
SJ Pennycook, JD Sewall, JR Hammond
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
Cited by: 11, Year: 2018
Interpreting and visualizing performance portability metrics
J Sewall, SJ Pennycook, D Jacobsen, T Deakin, S McIntosh-Smith
2020 IEEE/ACM International Workshop on Performance, Portability and …, 2020
Cited by: 7, Year: 2020
Towards a portable and future-proof particle-in-cell plasma physics code
RF Bird, SJ Pennycook, SA Wright, SA Jarvis
Cited by: 7, Year: 2013
Light-weight parallel I/O analysis at scale
SA Wright, SD Hammond, SJ Pennycook, SA Jarvis
European Performance Engineering Workshop, 235-249, 2011
Cited by: 7, Year: 2011
WMTrace-A Lightweight Memory Allocation Tracker and Analysis Framework
OFJ Perks, SD Hammond, SJ Pennycook, SA Jarvis
Cited by: 6, Year: 2011
Navigating performance, portability, and productivity
SJ Pennycook, JD Sewall, DW Jacobsen, T Deakin, S McIntosh-Smith
Computing in Science & Engineering 23 (5), 28-38, 2021
Cited by: 5, Year: 2021
Unveiling the Early Universe: Optimizing Cosmology Workloads for Intel Xeon Phi Coprocessors in an SGI UV2000 System
J Briggs, SJ Pennycook, EPS Shellard, C Martins, M Woodacre, K Feind
Tech. Rep. (SGI/Intel White Paper, 2014), 2014
Cited by: 5, Year: 2014
Model-led optimisation of a geometric multigrid application
R Bunt, S Pennycook, S Jarvis, L Lapworth, Y Ho
2013 IEEE 10th International Conference on High Performance Computing and …, 2013
Cited by: 5, Year: 2013