Kim Hazelwood

Cited by

	All	Since 2019
Citations	11471	5124
h-index	34	20
i10-index	55	32

1200

600

300

900

20042005200620072008200920102011201220132014201520162017201820192020202120222023202443 83 158 187 235 273 381 480 512 609 574 647 630 677 643 755 908 962 1086 1135 274

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Robert CohnIntelVerified email at intel.com
David BrooksHaley Family Professor of Computer Science, Harvard UniversityVerified email at eecs.harvard.edu
Chris GreggLecturer, Stanford University Department of Computer ScienceVerified email at stanford.edu
Chi-Keung LukMetaVerified email at fb.com
Vijay Janapa ReddiHarvard UniversityVerified email at eecs.harvard.edu
Gu-Yeon WeiRobert and Suzanne Case Professor of EE and CS, Harvard UniversityVerified email at eecs.harvard.edu
Robert MuthVerified email at muth.org
Michael D. SmithHarvard UniversityVerified email at harvard.edu
Kevin SkadronHarry Douglas Forsyth Professor of Computer Science, University of VirginiaVerified email at virginia.edu

Kim Hazelwood

Facebook

Verified email at fb.com - Homepage

systems for ML computer architecture data center performance analysis workload characterization


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Pin: building customized program analysis tools with dynamic instrumentation CK Luk, R Cohn, R Muth, H Patil, A Klauser, G Lowney, S Wallace, ... Acm sigplan notices 40 (6), 190-200, 2005	5649	2005
Applied machine learning at facebook: A datacenter infrastructure perspective K Hazelwood, S Bird, D Brooks, S Chintala, U Diril, D Dzhulgakov, ... 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018	692	2018
Profiling a warehouse-scale computer S Kanev, JP Darago, K Hazelwood, P Ranganathan, T Moseley, GY Wei, ... Proceedings of the 42nd Annual International Symposium on Computer …, 2015	530	2015
Machine learning at facebook: Understanding inference at the edge CJ Wu, D Brooks, K Chen, D Chen, S Choudhury, M Dukhan, ... 2019 IEEE international symposium on high performance computer architecture …, 2019	507	2019
Where is the data? Why you cannot debate CPU vs. GPU performance without the answer C Gregg, K Hazelwood (IEEE ISPASS) IEEE International Symposium on Performance Analysis of …, 2011	414	2011
Mlperf training benchmark P Mattson, C Cheng, G Diamos, C Coleman, P Micikevicius, D Patterson, ... Proceedings of Machine Learning and Systems 2, 336-349, 2020	303	2020
The architectural implications of facebook's dnn-based personalized recommendation U Gupta, CJ Wu, X Wang, M Naumov, B Reagen, D Brooks, B Cottel, ... 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020	269	2020
Sustainable ai: Environmental implications, challenges and opportunities CJ Wu, R Raghavendra, U Gupta, B Acun, N Ardalani, K Maeng, G Chang, ... Proceedings of Machine Learning and Systems 4, 795-813, 2022	249	2022
Deep learning inference in facebook data centers: Characterization, performance optimizations and hardware implications J Park, M Naumov, P Basu, S Deng, A Kalaiah, D Khudia, J Law, P Malani, ... arXiv preprint arXiv:1811.09886, 2018	199	2018
Recnmp: Accelerating personalized recommendation with near-memory processing L Ke, U Gupta, BY Cho, D Brooks, V Chandra, U Diril, A Firoozshahian, ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020	185	2020
Reducing DRAM footprint with NVM in Facebook A Eisenman, D Gardner, I AbdelRahman, J Axboe, S Dong, K Hazelwood, ... Proceedings of the Thirteenth EuroSys Conference, 1-13, 2018	183	2018
Analyzing parallel programs with pin M Bach, M Charney, R Cohn, E Demikhovsky, T Devor, K Hazelwood, ... Computer 43 (3), 34-41, 2010	158	2010
{Fine-Grained} Resource Sharing for Concurrent {GPGPU} Kernels C Gregg, J Dorn, K Hazelwood, K Skadron 4th USENIX Workshop on Hot Topics in Parallelism (HotPar 12), 2012	125	2012
Enabling task parallelism in the cuda scheduler M Guevara, C Gregg, K Hazelwood, K Skadron Workshop on Programming Models for Emerging Architectures 9, 84, 2009	123	2009
Superpin: Parallelizing dynamic instrumentation for real-time performance S Wallace, K Hazelwood International Symposium on Code Generation and Optimization (CGO'07), 209-220, 2007	119	2007
A dynamic binary instrumentation engine for the arm architecture K Hazelwood, A Klauser Proceedings of the 2006 international conference on Compilers, architecture …, 2006	116	2006
Tradeoffs between power management and tail latency in warehouse-scale applications S Kanev, K Hazelwood, GY Wei, D Brooks 2014 IEEE International Symposium on Workload Characterization (IISWC), 31-40, 2014	95	2014
Adaptive online context-sensitive inlining K Hazelwood, D Grove International Symposium on Code Generation and Optimization, 2003. CGO 2003 …, 2003	92	2003
Bandana: Using non-volatile memory for storing deep learning models A Eisenman, M Naumov, D Gardner, M Smelyanskiy, S Pupyrev, ... Proceedings of machine learning and systems 1, 40-52, 2019	87	2019
Dynamic heterogeneous scheduling decisions using historical runtime data C Gregg, M Boyer, K Hazelwood, K Skadron Workshop on Applications for Multi-and Many-Core Processors (A4MMC), 1-12, 2011	87	2011

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors