Henry Wong
Unknown affiliation
Verified email at stuffedcow.net - Homepage
Title
Cited by
Year
Analyzing CUDA workloads using a detailed GPU simulator
A Bakhoda, GL Yuan, WWL Fung, H Wong, TM Aamodt
2009 IEEE international symposium on performance analysis of systems and …, 2009
Cited by: 2040, Year: 2009
Demystifying GPU microarchitecture through microbenchmarking
H Wong, MM Papadopoulou, M Sadooghi-Alvandi, A Moshovos
2010 IEEE International Symposium on Performance Analysis of Systems …, 2010
Cited by: 606, Year: 2010
Comparing FPGA vs. custom CMOS and the impact on processor microarchitecture
H Wong, V Betz, J Rose
Proceedings of the 19th ACM/SIGDA international symposium on Field …, 2011
Cited by: 153, Year: 2011
Pangaea: a tightly-coupled IA32 heterogeneous chip multiprocessor
H Wong, A Bracy, E Schuchman, TM Aamodt, JD Collins, PH Wang, ...
Proceedings of the 17th international conference on Parallel architectures …, 2008
Cited by: 83, Year: 2008
Intel Ivy Bridge Cache Replacement Policy
H Wong
http://blog.stuffedcow.net/2013/01/ivb-cache-replacement/, 2013
Cited by: 49, Year: 2013
Micro-benchmarking the GT200 GPU
MM Papadopoulou, M Sadooghi-Alvandi, H Wong
Computer Group, ECE, University of Toronto, Tech. Rep, 2009
Cited by: 48, Year: 2009
Quantifying the gap between FPGA and custom CMOS to aid microarchitectural design
H Wong, V Betz, J Rose
IEEE Transactions on Very Large Scale Integration (VLSI) Systems 22 (10 …, 2013
Cited by: 25, Year: 2013
A Comparison of Intel's 32nm and 22nm Core i5 CPUs: Power, Voltage, Temperature, and Frequency
H Wong
http://blog.stuffedcow.net/2012/10/intel32nm-22nm-core-i5-comparison/, 2012
Cited by: 17, Year: 2012
Measuring Reorder Buffer Capacity
H Wong
http://blog.stuffedcow.net/2013/05/measuring-rob-capacity/, 2013
Cited by: 13, Year: 2013
High performance instruction scheduling circuits for out-of-order soft processors
H Wong, V Betz, J Rose
2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016
Cited by: 12, Year: 2016
Store-to-Load Forwarding and Memory Disambiguation in x86 Processors
H Wong
http://blog.stuffedcow.net/2014/01/x86-memory-disambiguation/, 2014
Cited by: 12, Year: 2014
Efficient methods for out-of-order load/store execution for high-performance soft processors
H Wong, V Betz, J Rose
2013 International Conference on Field-Programmable Technology (FPT), 442-445, 2013
Cited by: 11, Year: 2013
Microarchitecture and circuits for a 200 MHz out-of-order soft processor memory system
H Wong, V Betz, J Rose
ACM Transactions on Reconfigurable Technology and Systems (TRETS) 10 (1), 1-22, 2016
Cited by: 9, Year: 2016
The performance potential for single application heterogeneous systems
H Wong, TM Aamodt
8th Workshop on Duplicating, Deconstructing, and Debunking, 2009
Cited by: 8, Year: 2009
High-performance instruction scheduling circuits for superscalar out-of-order soft processors
H Wong, V Betz, J Rose
ACM Transactions on Reconfigurable Technology and Systems (TRETS) 11 (1), 1-22, 2018
Cited by: 5, Year: 2018
A superscalar out-of-order x86 soft processor for FPGA
HTH Wong
University of Toronto (Canada), 2017
Cited by: 4, Year: 2017
TLB and Pagewalk Coherence in x86 Processors
H Wong
http://blog.stuffedcow.net/2015/08/pagewalk-coherence/, 2015
Cited by: 4, Year: 2015
The Microarchitecture Behind Meltdown
H Wong
http://blog.stuffedcow.net/2018/05/meltdown-microarchitecture/, 2018
Cited by: 2, Year: 2018
Microbenchmarking Return Address Branch Prediction
H Wong
http://blog.stuffedcow.net/2018/04/ras-microbenchmarks/, 2018
Cited by: 2, Year: 2018
Architectures and limits of GPU-CPU heterogeneous systems
HTH Wong
University of British Columbia, 2008
Cited by: 2, Year: 2008