Duplo: Lifting redundant memory accesses of deep neural networks for GPU tensor cores H Kim, S Ahn, Y Oh, B Kim, WW Ro, WJ Song 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 21 | 2020 |
LAS: Locality-Aware Scheduling for GEMM-Accelerated Convolutions in GPUs H Kim, WJ Song IEEE Transactions on Parallel and Distributed Systems 34 (5), 1479-1494, 2023 | 5 | 2023 |
The nebula benchmark suite: Implications of lightweight neural networks B Kim, S Lee, C Park, H Kim, WJ Song IEEE Transactions on Computers 70 (11), 1887-1900, 2020 | 5 | 2020 |
NOMAD: Enabling Non-blocking OS-managed DRAM Cache via Tag-Data Decoupling Y Kim, H Kim, WJ Song 2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023 | 3 | 2023 |
Operation device of convolutional neural network, operation method of convolutional neural network and computer program stored in a recording medium to execute the method thereof WJ Song, WW Ro, H Kim, AHN Sungwoo, OH Yunho, KIM Bogil US Patent App. 17/752,235, 2022 | 1 | 2022 |