Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU VW Lee, C Kim, J Chhugani, M Deisher, D Kim, AD Nguyen, N Satish, ... Proceedings of the 37th annual international symposium on Computer …, 2010 | 1214 | 2010 |
Baring it all to software: Raw machines E Waingold, M Taylor, D Srikrishna, V Sarkar, W Lee, V Lee, J Kim, ... Computer 30 (9), 86-93, 1997 | 1038 | 1997 |
Sort vs. Hash revisited: fast join implementation on modern multi-core CPUs C Kim, T Kaldewey, VW Lee, E Sedlar, AD Nguyen, N Satish, J Chhugani, ... Proceedings of the VLDB Endowment 2 (2), 1378-1389, 2009 | 439 | 2009 |
FAST: fast architecture sensitive tree search on modern CPUs and GPUs C Kim, J Chhugani, N Satish, E Sedlar, AD Nguyen, T Kaldewey, VW Lee, ... Proceedings of the 2010 ACM SIGMOD International Conference on Management of …, 2010 | 438 | 2010 |
Efficient implementation of sorting on multi-core SIMD CPU architecture J Chhugani, AD Nguyen, VW Lee, W Macy, M Hagog, YK Chen, A Baransi, ... Proceedings of the VLDB Endowment 1 (2), 1313-1324, 2008 | 334 | 2008 |
Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort N Satish, C Kim, J Chhugani, AD Nguyen, VW Lee, D Kim, P Dubey Proceedings of the 2010 ACM SIGMOD International Conference on Management of …, 2010 | 319 | 2010 |
Architecting to achieve a billion requests per second throughput on a single key-value store server platform S Li, H Lim, VW Lee, JH Ahn, A Kalia, M Kaminsky, DG Andersen, ... ACM SIGARCH Computer Architecture News 43 (3), 476-488, 2015 | 164 | 2015 |
Convergence of recognition, mining, and synthesis workloads and its implications YK Chen, J Chhugani, P Dubey, CJ Hughes, D Kim, S Kumar, VW Lee, ... Proceedings of the IEEE 96 (5), 790-807, 2008 | 148 | 2008 |
The raw benchmark suite: Computation structures for general purpose computing J Babb, M Frank, V Lee, E Waingold, R Barua, M Taylor, J Kim, ... Field-Programmable Custom Computing Machines, 1997. Proceedings., The 5th …, 1997 | 137 | 1997 |
CosmoFlow: Using deep learning to learn the universe at scale A Mathuriya, D Bard, P Mendygral, L Meadows, J Arnemann, L Shao, ... SC18: International Conference for High Performance Computing, Networking …, 2018 | 134 | 2018 |
Performing power management in a multicore processor VW Lee, ET Grochowski, D Kim, Y Bai, S Li, NK Mellempudi, ... US Patent App. 14/621,709, 2015 | 129* | 2015 |
Mapping high-fidelity volume rendering for medical imaging to CPU, GPU and many-core architectures M Smelyanskiy, D Holmes, J Chhugani, A Larson, DM Carmean, ... IEEE transactions on visualization and computer graphics 15 (6), 2009 | 113 | 2009 |
Implications of a metric for performance portability SJ Pennycook, JD Sewall, VW Lee Future Generation Computer Systems, 2017 | 108 | 2017 |
Vector instructions to enable efficient synchronization and parallel reduction operations M Smelyanskiy, S Kumar, D Kim, J Chhugani, C Kim, CJ Hughes, VW Lee, ... US Patent 9,513,905, 2016 | 96 | 2016 |
Vector instructions to enable efficient synchronization and parallel reduction operations M Smelyanskiy, S Kumar, D Kim, J Chhugani, C Kim, CJ Hughes, VW Lee, ... US Patent 9,513,905, 2016 | 96 | 2016 |
Vector instructions to enable efficient synchronization and parallel reduction operations M Smelyanskiy, S Kumar, D Kim, J Chhugani, C Kim, CJ Hughes, VW Lee, ... US Patent 9,513,905, 2016 | 96 | 2016 |
Scheduling and partitioning tasks via architecture-aware feedback information A Ozgur, GT Buehrer, AD Nguyen, D Kim, VW Lee, M Smelyanskiy, ... US Patent App. 11/300,809, 2005 | 95 | 2005 |
Implications of I/O for gang scheduled workloads W Lee, M Frank, V Lee, K Mackenzie, L Rudolph Job Scheduling Strategies for Parallel Processing, 215-237, 1997 | 94 | 1997 |
Lattice qcd on intelŽ xeon phitm coprocessors B Joo, DD Kalamkar, K Vaidyanathan, M Smelyanskiy, K Pamnany, ... International Supercomputing Conference, 40-54, 2013 | 88 | 2013 |
Lattice qcd on intelŽ xeon phitm coprocessors B Joo, DD Kalamkar, K Vaidyanathan, M Smelyanskiy, K Pamnany, ... International Supercomputing Conference, 40-54, 2013 | 84 | 2013 |