Gangwon Jo

Ph.D. Candidate
Multicore Computing Research Laboratory
Department of Computer Science and Engineering
Seoul National University
Seoul 151-744, Korea

Research Interests

Research Activities

Current

Previous

Education

Publications

International

  1. Gangwon Jo, Jeongho Nah, Jun Lee, Jungwon Kim, and Jaejin Lee. Accelerating LINPACK with MPI-OpenCL on Clusters of Multi-GPU Nodes, To appear in IEEE Transactions on Parallel and Distributed Systems, 2014.
  2. Gangwon Jo, Won Jong Jeon, Wookeun Jung, Gordon Taft, and Jaejin Lee. OpenCL Framework for ARM Processors with NEON Support, In Proceedings of the 2014 Workshop on Programming Models for SIMD/Vector Processing (WPMVP), 2014.
  3. Sangmin Seo, Jun Lee, Gangwon Jo, and Jaejin Lee. Automatic OpenCL Work-Group Size Selection for Multicore CPUs, In Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT), 2013.
  4. Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, and Jaejin Lee. SnuCL: an OpenCL Framework for Heterogeneous CPU/GPU Clusters, In Proceedings of the 26th International Conference on Supercomputing (ICS), 2012.
  5. Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, and Jaejin Lee. OpenCL as a Unified Programming Model for Heterogeneous CPU/GPU Clusters, Poster presentation in Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2012.
  6. Sangmin Seo, Gangwon Jo, and Jaejin Lee. Performance Characterization of the NAS Parallel Benchmarks in OpenCL, In Proceedings of the 2011 IEEE International Symposium on Workload Characterization (IISWC), 2011.
  7. Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, and Jaejin Lee. OpenCL as a Programming Model for GPU Clusters, In Proceedings of the 24th International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2011.

Domestic

  1. Junghyun Kim, Jungho Park, Gangwon Jo, Thanh Tuan Dao, Jinyoung Joo, Jaehoon Jung, Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, and Jaejin Lee. SnuCL: OpenCL Programming Environment for Heterogeneous Manycore Clusters, Communications of KIISE 32(5), 2014.
  2. Gangwon Jo, Sangmin Seo, Jeongho Nah, Jungwon Kim, Junghyun Kim, Jun Lee, Jungho Park, Yong-Jun Lee, Hongjune Kim, Sooyeon Kang, Jinyoung Joo, Seonmyeong Park, Wookeun Jung, Kihyun Im, and Jaejin Lee. Trends on Heterogeneous Supercomputers and a Case Study on the Development of a Supercomputer Chundoong, Communications of KIISE 31(4), 2013.
  3. Jeongho Nah, Gangwon Jo, Sooyeon Kang, Wookeun Jung, and Jaejin Lee. Design and Implementation of Virtual Machines as an Aid in Teaching Computer Concepts, Poster presentation in Korea Computer Congress 2012 (KCC), 2012.
  4. Jeongho Nah, Honggyu Kim, Hongjune Kim, Gangwon Jo, and Jaejin Lee. Implementation of Register Allocator for JavaScript JIT Compiler, 2011 KIISE Fall Conference, 2011.
  5. Hongjune Kim, Joo Hwan Lee, Gangwon Jo, and Jaejin Lee. Measuring JavaScript Performance with a Real World Web Application, 2011 KIISE Fall Conference, 2011.
  6. Gangwon Jo, Hongjune Kim, Joo Hwan Lee, Jeongho Nah, and Jaejin Lee. Alias Analysis for JavaScript Program Optimization, Poster presentation in Korea Computer Congress 2011 (KCC), 2011.

Experience

Honors and Awards

Supercomputers

Anyone can build a fast CPU. The trick is to build a fast system. - Seymour Cray

SnuCore

SnuCore is a 16-node experimental CPU/GPU cluster built in November 2011. Each node of SnuCore contains two 12-core AMD Opteron 6172 CPUs and three dual-GPU AMD Radeon HD 6990 graphics cards (i.e., 6 GPUs per node). We optimized HPL (High Performance Linpack) for multi-GPU nodes and achieved 15.9 TFLOPS (991 GFLOPS per node).

Chundoong

Chundoong (Korean: 천둥; IPA: [cʰən.duŋ]) is a self-made 56-node heterogeneous supercomputer built in October 2012. The word 'chundoong' means thunder in Korean. Each node of Chundoong contains two 8-core Intel Xeon E5-2650 CPUs and four AMD Radeon HD 7970 GPUs. The CPUs and GPUs are cooled by a self-made water cooling system. Chundoong achieved 106.8 TFLOPS (1.907 TFLOPS per node) on the Linpack benchmark. It ranked #277 on the Top500 list and #32 on the Green500 list of November 2012.

Links