SnuCL: An OpenCL Programming Environment for Heterogeneous Manycore Clusters

  • Published : 2014.05.16
