DOI QR코드

DOI QR Code

Accelerating Numerical Analysis of Reynolds Equation Using Graphic Processing Units

그래픽처리장치를 이용한 레이놀즈 방정식의 수치 해석 가속화

  • Myung, Hun-Joo (Supercomputing Center, Korea Institute of Science and Technology Information (KISTI)) ;
  • Kang, Ji-Hoon (Supercomputing Center, Korea Institute of Science and Technology Information (KISTI)) ;
  • Oh, Kwang-Jin (Supercomputing Center, Korea Institute of Science and Technology Information (KISTI))
  • 명훈주 (한국과학기술정보연구원 슈퍼컴퓨팅센터) ;
  • 강지훈 (한국과학기술정보연구원 슈퍼컴퓨팅센터) ;
  • 오광진 (한국과학기술정보연구원 슈퍼컴퓨팅센터)
  • Received : 2012.05.15
  • Accepted : 2012.06.30
  • Published : 2012.08.31

Abstract

This paper presents a Reynolds equation solver for hydrostatic gas bearings, implemented to run on graphics processing units (GPUs). The original analysis code for the central processing unit (CPU) was modified for the GPU by using the compute unified device architecture (CUDA). The red-black Gauss-Seidel (RBGS) algorithm was employed instead of the original Gauss-Seidel algorithm for the iterative pressure solver, because the latter has data dependency between neighboring nodes. The implemented GPU program was tested on the nVidia GTX580 system and compared to the original CPU program on the AMD Llano system. In the iterative pressure calculation, the implemented GPU program showed 20-100 times faster performance than the original CPU codes. Comparison of the wall-clock times including all of pre/post processing codes showed that the GPU codes still delivered 4-12 times faster performance than the CPU code for our target problem.

Keywords

References

  1. 정영훈, "OpenMP 병렬프로그래밍," Chap. 1, pp. 4-7, 프리렉, 대한민국, 2011.
  2. The OpenMP API Specification for parallel programming, http://www.openmp.org.
  3. The Message Passing Interface (MPI) standard, http://www.mcs.ahl.gov/research/projects/mpi.
  4. Open MPI: Open Source High Performance Computing, http://www.open-mpi.org.
  5. Kirk, D.B. and Hwu, W.W., "Programming Massively Parallel Processors: Hands-on Approach," Chap.1, pp. 19-36, Elsevier, 2010.
  6. Stone, J.E., Hardy, D.J., Ufimtsev, I. S., and Schulten, K., "GPU-accelerated Molecular Modeling Coming of age," Journal of Molecular Graphics and Modeling, Vol. 29, No. 2, pp.116-125, 2010. https://doi.org/10.1016/j.jmgm.2010.06.010
  7. Surkov, V., "Parallel Option Pricing with Fourier Space Time-stepping Method on Graphics Processsing Units," Parallel Computing, Vol. 36, No. 7, pp. 372-380, 2010. https://doi.org/10.1016/j.parco.2010.02.006
  8. Tolke, J. and Krafczyk, M., "TefaFLOP Computing on a Desktop PC with GPUs for 3D CFD," International Journal of Computational Fluid Dynamics, Vol. 22, No. 7, pp. 443-456, 2008. https://doi.org/10.1080/10618560802238275
  9. Browell, R. and Hutchings, B., "Bigger, Better, Faster: HPC Technology Leadership," ANSYS Advantage, Vol. 5, No. 3, pp. 6-8, 2011.