DOI QR코드

DOI QR Code

WRF Physics Models Using GP-GPUs with CUDA Fortran

WRF 물리 과정의 GP-GPU 계산을 위한 CUDA Fortran 프로그램 구현

  • Kim, Youngtae (Department of Computer Science & Engineering, Gangneung-Wonju National University) ;
  • Lee, Yong Hee (Forecast Research Division, National Institute of Meteorological Research) ;
  • Chung, Kwan-Young (Forecast Research Division, National Institute of Meteorological Research)
  • 김영태 (강릉원주대학교 컴퓨터공학과) ;
  • 이용희 (국립기상연구소 예보연구과) ;
  • 정관영 (국립기상연구소 예보연구과)
  • Received : 2012.12.17
  • Accepted : 2013.03.25
  • Published : 2013.06.30

Abstract

We parallelized WRF major physics routines for Nvidia GP-GPUs with CUDA Fortran. GP-GPUs are originally designed for graphic processing, but show high performance with low electricity for calculating numerical models. In the CUDA environment, a data domain is allocated into thread blocks and threads in each thread block are computing in parallel. We parallelized the WRF program to use of thread blocks efficiently. We validated the GP-GPU program with the original CPU program, and the WRF model using GP-GPUs shows efficient speedup.

Keywords

References

  1. CUDA Fortran Programming Guide and Reference, The Portland Group, 2012.
  2. Michalakes, J., 2000: RSL: A parallel runtime system library for regional atmospheric model with nesting, in Structured Adaptive Mesh Refinement (SAMR) Grid Methods. IMA Volumes in Mathematics and Its Applications, 117, 59-66. https://doi.org/10.1007/978-1-4612-1252-2_4
  3. Michalakes, J., J. Dudhia, D. Grill, T. Henderson, J. Klemp, W. Skamarock, and W. Wang, 2004: The Weather Research and Forecast Model: Software Architecture and Performance. Proceeding of the 11th ECMWF Workshop on the Use of high Performance Computing in Meteorology, 25-29.
  4. Michalakes, J.,and M. Vachharajani, 2008: Acceleration of Numerical Weather Prediction, Parallel Processing Letters Vol. 18 No. 4. World Scientific., pp. 531-548. https://doi.org/10.1142/S0129626408003557
  5. Molnár Jr., T. Szakly, R. Mszros, and I. Lagzi, 2010: Air pollution modeling using a graphics processing unit with CUDA. Comput. Geosci., 36(5), 105-112.
  6. Nichols, J., I. Buck, M. Garland, and K. Skadron, 2008: Scalable parallel programming with CUDA, ACM Queue 6(2), 40-53.
  7. NVIDIA CUDA Compute United Device Architecture Reference Manual Version 5.0, NVIDIA Corporation, 2012.
  8. Shin, B. and Y. Kim, 2011: Parallel LU decomposition using multiple threads on GPUs, Korean Society For Internet Information, 12, 131-137.
  9. The Top Trends in High Performance Computing, The Top 500 Report, Top 500 Supercomputer Sites, 23 June 2009.