Browse > Article
http://dx.doi.org/10.3795/KSME-B.2017.41.2.117

An Investigation of the Performance of the Colored Gauss-Seidel Solver on CPU and GPU  

Yoon, Jong Seon (Dept. of Mechanical Engineering, Seoul Nat'l Univ. of Science and Technology)
Jeon, Byoung Jin (Integrative Cardiovascular Imaging Research Center, Yonsei Cardiovascular Center, College of Medicine, Yonsei Univ.)
Choi, Hyoung Gwon (Dept. of Mechanical and Automotive Engineering, Seoul Nat'l Univ. of Science and Technology)
Publication Information
Transactions of the Korean Society of Mechanical Engineers B / v.41, no.2, 2017 , pp. 117-124 More about this Journal
Abstract
The performance of the colored Gauss-Seidel solver on CPU and GPU was investigated for the two- and three-dimensional heat conduction problems by using different mesh sizes. The heat conduction equation was discretized by the finite difference method and finite element method. The CPU yielded good performance for small problems but deteriorated when the total memory required for computing was larger than the cache memory for large problems. In contrast, the GPU performed better as the mesh size increased because of the latency hiding technique. Further, GPU computation by the colored Gauss-Siedel solver was approximately 7 times that by the single CPU. Furthermore, the colored Gauss-Seidel solver was found to be approximately twice that of the Jacobi solver when parallel computing was conducted on the GPU.
Keywords
GPU; CFD; Finite Element Method; Finite Differential Method; Coloring Method; Gauss-Seidel Solver;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Wang, T., Yao, Y., Han, L., Zhang, D. and Zhang, Y., 2009, "Implementation of Jacobi Iterative Method on Graphics Processor Unit," Intelligent Computing and Intelligent Systems, IEEE International Conference, Vol. 3, pp. 324-327.
2 Thibault, J. C. and Senocak, I., 2009, "CUDA Implementation of a Navier-Stokes Solver on Multi-GPU Desktop Platforms for Incompressible Flows," Proceedings of the 47th AIAA aerospace sciences meeting, p. 758.
3 Jacobsen, D. A. and Senocak, I., 2013, "Multi-level Parallelism for Incompressible Flow Computations on GPU Clusters," Parallel Computing, Vol. 39, No. 1, pp. 1-20.   DOI
4 Jacobsen, D. A., Thibault, J. C. and Senocak, I., 2010, "An MPI-CUDA Implementation for Massively Parallel Incompressible Flow Computations on Multi-GPU Clusters," In 48th AIAA aerospace sciences meeting and exhibit, Vol. 16, p. 2.
5 Kubale, M., 2004, Graph Colorings, American Mathematical Society, Rhode Island, pp. 1-20.
6 Hsieh, C. W., Kuo, S. H., Kuo, F. A. and Chou, C. Y., 2010, "Solving Parabolic Problems Using Multithread and GPU," In International Symposium on Parallel and Distributed Processing with Applications, IEEE., pp. 75-80.
7 Kuo, S. H., Chiu, P. H., Lin, R. K. and Lin, Y. T., 2010, "GPU Implementation for Solving Incompressible Two-phase Flows," International Journal of Information and Mathematical Sciences, Vol. 5, pp. 241-249.
8 Li, R. and Saad, Y., 2013, "GPU-Accelerated Preconditioned Iterative Linear Solvers," The Journal of Supercomputing, Vol. 63, pp. 443-466.   DOI
9 Burden, R. L. and Faires, J. D., 2011, Numerical Analysis 9th Edition, Cengage Learning, Boston, pp. 450-459.
10 Cheng, J., Grossman, M. and McKercher, T., 2014, Professional CUDA C Programming, John Wiley&Sons, Inc., New York, p. 31.