Browse > Article
http://dx.doi.org/10.9708/jksci.2011.16.2.009

Implementation of Efficient Power Method on CUDA GPU  

Kim, Jung-Hwan (Dept. of Computer Science, Konkuk University)
Kim, Jin-Soo (Dept. of Computer Science, Konkuk University)
Abstract
GPU computing is emerging in high performance application area since it can easily exploit massive parallelism in a way of cost-effective computing. The power method which finds the eigen vector of a given matrix is widely used in various applications such as PageRank for calculating importance of web pages. In this research we made the power method efficiently parallelized on GPU and also suggested how it can be improved to enhance its performance. The power method mainly consists of matrix-vector product and it can be easily parallelized. However, it should decide the convergence of the eigen vector and need scaling of the vector subsequently. Such operations incur several calls to GPU kernels and data movement between host and GPU memories. We improved the performance of the power method by means of reduced calls to GPU kernels, optimized thread allocation and enhanced decision operation for the convergence.
Keywords
power method; GPU computing; GPGPU(General Purpose Computing on GPU); parallel computing; CUDA(Compute Unified Device Architecture;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 S. Ryoo et al., "Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA,'' Proc. 13th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, ACM Press, 2008.
2 S.A. Johnson et al., Apparatus and Method for Imaging Objects with Wavefields, US patent 6,636,584, Patent and Trademark Office, 2003.
3 ju Hwan Kim, Koojoo Kwon, Byeong-Seok Shin, "Large-Scale Ultrasound Volume Rendering using Bricking",Korea Society of Computer Information,No13(7) pp117-126,Dec. 2008
4 Chinmay Karande, Kumar Chellapilla and Reid Andersen, "Speeding up Algorithms on Compressed Web Graphs," Proceedings of the Second ACM International Conference on Web Search and Data Mining, 2009.
5 S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine," Computer Networks and ISDN Systems Vol. 33, No. 3, pp.107-117, 1998.
6 Tianji Wu, Bo Wang, Yi Shan, Feng Yan, Yu Wang and Ningyi Xu, "Efficient PageRank and SpMV Computation on AMD GPUs," 39th International Conference on Parallel Processing, 2010.
7 Imran Patel and John R. Gilbert, "An Empirical Study of the Performance and Productivity of Two Parallel Programming Models," IEEE International Symposium on Parallel and Distributed Processing, 2008.
8 J. D. Z. Bai, J. Dongarra, A. Ruhe and H. van der Vorst, "Templates for the solution of algebraic eigenvalue problems: A practical guide," In Society for Industrial and Applied Mathematics, 2000.
9 Eun-jin Im,"An Efficient Computation of Matrix Triple Products",Korea Society of Computer Information,No11(3) pp141-149,
10 Brian Bradie, A Friendly Introduction to Numerical Analysis, Pearson Prentice Hall, 2006.
11 John Nickolls and William J. Dally "The GPU Computing Era," IEEE Micro, Vol. 30, Issue 2, March-April 2010.
12 Tom R. Halfhill, "Parallel Processing with CUDA," Microprocessor Report, Jan. 2008.
13 NVIDIA CUDA C Programming Guide, Ver. 3.1.1, Nvidia, July 2010.
14 J.A. Anderson, C.D. Lorenz and A. Travesset, "General Purpose Molecular Dynamics Simulations Fully Implemented on Graphics Processing Units," J. Computational Physics, Vol. 227, No. 10, May 2008.
15 T. Brandvik and G. Pullan, "Acceleration of a 3D Euler Solver Using Commodity Graphics Hardware," Proc. 48th AIAA Aerospace Sciences Meeting and Exhibit, AIAA Press, 2008.