Browse > Article
http://dx.doi.org/10.9708/jksci.2014.19.9.021

Exploration of an Optimal Two-Dimensional Multi-Core System for Singular Value Decomposition  

Park, Yong-Hun (School of Electrical, Electronics, and Computer Engineering, University of Ulsan)
Kim, Cheol-Hong (School of Electronics and Computer Engineering, Chonnam National University)
Kim, Jong-Myon (School of Electrical, Electronics, and Computer Engineering, University of Ulsan)
Abstract
Singular value decomposition (SVD) has been widely used to identify unique features from a data set in various fields. However, a complex matrix calculation of SVD requires tremendous computation time. This paper improves the performance of a representative one-sided block Jacoby algorithm using a two-dimensional (2D) multi-core system. In addition, this paper explores an optimal multi-core system by varying the number of processing elements in the 2D multi-core system with the same 400MHz clock frequency and TSMC 28nm technology for each matrix-based one-sided block Jacoby algorithm ($128{\times}128$, $64{\times}64$, $32{\times}32$, $16{\times}16$). Moreover, this paper demonstrates the potential of the 2D multi-core system for the one-sided block Jacoby algorithm by comparing the performance of the multi-core system with a commercial high-performance graphics processing unit (GPU).
Keywords
Design space exploration; singular value decomposition; one-sided block Jacobi; two-dimensional multi-core system;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 Y. Takahashi, Y. Hirota, Y. Yamamoto, "Performance of the block Jacobi method for the symmetric eigenvalue problem on a modern massively parallel computer," in Proceedings of Algoritmy, pp. 151-160, 2012.
2 I, Bethune, J. M. Bull, N. J. Dingle, N. J. Higham, " Performance analysis of asynchronous Jacobi's method implemented in MPI, SHMEM and OpenMP," Manchster Institute for Mathematical Sciences School of Mathematics, 2012.
3 A. Gentile, D. S. Wills, "Portable video supercomputing," IEEE Transactions on Computers, Vol. 53, No. 8, pp. 960-973, 2004.   DOI   ScienceOn
4 S. M. Kang, J. M. Kim, "Multimedia extension instructions and optimal many-core processor architecture exploration for portable ultrasonic image processing," Journal of Korea Society Computer Institute, Vol. 17, No. 8, pp. 1-10, 2012.   과학기술학회마을   DOI   ScienceOn
5 J. Y. Kim. D. K. Shon, J. M. Kim, H. S. Jun "Parallel implementation and performance evaluation of the SIFT algorithm using a many-core processor," Journal of Korea Society Computer Institute, Vol. 18, No. 9, pp. 1-10, 2013.   과학기술학회마을   DOI   ScienceOn
6 J. S. Seo, M. S. Kang, C. H. Kim, J. M. Kim, "Design space exploration of embedded many-core processors for real-time fire feature extraction," Journal of Korea Society Computer Institute, Vol. 18, No. 10, pp. 1-12, 2013.   과학기술학회마을   DOI   ScienceOn
7 Shutao Li, Leyuan Fang, Haitao Yin, "An efficient dictionary learning algorithm and its application to 3-D medical image denoising," IEEE Transactions on Biomedical Engineering, Vol. 59, No. 2, pp. 417-427, February 2012.   DOI   ScienceOn
8 A. Jindal, Mingyan Liu, "Networked computing in wireless sensor networks for structural health monitoring," IEEE Transactions on Networking, Vol. 20, No. 4, pp. 1203-1216, August 2012.   DOI   ScienceOn
9 G. Tang, A. Nehorai, "Stability of low-rank matrix reconstruction: A constrained singular value view," IEEE Transactions on Information Theory Society, Vol. 58, No. 9, 2012.
10 G. H. Golub and C. Reinsch, "Singular value decomposition and least square solutions," Journal of Numerische Mathematik, Vol. 14, No. 5, pp. 403-420, Apr. 1970.   DOI
11 F. G. Yan, M. Jin, X. Qiao, "Low-complexity DOA estimation based on compressed MUSIC and its performance analysis," IEEE Transactions on Signal Processing, Vol. 61, No. 8, pp 1915-1930, 2013.   DOI   ScienceOn
12 R. G. King, M. W. Watson, "System reduction and solution algorithm for singular linear difference systems under rational expectations," Journal of Computational Economics, Vol. 20, pp. 57-86, 2002.   DOI
13 A. Samui, S. R. Samantaray, "Wavelet singular entropy-based islanding detection in distributed generation," IEEE Transaction on Power Delivery, Vol. 28, No. 1, pp. 411-418, January 2013.   DOI   ScienceOn
14 W. Dong, G. Shi, and X. Li, "Nonlocal image restoration with bilateral variance estimation : a low-rank approach," IEEE Transactions on Image Processing, Vol. 22, No. 2, pp. 700-711, 2012.
15 S. C. Chan, Y. J. Chu, Z. G. Zhang, K. M. Tsui, "A NEW variable regularized QR decomposition-based recursive least M-estimate algorithm-performance analysis and acoustic applications," IEEE Transactions on Audio, Speech, and Language Processing, Vol. 21, No. 5, pp. 907-922, May. 2013.   DOI   ScienceOn
16 A Rajwade, A Rangarajan, A Banerjee, "Image denoising using the higher order singular value decomposition," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, No. 4, pp. 849-862, April 2013.   DOI   ScienceOn
17 B. B. Zhou, R. P. Brent, M. Kahn, "A one-sided Jacobi algorithm for the symmetric eigenvalue problem," in Proc. of 3rd Parallel Computing Workshop, 1994.
18 J. Demmel, K. Veselic, "Jacobi's method is more accurate than QR," SIAM Journal on Matrix Analysis and Applications, Vol. 13, No. 4, pp. 1204-1245, 1992.   DOI
19 B. A. Chartres, "Adaptation of the Jacobi method for a computer with magnetic-tape backing store," The Computer Journal, Vol. 5, No. 1, pp. 51-60, 1962.   DOI
20 V. L. Charles, "The block Jacobi method for computing the singular value decomposition," Cornell University, 1985.
21 B. B. Zhou, R. P. Brent, "A parallel ring ordering algorithm for efficient one-sided Jacobi SVD computations," Journal of Parallel and Distributed Computing, Vol. 42, No. 1, pp. 1-10, 1997.   DOI   ScienceOn
22 B. B. Zhou, R. P. Brent, "On parallel implementation of the one-sided Jacobi algorithmfor singular value decompositions," in Proceedings of Euromicro Workshop on Parallel and Distributed Processing, pp. 401-408, 1995.
23 K. Fernando, H. Nicholson, "Identification of linear systems with input and output noise: the Koopmans-Levin method," IEE Proceedings. Control Theory and Applications, Vol. 132, pp. 30-36, 1985.   DOI
24 Ake Bjorck, "A bidiagonalization algorithm for solving large and sparse ill-posed systems of linear equations," Journal of BIT Numerical Mathematics, Vol. 28, pp. 659-670, 1988.   DOI
25 M. Darouach, M. Zasadzinski, S. J. Xu, "Full-order observers for linear systems with unknown inputs," IEEE Transaction on Automatic Control, Vol. 39, No. 3, pp. 606-609, March 1994.   DOI   ScienceOn
26 E. Schmidt, "On the theory of linear and nonlinear integral equations," Journal of Mathematische Annalen, Vol. 65, pp. 370-399, 1907.
27 E. Beltrami, "On bilinear functions," Journal of Mathematics, Vol. 11, pp. 98-106, 1873.
28 C. Jordan, "Memory on bilinear forms," Journal of Pure and Applied Mathematics, Vol. 19, pp. 35-54, 1874.
29 J. J. Sylvester, "A new proof that a general quadric may be reduced to its canonical form (that is, a linear function of squares) by means of a real orthogonal substitution," Messenger of Mathematics, Vol. 19, pp. 1-5, 1889.
30 H. Weyl, "The asymptotic law granting the eigenvalues of linear partial differential equations with an application of the theory of black body radiation," Journal of Mathematische Annalen, Vol. 71, pp. 441-479, 1912.   DOI