Browse > Article

An Efficient Multidimensional Scaling Method based on CUDA and Divide-and-Conquer  

Park, Sung-In (숭실대학교 컴퓨터학과)
Hwang, Kyu-Baek (숭실대학교 컴퓨터학부)
Abstract
Multidimensional scaling (MDS) is a widely used method for dimensionality reduction, of which purpose is to represent high-dimensional data in a low-dimensional space while preserving distances among objects as much as possible. MDS has mainly been applied to data visualization and feature selection. Among various MDS methods, the classical MDS is not readily applicable to data which has large numbers of objects, on normal desktop computers due to its computational complexity. More precisely, it needs to solve eigenpair problems on dissimilarity matrices based on Euclidean distance. Thus, running time and required memory of the classical MDS highly increase as n (the number of objects) grows up, restricting its use in large-scale domains. In this paper, we propose an efficient approximation algorithm for the classical MDS based on divide-and-conquer and CUDA. Through a set of experiments, we show that our approach is highly efficient and effective for analysis and visualization of data consisting of several thousands of objects.
Keywords
multidimensional scaling; GPGPU; CUDA; unsupervised learning; divide-and-conquer;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Borg, I. and Groenen, P. J. F., Modern Multidimensional Scaling: Theory and Applications, Second Edition, Springer Science+Business Media, New York, NY, USA, 2005.
2 Shashua, A. and Wolf, L., Kernel feature selection with side data using a spectral approach, Proc. of ECCV 2004, pp.39-53, 2004.
3 Yang, T., Liu, J., McMillan, L., and Wang, W., A fast approximation to multidimensional scaling, Proc. of the ECCV 2006 Workshop on Computation Intensive Methods for Computer Vision, 2006.
4 Kirk, D. and Hwu, W.-M., CUDA Textbook, Draft Version, 2009.
5 OpenCL, Khronos group, http://www.khronos.org/opencl.
6 Larrabee, Intel, http://www.intel.com/technology/visual/microarch.htm.
7 CUDA CUBLAS Library Ver.2.3, NVIDIA Corporation, Santa Clara, CA, USA, 2009.
8 CULA tools, EM Photonics, http://www.culatools.com.
9 de Silva, V. and Tenenbaum, J.B., Sparse multidimensional scaling using landmark points, Technical Report, Stanford University, 2004.
10 Faloutsos, C. and Lin, K.-I., FastMap: a fast algorithm for indexing, data-mining and visualization, Proc. of ACM SIGMOD 1995, pp.163-174, 1995.
11 Pechenizkiy, M., Puuronen, S., and Tsymbal, A., The impact of sample reduction on PCA-based feature extraction for supervised learning, Proc. of ACM SAC 2006 Data Mining Track, pp.553-558, 2006.