Browse > Article
http://dx.doi.org/10.5351/KJAS.2017.30.4.555

Representing variables in the latent space  

Huh, Myung-Hoe (Department of Statistics, Korea University)
Publication Information
The Korean Journal of Applied Statistics / v.30, no.4, 2017 , pp. 555-566 More about this Journal
Abstract
For multivariate datasets with large number of variables, classical dimensional reduction methods such as principal component analysis may not be effective for data visualization. The underlying reason is that the dimensionality of the space of variables is often larger than two or three, while the visualization to the human eye is most effective with two or three dimensions. This paper proposes a working procedure which first partitions the variables into several "latent" clusters, explores individual data subsets, and finally integrates findings. We use R pakacage "ClustOfVar" for partitioning variables around latent dimensions and the principal component biplot method to visualize within-cluster patterns. Additionally, we use the technique for embedding supplementary variables to figure out the relationships between within-cluster variables and outside variables.
Keywords
data visualization; clustering of variables; latent variables; principal component analysis; biplot; supplementary variables;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Benzecri, J. P. (1992). Correspondence Analysis Handbook, Marcel Dekker, New York.
2 Chavent, M., Kuentz-Simonet, V., Liquet, B., and Saracco, J. (2012). ClustOfVar: an R package for the clustering of variables, Journal of Statistical Software, 50, 1-16.
3 Chavent, M., Kuentz, V., Liquet, B., and Saracco, J. (2013). Package 'ClustOfVar'. R Foundation for Statistical Computing, URL https://cran.r-project.org/mirrors.html.
4 Gabriel, K. R. (1971). The biplot graphic display of matrices with application to principal component analysis, Biometrika, 58, 453-467.   DOI
5 Vigneau, E. and Chen, M. (2015). Package 'ClustVarLV'. R Foundation for Statistical Computing, from: https://cran.r-project.org/mirrors.html.
6 Vigneau, E., Chen, M., and Qannari, E. M. (2015). ClustVarLV: an R package for the clustering of variables around latent variables, The R Journal, 7, 134-148.
7 Vigneau, E. and Quannari, E. M. (2003). Clustering of variables around latent components, Communications in Statistics - Simulation and Computation, 32, 1131-1150.   DOI