• Title/Summary/Keyword: Data Principal

Search Result 2,078, Processing Time 0.029 seconds

Asymptotic Test for Dimensionality in Probabilistic Principal Component Analysis with Missing Values

  • Park, Chong-sun
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.49-58
    • /
    • 2004
  • In this talk we proposed an asymptotic test for dimensionality in the latent variable model for probabilistic principal component analysis with missing values at random. Proposed algorithm is a sequential likelihood ratio test for an appropriate Normal latent variable model for the principal component analysis. Modified EM-algorithm is used to find MLE for the model parameters. Results from simulations and real data sets give us promising evidences that the proposed method is useful in finding necessary number of components in the principal component analysis with missing values at random.

A Study on the Vulnerability Assessment for Agricultural Infrastructure using Principal Component Analysis (주성분 분석을 이용한 농업생산기반의 재해 취약성 평가에 관한 연구)

  • Kim, Sung Jae;Kim, Sung Min;Kim, Sang Min
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.55 no.1
    • /
    • pp.31-38
    • /
    • 2013
  • The purpose of this study was to evaluate climate change vulnerability over the agricultural infrastructure in terms of flood and drought using principal component analysis. Vulnerability was assessed using vulnerability resilience index (VRI) which combines climate exposure, sensitivity, and adaptive capacity. Ten flood proxy variables and six drought proxy variables for the vulnerability assessment were selected by opinions of researchers and experts. The statistical data on 16 proxy variables for the local governments (Si, Do) were collected. To identify major variables and to explain the trend in whole data set, principal component analysis (PCA) was conducted. The result of PCA showed that the first 3 principal components explained approximately 83 % and 89 % of the total variance for the flood and drought, respectively. VRI assessment for the local governments based on the PCA results indicated that provinces where having the relatively large cultivation areas were categorized as vulnerable to climate change.

Partial Quantification in Principal Component Analysis

  • Hye Sun Suh;Myung Hoe Huh
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.3
    • /
    • pp.637-644
    • /
    • 1997
  • Sometimes, the first principal component may come logically from the established knowledge and premises. For example, for the high school students' test scores of Korean, English, Mathematics, Social Study, and Science, it is natural to define the first principal component as the average of all subject scores. In such cases, we need to respect both the background knowledge and the data exploration. The aim of this study is to find the remaining components in principal component analysis of multivariate data when the first principal component is defined a priori by the researcher. Moreover, we study related matrix decomposition and their application to the graphical display.

  • PDF

Application of the Principal Component Analysis to Evaluate Concrete Condition Using Impact Resonance Test (충격공진을 이용한 콘크리트 상태 평가를 위한 주성분 분석의 적용)

  • Yoon, Young Geun;Oh, Tae Keun
    • Journal of the Korean Society of Safety
    • /
    • v.34 no.5
    • /
    • pp.95-102
    • /
    • 2019
  • Non-destructive methods such as rebound hardness method and ultrasonic method are widely studied for evaluating the physical properties, condition and damage of concrete, but are not suitable for detecting delamination and cracks near the surface due to various constraints of the site as well as the accuracy. Therefore, in this study, the impact resonance method was applied to detect the separation cracks occurring near the surface of the concrete slab and bridge deck. As a next step, the principal component analysis were performed by extracting various features using the FFT data. As a result of principal component analysis, it was analyzed that the reliability was high in distinguishing defects in concrete. This feature extraction and application of principal component analysis can be used as basic data for future use of machine learning technique for the better accuracy.

Hierarchically penalized sparse principal component analysis (계층적 벌점함수를 이용한 주성분분석)

  • Kang, Jongkyeong;Park, Jaeshin;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.135-145
    • /
    • 2017
  • Principal component analysis (PCA) describes the variation of multivariate data in terms of a set of uncorrelated variables. Since each principal component is a linear combination of all variables and the loadings are typically non-zero, it is difficult to interpret the derived principal components. Sparse principal component analysis (SPCA) is a specialized technique using the elastic net penalty function to produce sparse loadings in principal component analysis. When data are structured by groups of variables, it is desirable to select variables in a grouped manner. In this paper, we propose a new PCA method to improve variable selection performance when variables are grouped, which not only selects important groups but also removes unimportant variables within identified groups. To incorporate group information into model fitting, we consider a hierarchical lasso penalty instead of the elastic net penalty in SPCA. Real data analyses demonstrate the performance and usefulness of the proposed method.

Resistant Singular Value Decomposition and Its Statistical Applications

  • Park, Yong-Seok;Huh, Myung-Hoe
    • Journal of the Korean Statistical Society
    • /
    • v.25 no.1
    • /
    • pp.49-66
    • /
    • 1996
  • The singular value decomposition is one of the most useful methods in the area of matrix computation. It gives dimension reduction which is the centeral idea in many multivariate analyses. But this method is not resistant, i.e., it is very sensitive to small changes in the input data. In this article, we derive the resistant version of singular value decomposition for principal component analysis. And we give its statistical applications to biplot which is similar to principal component analysis in aspects of the dimension reduction of an n x p data matrix. Therefore, we derive the resistant principal component analysis and biplot based on the resistant singular value decomposition. They provide graphical multivariate data analyses relatively little influenced by outlying observations.

  • PDF

Principal component analysis for Hilbertian functional data

  • Kim, Dongwoo;Lee, Young Kyung;Park, Byeong U.
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.1
    • /
    • pp.149-161
    • /
    • 2020
  • In this paper we extend the functional principal component analysis for real-valued random functions to the case of Hilbert-space-valued functional random objects. For this, we introduce an autocovariance operator acting on the space of real-valued functions. We establish an eigendecomposition of the autocovariance operator and a Karuhnen-Loève expansion. We propose the estimators of the eigenfunctions and the functional principal component scores, and investigate the rates of convergence of the estimators to their targets. We detail the implementation of the methodology for the cases of compositional vectors and density functions, and illustrate the method by analyzing time-varying population composition data. We also discuss an extension of the methodology to multivariate cases and develop the corresponding theory.

ECG based Personal Authentication using Principal Component Analysis (주성분 분석기법을 이용한 심전도 기반 개인인증)

  • Cho, Ju-Hee;Cho, Byeong-Jun;Lee, Dae-Jong;Chun, Myung-Geun
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.66 no.4
    • /
    • pp.258-262
    • /
    • 2017
  • The PCA(Principal Component Analysis) algorithm is widely used as a technique of expressing the eigenvectors of the covariance matrix that best represents the characteristics of the data and reducing the high dimensional vector to a low dimensional vector. In this paper, we have developed a personal authentication method based on ECG using principal component analysis. The proposed method showed excellent recognition performance of 98.2 [%] when it was experimented using electrocardiogram data obtained at weekly intervals. Therefore, it can be seen that it is useful for personal authentication by reducing the dimension without changing the information on the variability and the correlation set variable existing in the electrocardiogram data by using the principal component analysis technique.

High Resolution AR Spectral Estimation by Principal Component Analysis (Principal Componet Analysis에 의한 고 분해능 AR 모델링과 스텍트럼 추정)

  • 양흥석;이석원;공성곤
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.36 no.11
    • /
    • pp.813-818
    • /
    • 1987
  • In this paper, high resolution spectral estimation by AR modelling and principal comonent analysis is proposed. The given data can be expanded by the eigenvectors of the estimated covariance matrix. The eigenspectrum is obtained for each eigenvector using the Autoressive(AR) spectral estimation technique. The final spectrum estimate is obtained by weighting each eigenspectrum with the corresponding eigenvalue and summing them. Although the proposed method increases in computational complexity, it shows good frequency resolution especially for short data records and narrow-band data whose signal-to-noise ratio is low.

On principal component analysis for interval-valued data (구간형 자료의 주성분 분석에 관한 연구)

  • Choi, Soojin;Kang, Kee-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.61-74
    • /
    • 2020
  • Interval-valued data, one type of symbolic data, are observed in the form of intervals rather than single values. Each interval-valued observation has an internal variation. Principal component analysis reduces the dimension of data by maximizing the variance of data. Therefore, the principal component analysis of the interval-valued data should account for the variance between observations as well as the variation within the observed intervals. In this paper, three principal component analysis methods for interval-valued data are summarized. In addition, a new method using a truncated normal distribution has been proposed instead of a uniform distribution in the conventional quantile method, because we believe think there is more information near the center point of the interval. Each method is compared using simulations and the relevant data set from the OECD. In the case of the quantile method, we draw a scatter plot of the principal component, and then identify the position and distribution of the quantiles by the arrow line representation method.