• Title/Summary/Keyword: Principal component

Search Result 2,718, Processing Time 0.029 seconds

Principal Component Regression by Principal Component Selection

  • Lee, Hosung;Park, Yun Mi;Lee, Seokho
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.2
    • /
    • pp.173-180
    • /
    • 2015
  • We propose a selection procedure of principal components in principal component regression. Our method selects principal components using variable selection procedures instead of a small subset of major principal components in principal component regression. Our procedure consists of two steps to improve estimation and prediction. First, we reduce the number of principal components using the conventional principal component regression to yield the set of candidate principal components and then select principal components among the candidate set using sparse regression techniques. The performance of our proposals is demonstrated numerically and compared with the typical dimension reduction approaches (including principal component regression and partial least square regression) using synthetic and real datasets.

Assessment and Classification of Korean Indigenous Corn Lines by Application of Principal Component Analysis (주성분분석에 의한 재래종 옥수수의 해석)

  • 이인섭;박종옥
    • Journal of Life Science
    • /
    • v.13 no.3
    • /
    • pp.343-348
    • /
    • 2003
  • This study was conducted to get basic information on the Korean local corn line collected from Busan City and Kyungnam Province, a total of 49 lines were selected and assessed by the principal component analysis method. In the result of principal component analysis for 7 characteristics, 67.4% and 86.3% of total variation could be appreciated by the first two and first four principal components, respectively. Contribution of characteristics to principal component was high at upper principal components and low at lower principal components. Biological meaning of principal component and plant types corresponding to the each principal component were explained clearly by the correlation coefficient between principal component and characteristics. The first principal component appeared to correspond to the size of plant and ear, and the duration of vegetative growing period. The second principal component appeared to correspond to the number of ear and tiller. But the meaning of the third and fourth principal components were not clear.

Arrow Diagrams for Kernel Principal Component Analysis

  • Huh, Myung-Hoe
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.3
    • /
    • pp.175-184
    • /
    • 2013
  • Kernel principal component analysis(PCA) maps observations in nonlinear feature space to a reduced dimensional plane of principal components. We do not need to specify the feature space explicitly because the procedure uses the kernel trick. In this paper, we propose a graphical scheme to represent variables in the kernel principal component analysis. In addition, we propose an index for individual variables to measure the importance in the principal component plane.

Risk Evaluation of Slope Using Principal Component Analysis (PCA) (주성분분석을 이용한 사면의 위험성 평가)

  • Jung, Soo-Jung;Kim, -Yong-Soo;Kim, Tae-Hyung
    • Journal of the Korean Geotechnical Society
    • /
    • v.26 no.10
    • /
    • pp.69-79
    • /
    • 2010
  • To detect abnormal events in slopes, Principal Component Analysis (PCA) is applied to the slope that was collapsed during monitoring. Principal component analysis is a kind of statical methods and is called non-parametric modeling. In this analysis, principal component score indicates an abnormal behavior of slope. In an abnormal event, principal component score is relatively higher or lower compared to a normal situation so that there is a big score change in the case of abnormal. The results confirm that the abnormal events and collapses of slope were detected by using principal component analysis. It could be possible to predict quantitatively the slope behavior and abnormal events using principal component analysis.

Numerical Investigations in Choosing the Number of Principal Components in Principal Component Regression - CASE I

  • Shin, Jae-Kyoung;Moon, Sung-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.8 no.2
    • /
    • pp.127-134
    • /
    • 1997
  • A method is proposed for the choice of the number of principal components in principal component regression based on the predicted error sum of squares. To do this, we approximately evaluate that statistic using a linear approximation based on the perturbation expansion. In this paper, we apply the proposed method to various data sets and discuss some properties in choosing the number of principal components in principal component regression.

  • PDF

Asymptotic Test for Dimensionality in Probabilistic Principal Component Analysis with Missing Values

  • Park, Chong-sun
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.49-58
    • /
    • 2004
  • In this talk we proposed an asymptotic test for dimensionality in the latent variable model for probabilistic principal component analysis with missing values at random. Proposed algorithm is a sequential likelihood ratio test for an appropriate Normal latent variable model for the principal component analysis. Modified EM-algorithm is used to find MLE for the model parameters. Results from simulations and real data sets give us promising evidences that the proposed method is useful in finding necessary number of components in the principal component analysis with missing values at random.

Simple principal component analysis using Lasso (라소를 이용한 간편한 주성분분석)

  • Park, Cheolyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.3
    • /
    • pp.533-541
    • /
    • 2013
  • In this study, a simple principal component analysis using Lasso is proposed. This method consists of two steps. The first step is to compute principal components by the principal component analysis. The second step is to regress each principal component on the original data matrix by Lasso regression method. Each of new principal components is computed as the linear combination of original data matrix using the scaled estimated Lasso regression coefficient as the coefficients of the combination. This method leads to easily interpretable principal components with more 0 coefficients by the properties of Lasso regression models. This is because the estimator of the regression of each principal component on the original data matrix is the corresponding eigenvector. This method is applied to real and simulated data sets with the help of an R package for Lasso regression and its usefulness is demonstrated.

A Study on Selecting Principle Component Variables Using Adaptive Correlation (적응적 상관도를 이용한 주성분 변수 선정에 관한 연구)

  • Ko, Myung-Sook
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.3
    • /
    • pp.79-84
    • /
    • 2021
  • A feature extraction method capable of reflecting features well while mainaining the properties of data is required in order to process high-dimensional data. The principal component analysis method that converts high-level data into low-dimensional data and express high-dimensional data with fewer variables than the original data is a representative method for feature extraction of data. In this study, we propose a principal component analysis method based on adaptive correlation when selecting principal component variables in principal component analysis for data feature extraction when the data is high-dimensional. The proposed method analyzes the principal components of the data by adaptively reflecting the correlation based on the correlation between the input data. I want to exclude them from the candidate list. It is intended to analyze the principal component hierarchy by the eigen-vector coefficient value, to prevent the selection of the principal component with a low hierarchy, and to minimize the occurrence of data duplication inducing data bias through correlation analysis. Through this, we propose a method of selecting a well-presented principal component variable that represents the characteristics of actual data by reducing the influence of data bias when selecting the principal component variable.

Numerical Investigations in Choosing the Number of Principal Components in Principal Component Regression - CASE II

  • Shin, Jae-Kyoung;Moon, Sung-Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.10 no.1
    • /
    • pp.163-172
    • /
    • 1999
  • We propose a cross-validatory method for the choice of the number of principal components in principal component regression based on the magnitudes of correlations with y. There are two different manners in choosing principal components, one is the order of eigenvalues(Shin and Moon, 1997) and the other is that of correlations with y. We apply our method to various data sets and compare results of those two methods.

  • PDF

Catch Specification of Japanese Tuna Purse Seine in the Western Pacific Ocean (서부태평야지역에서 일본 다랑어선망어업의 어획특성)

  • 김형석
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.35 no.3
    • /
    • pp.243-249
    • /
    • 1999
  • Specificity of catches has been analyzed to japanese tuna purse seine A principle component analysis was used to improve the efficiency of fishing and increase sustainable production and productivity of Korean tuna purse seine.The result are as follows;From the principal component analysis of the fish catches, the first principal component(Z1) to promote principal component score was skipjack Kastsuwonus Pelamis, LINNAEUS and yellowfin tuna Thunnus Albacares, BONNATERRE (Small : smaller than 10kg) and proportion was 86.8% of total. The second principal component(Z2) to increase principal component score was yellowfin tuna (Large : larger than 10kg) and proportion was 9.5%.On the other hand, fish operating that have caught skipjack and yellowfin tuna (Small and Larger) was not so much. Fish catches for one species raised volume of the catches while catches for multi-species decreased it since principal composition score for one species and both species together has been increased.Fish school could be divided into three groups of schools each of which was associated with drift objects, payaho and ship, school associated with shark, whale and porpoise and school of breezing, feeding and jumping from proportion of principal component analysis for fish catches of school types. However, the biological pattern is different among school associated with ship, payaho and school associated with drift objects for analysis eigen vector. School associated with ship, payaho and school associated with drifting object associated is judged as school which be assembled to vessel and drifted log temporary.

  • PDF