• Title/Summary/Keyword: Principal component

Search Result 2,723, Processing Time 0.033 seconds

Application of the supplementary principal component analysis for the 1982-1992 Korean Pro Baseball data (89-92 한국 프로야구의 각 팀과 부문별 평균 성적에 대한 추가적 주성분분석의 응용)

  • 최용석;심희정
    • The Korean Journal of Applied Statistics
    • /
    • v.8 no.1
    • /
    • pp.51-60
    • /
    • 1995
  • Given an $n \times p$ data matrix, if we add the $p_s$ variables somewhat different nature than the p variables to this matrix, we have a new $n \times (p+p_s)$ data matrix. Because of these $p_s$ variables, the traditional principal component analysis can't provide its efficient results. In this study, to improve this problem we review the supplementary principal component analysis putting $p_s$ variables to supplementary variable. This technique is based on the algebraic and geometric aspects of the traditional principal component analysis. So we provide a type of statistical data analysis for the records of eight teams and fourteen fields of the 1982-1992 Korean Pro Baseball Data based on the supplementary principal component analysis and the traditional principal component analysis. And we compare the their results.

  • PDF

Classification of honeydew and blossom honeys by principal component analysis of physicochemical parameters

  • Choi, Suk-Ho;Nam, Myoung Soo
    • Korean Journal of Agricultural Science
    • /
    • v.47 no.1
    • /
    • pp.67-81
    • /
    • 2020
  • The physicochemical parameters of honey are used to determine the botanic origin of honey and to specify the composition criteria for honey in regulations and standards. The parameters of honeydew and blossom honeys from Korean beekeepers were determined to investigate whether they complied with the composition criteria for honey in the food code legislated by Korean authority and to establish the parameters which should be subjected to principal component analysis for improved differentiation of honeys. The fructose and glucose contents of the honeydew honey did not comply with the composition criteria. The ash content of the honey was closely correlated with CIE a* and CIE L* The principal component analysis of fructose to glucose ratio, CIE a*, CIE L*, ash content, free acidity, and fructose and glucose contents enabled classification of honeydew, chestnut, multifloral, and acacia honeys. Additional advantage of the principal component analysis (PCA) is that the physicochemical parameters, such as fructose to glucose ratio (F/G) and color, can be determined using the analytical instruments for composition criteria and quality control of honey. This study suggested that composition criteria for honeydew honey should be established in the food code in accordance with international standards. The principal component analysis reported in this study resulted in improved classification of the honeys from Korean beekeepers.

AN EFFICIENT ALGORITHM FOR SLIDING WINDOW BASED INCREMENTAL PRINCIPAL COMPONENTS ANALYSIS

  • Lee, Geunseop
    • Journal of the Korean Mathematical Society
    • /
    • v.57 no.2
    • /
    • pp.401-414
    • /
    • 2020
  • It is computationally expensive to compute principal components from scratch at every update or downdate when new data arrive and existing data are truncated from the data matrix frequently. To overcome this limitations, incremental principal component analysis is considered. Specifically, we present a sliding window based efficient incremental principal component computation from a covariance matrix which comprises of two procedures; simultaneous update and downdate of principal components, followed by the rank-one matrix update. Additionally we track the accurate decomposition error and the adaptive numerical rank. Experiments show that the proposed algorithm enables a faster execution speed and no-meaningful decomposition error differences compared to typical incremental principal component analysis algorithms, thereby maintaining a good approximation for the principal components.

Predicting Korea Pro-Baseball Rankings by Principal Component Regression Analysis (주성분회귀분석을 이용한 한국프로야구 순위)

  • Bae, Jae-Young;Lee, Jin-Mok;Lee, Jea-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.3
    • /
    • pp.367-379
    • /
    • 2012
  • In baseball rankings, prediction has been a subject of interest for baseball fans. To predict these rankings, (based on 2011 data from Korea Professional Baseball records) the arithmetic mean method, the weighted average method, principal component analysis, and principal component regression analysis is presented. By standardizing the arithmetic average, the correlation coefficient using the weighted average method, using principal components analysis to predict rankings, the final model was selected as a principal component regression model. By practicing regression analysis with a reduced variable by principal component analysis, we propose a rank predictability model of a pitcher part, a batter part and a pitcher batter part. We can estimate a 2011 rank of pro-baseball by a predicted regression model. By principal component regression analysis, the pitcher part, the other part, the pitcher and the batter part of the ranking prediction model is proposed. The regression model predicts the rankings for 2012.

Hierarchically penalized sparse principal component analysis (계층적 벌점함수를 이용한 주성분분석)

  • Kang, Jongkyeong;Park, Jaeshin;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.135-145
    • /
    • 2017
  • Principal component analysis (PCA) describes the variation of multivariate data in terms of a set of uncorrelated variables. Since each principal component is a linear combination of all variables and the loadings are typically non-zero, it is difficult to interpret the derived principal components. Sparse principal component analysis (SPCA) is a specialized technique using the elastic net penalty function to produce sparse loadings in principal component analysis. When data are structured by groups of variables, it is desirable to select variables in a grouped manner. In this paper, we propose a new PCA method to improve variable selection performance when variables are grouped, which not only selects important groups but also removes unimportant variables within identified groups. To incorporate group information into model fitting, we consider a hierarchical lasso penalty instead of the elastic net penalty in SPCA. Real data analyses demonstrate the performance and usefulness of the proposed method.

Application of Principal Component Analysis Prior to Cluster Analysis in the Concept of Informative Variables

  • Chae, Seong-San
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.1057-1068
    • /
    • 2003
  • Results of using principal component analysis prior to cluster analysis are compared with results from applying agglomerative clustering algorithm alone. The retrieval ability of the agglomerative clustering algorithm is improved by using principal components prior to cluster analysis in some situations. On the other hand, the loss in retrieval ability for the agglomerative clustering algorithms decreases, as the number of informative variables increases, where the informative variables are the variables that have distinct information(or, necessary information) compared to other variables.

An Analysis of Human Body Shape of Junior High School Girls by Using Plan Potogrammetry (평면사진 계측에 의한 여중생의 체형분석)

  • Kim Kyung Sook;Lee Choon Kye
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.14 no.3 s.35
    • /
    • pp.208-215
    • /
    • 1990
  • The purpose of this study is to provide the fundamental data of a dummy design for more suitable ready made clothing by making a pattern of somatic types and analyzing their morphological characteristics in accordance with different pattern of somatic types. The side view silhouettes of 90 junior high school girls of age $13\~16$ in seoul urban area were measured by means of the plan photographing and the low data were examined by principal component analysis, while the principal component analysis was applied and three components were extracted and then interpreted to explain to variation of the form of the body. Using three components respectively the cluster analysis was carried out and the subject classified into 4 cluster The following outcomes are obtained. . The results of principal component analysis of this study would be turned out the three; 1) The first principal component shows the degree of erectness or stoop of the figure. 2) The second principal component was a stature length or a growth rate. 3) The third principal component was the obesity component. 2. The results of cluster analysis by using three principal component analysis would be turned out the four cluser; 1) Cluster 1 ($29\%$ of the total) is characterized with lower stature. 2) Cluster 2 ($21\%$ of the total) is characterized with backward somatotype, and the highest leg. 3) Cluster 3 ($23\%$ of the total) is thicked back of neck. 4) Cluster 4 ($27\%$ of the total) is characterized with forward somatotype, and highest stature, height.

  • PDF

Observation on the shape of the neck -by principal component analysis of the mesurements- (피복 구성을 위한 경부 형태의 관찰)

  • 이연순
    • Journal of the Ergonomics Society of Korea
    • /
    • v.10 no.2
    • /
    • pp.31-42
    • /
    • 1991
  • To understand the shape of the neck in a view of garment planning, principal component analysis has been appliedto the measurement of the neck. The neck surface development and the cross sections of the neck have been observed. The materials consist of the body mearsurements, the neck surface developments and the cross sec- tions of the necks of a total of 108 korean woman students. The difference between the right side and the left side of the neck has not been reconginiged. But the differenece among the height of the front neck point, that of the side neck point and that of the back neck point has been recognized. 2. The initial 41 items have been found having variety and duplication. So two criteria have been made to solve those problems and the selection of 34 items have been made by each criterion. 3. 43 and 34 items have been compared by means of accumulative ratios of contribution and of clearness within the meaning of principal component. As a result, 34 measurement items have been further anylysis. 4. As a result of principal component analysis on the 34 items, the four principal components have been found obtaines and inter-preted. The four principal components are 1) the thick of the neck, 2) the front neck-line on the waist basic pattern, basic pattern, 3) the shape of the neck surface development, and 4) the back neck-line on the waist basic pattern. 5. According to the graphic informations concerning these principal components, the meaning of these four principal components has been grasped on the visual. As a result, there is a large individual difference in the shape of neck.

  • PDF

A Study on the Vulnerability Assessment for Agricultural Infrastructure using Principal Component Analysis (주성분 분석을 이용한 농업생산기반의 재해 취약성 평가에 관한 연구)

  • Kim, Sung Jae;Kim, Sung Min;Kim, Sang Min
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.55 no.1
    • /
    • pp.31-38
    • /
    • 2013
  • The purpose of this study was to evaluate climate change vulnerability over the agricultural infrastructure in terms of flood and drought using principal component analysis. Vulnerability was assessed using vulnerability resilience index (VRI) which combines climate exposure, sensitivity, and adaptive capacity. Ten flood proxy variables and six drought proxy variables for the vulnerability assessment were selected by opinions of researchers and experts. The statistical data on 16 proxy variables for the local governments (Si, Do) were collected. To identify major variables and to explain the trend in whole data set, principal component analysis (PCA) was conducted. The result of PCA showed that the first 3 principal components explained approximately 83 % and 89 % of the total variance for the flood and drought, respectively. VRI assessment for the local governments based on the PCA results indicated that provinces where having the relatively large cultivation areas were categorized as vulnerable to climate change.

Principal Component Analysis Based Two-Dimensional (PCA-2D) Correlation Spectroscopy: PCA Denoising for 2D Correlation Spectroscopy

  • Jung, Young-Mee
    • Bulletin of the Korean Chemical Society
    • /
    • v.24 no.9
    • /
    • pp.1345-1350
    • /
    • 2003
  • Principal component analysis based two-dimensional (PCA-2D) correlation analysis is applied to FTIR spectra of polystyrene/methyl ethyl ketone/toluene solution mixture during the solvent evaporation. Substantial amount of artificial noise were added to the experimental data to demonstrate the practical noise-suppressing benefit of PCA-2D technique. 2D correlation analysis of the reconstructed data matrix from PCA loading vectors and scores successfully extracted only the most important features of synchronicity and asynchronicity without interference from noise or insignificant minor components. 2D correlation spectra constructed with only one principal component yield strictly synchronous response with no discernible a asynchronous features, while those involving at least two or more principal components generated meaningful asynchronous 2D correlation spectra. Deliberate manipulation of the rank of the reconstructed data matrix, by choosing the appropriate number and type of PCs, yields potentially more refined 2D correlation spectra.