• Title/Summary/Keyword: Principal components analysis (PCA)

Search Result 295, Processing Time 0.036 seconds

Hierarchically penalized sparse principal component analysis (계층적 벌점함수를 이용한 주성분분석)

  • Kang, Jongkyeong;Park, Jaeshin;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.135-145
    • /
    • 2017
  • Principal component analysis (PCA) describes the variation of multivariate data in terms of a set of uncorrelated variables. Since each principal component is a linear combination of all variables and the loadings are typically non-zero, it is difficult to interpret the derived principal components. Sparse principal component analysis (SPCA) is a specialized technique using the elastic net penalty function to produce sparse loadings in principal component analysis. When data are structured by groups of variables, it is desirable to select variables in a grouped manner. In this paper, we propose a new PCA method to improve variable selection performance when variables are grouped, which not only selects important groups but also removes unimportant variables within identified groups. To incorporate group information into model fitting, we consider a hierarchical lasso penalty instead of the elastic net penalty in SPCA. Real data analyses demonstrate the performance and usefulness of the proposed method.

A Study on the Classification of Islands by PCA ( I ) (PCA에 의한 도서분류에 관한 연구( I ))

  • 이강우
    • The Journal of Fisheries Business Administration
    • /
    • v.14 no.2
    • /
    • pp.1-14
    • /
    • 1983
  • This paper considers a classification of the 88 islands located at Kyong-nam area in Korea, using by examples of 12 components of the islands. By means of principal component analysis 2 principle components were extracted, which explained a total of 73.7% of the variance. Using an eigen variable criterion (λ>1), no further principle components were discussed. Principal component 1 and 2 explained 63.4% and 10.3% of the total variance respectively, The representation of the unrelated factor scores along the first and second principal axes produced a new information with respect to the classification of the islands. Based upon the representation, 88 islands were classified into 6 groups i. e. A, B, C, D, E, and F according to similarity of the components among them in this paper. The "Group F" belongs to a miscellaneous assortment that does not fit into the logical category. category.

  • PDF

Thermal Behavior of Langmuir-Blodgett Film of Poly(tert-butyl methacrylate) by Principal Component Analysis Based Two-Dimensional Correlation Spectroscopy

  • Jung, Young-Mee;Kim, Seung-Bin
    • Bulletin of the Korean Chemical Society
    • /
    • v.26 no.12
    • /
    • pp.2027-2032
    • /
    • 2005
  • This paper demonstrates details of thermal behavior of Langmuir-Blodgett (LB) film of poly(tert-butyl methacrylate) (PtBMA) by using the principal component analysis based two-dimensional correlation spectroscopy (PCA2D) through eigenvalue manipulating transformation (EMT). By uniformly lowering the power of a set of eigenvalues associated with the original data, the smaller eigenvalues becomes more prominent and the subtle contribution from minor components is now highlighted much more strongly than the original data. Thus, the subtle difference of thermal behavior of LB film of PtBMA from minor components, which is not readily detectable in the conventional 2D correlation analysis, is much more noticeable than the original data. PCA2D correlation spectra with EMT operation for the temperature-dependent IR spectra of LB film of PtBMA reveal the hidden property of phase transition processes during heating.

Arrow Diagrams for Kernel Principal Component Analysis

  • Huh, Myung-Hoe
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.3
    • /
    • pp.175-184
    • /
    • 2013
  • Kernel principal component analysis(PCA) maps observations in nonlinear feature space to a reduced dimensional plane of principal components. We do not need to specify the feature space explicitly because the procedure uses the kernel trick. In this paper, we propose a graphical scheme to represent variables in the kernel principal component analysis. In addition, we propose an index for individual variables to measure the importance in the principal component plane.

Analysis of Functional Connectivity in Human Working Memory using Positron Emission Tomography and Principal Component Analysis

  • Lee, J.S.;Ahn, J.Y.;Jang, M.J.;Lee, D.S.;Chung, J.K.;Lee, M.C.;Park, K.S.
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1998 no.11
    • /
    • pp.257-258
    • /
    • 1998
  • To reveal the interconnected brain regions involved in human working memory, their functional connectivity was analyzed using principal component analysis (PCA). rCBF PET scans were peformed on 5 normal volunteers during the verbal and visual working memory tasks and PCA was applied. PCA produced the first principal components related with the increase of the difficulty and the second one which demonstrate the dissociation of verbal and visual memory system.

  • PDF

Principal Component Analysis with Coefficient of Variation Matrix (변동계수행렬을 이용한 주성분분석)

  • Kim, Ji-Hyun
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.3
    • /
    • pp.385-392
    • /
    • 2015
  • Principal component analysis (PCA), a dimension-reduction technique, is usually implemented after the variables are standardized when the measurement unit of variables are different. To standardize a variable we divide it by its standard deviation. But there is another way to transform a variable to be independent of its measurement unit. It is to divide it by its mean rather than standard deviation. Implementing PCA on standardized variables is equivalent to implementing PCA with a correlation matrix of original variables. Similarly, implementing PCA on the transformed variables divided by their means is equivalent to implementing PCA with a matrix related to the coefficients of variation of the original variables. We explain why we need to implement PCA on the variables transformed by their means.

Sequential Registration of the Face Recognition candidate using SKL Algorithm (SKL 알고리즘을 이용한 얼굴인식 후보의 점진적 등록)

  • Han, Hag-Yong;Lee, Sung-Mok;Kwak, Boo-Dong;Choi, Won-Tae;Kang, Bong-Soon
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.4
    • /
    • pp.320-325
    • /
    • 2010
  • This paper is about the method and procedure to register the candidate sequentially in the face recognition system using the PCA(Principal Components Analysis). We use the method to update the principal components sequentially with the SKL algorithm which is improved R-SVD algorithm. This algorithm enable us to solve the re-training problem of the increase the candidates number sequentially in the face recognition using the PCA. Also this algorithm can use in robust tracking system with the bright change based to the principal components. This paper proposes the procedure in the face recognition system which sequentially updates the principal components using the SKL algorithm. Then we compared the face recognition performance with the batch procedure for calculating the principal components using the standard KL algorithm and confirms the effects of the forgetting factor in the SKL algorithm experimentally.

Genetic Diversity of Soybean Pod Shape Based on Elliptic Fourier Descriptors

  • Truong Ngon T.;Gwag Jae-Gyun;Park Yong-Jin;Lee Suk-Ha
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.50 no.1
    • /
    • pp.60-66
    • /
    • 2005
  • Pod shape of twenty soybean (Glycine max L. Merrill) genotypes was evaluated quantitatively by image analysis using elliptic Fourier descriptors and their principal components. The closed contour of each pod projection was extracted, and 80 elliptic Fourier coefficients were calculated for each contour. The Fourier coefficients were standardized so that they were invariant of size, rotation, shift, and chain code starting point. Then, the principal components on the standardized Fourier coefficients were evaluated. The cumulative contribution at the fifth principal component was higher than $95\%$, indicating that the first, second, third, fourth, and fifth principal components represented the aspect ratio of the pod, the location of the pod centroid, the sharpness of the two pod tips and the roundness of the base in the pod contour, respectively. Analysis of variance revealed significant genotypic differences in these principal components and seed number per pod. As the principal components for pod shape varied continuously, pod shape might be controlled by polygenes. It was concluded that principal component scores based on elliptic Fourier descriptors yield seemed to be useful in quantitative parameters not only for evaluating soybean pod shape in a soybean breeding program but also for describing pod shape for evaluating soybean germplasm.

Classification of Rural village of Eum-Seong Gun by Amenity investigation base on village (마을단위 어메니티 조사를 통한 음성군 지역의 농촌마을 유형화)

  • Kim, Ji-Hyun;Yoon, Seong-Soo;Rhee, Shin-Ho
    • Proceedings of the Korean Society of Agricultural Engineers Conference
    • /
    • 2005.10a
    • /
    • pp.461-466
    • /
    • 2005
  • The purpose of this study is to classify rural villages through the amenity investigation by a village unit. PCA(Principal component analysis) is used for the classification of rural villages. The principal components of rural villages are deduced scale, population, infrastructure, traffic, education welfare and sightseeing by PCA.

  • PDF

Seasonal Variation and Statistical Analysis of Particulate Pollutants in Urban Air (도시대기립자상물질중 오염성분의 계절적 변동 및 통계적 해석)

  • 이승일
    • Journal of environmental and Sanitary engineering
    • /
    • v.9 no.2
    • /
    • pp.8-23
    • /
    • 1994
  • During the period from Mar., 1991 to Feb., 1992 66 tSP samples were collected by Hi volume air sampler at 1 sampling site in Seoul and the amount of concentration of 21 components(SO$_{4}$$^{2-}$, NO$_{3}$$^{-}$, NH$_{4}$$^{+}$, Cl$^{-}$, Al, Ba, Ca, Cd, Cr, Cu, Fe, It Mg, Mn, Na, Ni, Pt Si, Ti, Zn, Zr ) were measured. And monthly and seasonal variation were surveyed and the principal component analysis( PCA ) were carried out with respect to these amount of pollutants, minimum of visibility and radiation on a horizontal surface. The total amount of soluble ion in water was high in order o(SO$_{4}$$^{2-}$> NO$_{3}$$^{-}$> N%'>Cl$^{-}$ and metal ion was high in order of Na> Ca>Si> Fe> Al> K> Mg> Zn> Pb> Cu>Ti> Mn > Ba> Cr> Zr> Ni> Cd. There was Seasonal variation in concentration for SO$_{4}$$^{2-}$, NH$_{4}$$^{+}$, Cl$^{-}$, Na, Al, Ca, Bt Mg, Fe and Si. It was assumed that the components of the highest concentration on April were depend on yellow sand and the frequency of wind velocity and direction. As the results of PCA, the amount of pollution components was able to characterized with two principal components(Z$_{1}$, Z$_{2}$ ). The first principal components Z$_{1}$ was considered to be a factor indicating the pollutants originated from natural generation and The second principal components Z$_{2}$ was considered to be a factor indicating the pollutants originated from human work. The monthly concentration of pollutants in ISP, minimum of visibility and radiation on a horizontal surface was possible to evaluate by the use of these two principal components Z$_{1}$ and Z$_{2}$ .

  • PDF