• Title/Summary/Keyword: factor analysis(PCA: principal component analysis)

A dimensional reduction method in cluster analysis for multidimensional data: principal component analysis and factor analysis comparison (다차원 데이터의 군집분석을 위한 차원축소 방법: 주성분분석 및 요인분석 비교)

  • Hong, Jun-Ho;Oh, Min-Ji;Cho, Yong-Been;Lee, Kyung-Hee;Cho, Wan-Sup
    • The Journal of Bigdata / v.5 no.2 / pp.135-143 / 2020
  • This paper proposes a pre-processing method and a dimension reduction method for shopping-cart analysis, where many variables are correlated, when segmenting consumer types in agri-food consumer panel data. Cluster analysis is widely used to divide the observations in multivariate data into several clusters. However, when several variables are related, clustering after dimension reduction may be more effective. In this paper, food consumption survey data from 1,987 households were clustered using the K-means method, and 17 variables were re-selected for the clustering. Principal component analysis and factor analysis were compared as remedies for multicollinearity and as ways to reduce dimensions before clustering. Both methods reduced the dataset to two dimensions. Although principal component analysis divided the dataset into three clusters, the differences among the cluster characteristics were not clear. Under factor analysis, however, the clusters were well distinguished by their consumption patterns.

Varietal Classification by Multivariate Analysis on Quantitative Traits in Pecan

  • Shin, Dong-Young;Nou, Ill-Sup
    • Plant Resources / v.2 no.2 / pp.75-80 / 1999
  • Twenty-two varieties of pecan, including wild types, were classified by principal component analysis (PCA) score distance based on 6 measured characters. The results are summarized as follows. The twenty-two varieties were classified into 5 groups based on PCA score distance. The five groups were distinctly characterized by many morphological characters. Cumulatively, the principal components up to the first, third, and fifth explained 51%, 95%, and 99% of the total variation, respectively. Varimax rotation of the factor loadings of the leading factors indicated that the first component was highly loaded with leaf characters and the second component with fruit characters, although fruit length was negatively loaded. The cultivars in the second, third, and fourth groups had very similar genetic parentage.
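The cumulative percentages quoted in this abstract are shares of total variance accumulated over successive principal components. A minimal sketch of that calculation follows, assuming the traits are standardized (PCA on the correlation matrix); the function name `cumulative_explained_variance` and the random data are illustrative only and are not the paper's measurements.

```python
import numpy as np

def cumulative_explained_variance(X):
    """Cumulative share of total variance explained by successive principal components."""
    X = np.asarray(X, dtype=float)
    # PCA on standardized traits is equivalent to an eigen-decomposition
    # of the correlation matrix; its eigenvalues are the component variances.
    corr = np.corrcoef(X, rowvar=False)
    eigvals = np.linalg.eigvalsh(corr)[::-1]  # eigvalsh is ascending, so reverse
    return np.cumsum(eigvals) / eigvals.sum()

# Synthetic stand-in: 22 varieties x 6 traits (random numbers, NOT the paper's data).
rng = np.random.default_rng(0)
X = rng.normal(size=(22, 6))
X[:, 1] += 2.0 * X[:, 0]  # make two traits correlated so the first PC dominates

cum = cumulative_explained_variance(X)
print(np.round(cum, 3))
```

Reading `cum[0]`, `cum[2]`, and `cum[4]` gives the shares through the first, third, and fifth components, which is how figures such as 51%, 95%, and 99% would be obtained.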

The Application of SVD for Feature Extraction (특징추출을 위한 특이값 분할법의 응용)

  • Lee Hyun-Seung
    • Journal of the Institute of Electronics Engineers of Korea SP / v.43 no.2 s.308 / pp.82-86 / 2006
  • The design of a pattern recognition system generally involves three aspects: preprocessing, feature extraction, and decision making. Among them, a feature extraction method determines an appropriate lower-dimensional subspace of the original feature space, which reduces the complexity of the system and helps to improve recognition rates. Linear transforms such as principal component analysis, factor analysis, and linear discriminant analysis have been widely used for feature extraction in pattern recognition. This paper shows that singular value decomposition (SVD) can be usefully applied in the feature extraction stage of pattern recognition. A remote sensing problem is used to verify the usefulness of SVD. The experimental result indicates that feature extraction using SVD can improve the recognition rate by about 25% compared with PCA.
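The abstract does not say how SVD is applied, so the following is only a sketch of one common approach: projecting samples onto the top-k right singular vectors of the data matrix. The helper `svd_features` and the random data are assumptions for illustration. When the data are mean-centered, this projection coincides with PCA, which the final lines check numerically; without centering the two differ.

```python
import numpy as np

def svd_features(X, k, center=False):
    """Project samples onto the top-k right singular vectors of the data matrix."""
    X = np.asarray(X, dtype=float)
    if center:
        X = X - X.mean(axis=0)  # with centering, this is PCA
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return X @ Vt[:k].T, s

# Synthetic stand-in: 50 samples x 8 raw features (random numbers).
rng = np.random.default_rng(1)
X = rng.normal(size=(50, 8))

feats, s = svd_features(X, k=3)                    # uncentered truncated SVD
feats_c, s_c = svd_features(X, k=3, center=True)   # centered: same subspace as PCA

# For centered data, squared singular values / (n - 1) equal the
# eigenvalues of the sample covariance matrix.
cov_eigvals = np.linalg.eigvalsh(np.cov(X, rowvar=False))[::-1]
print(np.allclose(s_c**2 / (X.shape[0] - 1), cov_eigvals))
```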

A Study on the Classification of Islands by PCA ( I ) (PCA에 의한 도서분류에 관한 연구( I ))

  • 이강우
    • The Journal of Fisheries Business Administration / v.14 no.2 / pp.1-14 / 1983
  • This paper classifies the 88 islands located in the Kyong-nam area of Korea using 12 component variables of the islands. By means of principal component analysis, 2 principal components were extracted, which together explained 73.7% of the variance. Under the eigenvalue criterion (λ>1), no further principal components were considered. Principal components 1 and 2 explained 63.4% and 10.3% of the total variance, respectively. Plotting the uncorrelated factor scores along the first and second principal axes produced new information for classifying the islands. Based on this representation, the 88 islands were classified into 6 groups (A, B, C, D, E, and F) according to the similarity of their components. "Group F" is a miscellaneous assortment that does not fit into any logical category.
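The retention rule in this abstract (keep components with eigenvalue λ>1, the Kaiser criterion) and the explained-variance total (63.4% + 10.3% = 73.7%) can be sketched as below. The function `kaiser_components` and the synthetic one-factor data are hypothetical and are not the paper's island data.

```python
import numpy as np

def kaiser_components(X):
    """Count components with eigenvalue > 1 and the variance share they explain."""
    corr = np.corrcoef(np.asarray(X, dtype=float), rowvar=False)
    eigvals = np.linalg.eigvalsh(corr)[::-1]   # descending eigenvalues
    keep = int(np.sum(eigvals > 1.0))          # Kaiser criterion
    explained = eigvals[:keep].sum() / eigvals.sum()
    return keep, explained, eigvals

# Synthetic stand-in: 88 observations x 12 variables driven by one shared factor.
rng = np.random.default_rng(2)
latent = rng.normal(size=(88, 1))
X = latent @ np.ones((1, 12)) + 0.5 * rng.normal(size=(88, 12))

keep, explained, eigvals = kaiser_components(X)
print(keep, round(float(explained), 3))
```

Because the eigenvalues of a correlation matrix sum to the number of variables, a component with λ>1 explains more variance than any single standardized variable, which is the rationale for the cutoff.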

Assessment of seasonal variations in water quality of Brahmani river using PCA

  • Mohanty, Chitta R.;Nayak, Saroj K.
    • Advances in environmental research / v.6 no.1 / pp.53-65 / 2017
  • Assessment of seasonal changes in surface water quality is important for evaluating temporal variations of river pollution due to natural or anthropogenic inputs from point and non-point sources. In this study, surface water quality data for 15 physico-chemical parameters, collected from 7 monitoring stations in a river from 2014 to 2016, were analyzed. Principal component analysis was employed to evaluate the seasonal correlations of water quality parameters, while principal factor analysis was used to extract the parameters that are most important in assessing seasonal variations of river water quality. The analysis shows that the parameter contributing most to water quality variation in one season may not be important in another, with the exception of alkalinity, which is the most important contributing parameter in all three seasons.

Estimation of Source Contribution of Particulate Matter in Taegu Area using Factor Analysis (다변량 통계분석법을 이용한 대구지역 부유분진의 오염원 기여도 추정)

  • 최성우;송형도
    • Journal of Environmental Health Sciences / v.26 no.4 / pp.1-8 / 2000
  • The objective of this study was to identify the sources of atmospheric TSP (total suspended particulate matter) and PM-10 (particulate matter with aerodynamic diameter less than 10 μm) in the Taegu area and to estimate their contributions to the measured concentrations. A total of 84 samples were collected from January to December 1999. TSP and PM-10 were collected on filters with a portable air sampler, and heavy metals in TSP and PM-10 were analyzed by ICP (inductively coupled plasma spectrometry) after preliminary treatment. The results were as follows. First, the annual average TSP and PM-10 concentrations were 123 and 69 μg/m³, respectively. Both concentrations were highest in winter. Second, the concentrations of Al, Fe, and Mn were higher in TSP than in PM-10, indicating that these heavy metals are generally associated with natural contributions. Third, high correlations among heavy metal concentrations were found for the following combinations: As Al, Fe and Mn in TSP; Ni, Cr, Cd and Pb in PM-10. Finally, statistical analysis was performed using principal component analysis (PCA) to find possible sources of the pollutants. The factor analysis identified four major sources (soil/road dust resuspension, waste incineration, fuel combustion, and vehicular emission) in each fraction. These sources accounted for at least 83% and 85% of the variance of TSP and PM-10 concentrations, respectively, in the Taegu area.

PCA-Based Feature Reduction for Depth Estimation (깊이 추정을 위한 PCA기반의 특징 축소)

  • Shin, Sung-Sik;Gwun, Ou-Bong
    • Journal of the Institute of Electronics Engineers of Korea CI / v.47 no.3 / pp.29-35 / 2010
  • This paper discusses a method that improves the accuracy of image depth estimation by PCA (principal component analysis) based feature reduction through a learning algorithm. In estimating the depth of an image, features such as pixel energy and gradient are computed, and the features themselves and their relationships are used for depth estimation. In such a case, many features are obtained by various filter operations. If all of the obtained features are used equally without considering their contribution to depth estimation, the efficiency of depth estimation goes down. This paper proposes a method that improves both the accuracy and the processing speed of depth estimation by using PCA to account for each feature's contribution. The experiment shows that the proposed method, using 30% of the feature vector, is more accurate (0.4% on average, 2.5% at most) than using all of the image data in depth estimation.

Evaluation of significant pollutant sources affecting water quality of the Geum River using principal component analysis (주성분분석(PCA) 방법을 이용한 금강 수질의 주요 오염원 영향 평가)

  • Legesse, Natnael Shiferaw;Kim, Jaeyoung;Seo, Dongil
    • Journal of Korea Water Resources Association / v.55 no.8 / pp.577-588 / 2022
  • This study aims to identify the limiting nutrient for algal growth in the Geum River and the significant pollutant sources from the tributaries affecting its water quality, and to provide a management alternative for improving water quality. Eight years of daily data (2013~2020) were collected from the Water Environment Information System (water.nier.go.kr) and the Water Resources Management Information System (wamis.go.kr). Fourteen water quality variables were analyzed at five water quality monitoring stations in the Geum River (WQ1-WQ5). The water quality variables, especially Chl-a, vary greatly in the downstream reach of the river. Under open weir gate operation, TP (total phosphorus) and water temperature greatly influence the growth of algae downstream. A correlation analysis was used to identify the relationships between variables and to investigate the factors affecting algal growth in the Geum River. At the downstream station (WQ5), TP and Temp showed a strong correlation with Chl-a, indicating that they significantly influence algal blooms. Principal component analysis (PCA) was applied to identify and prioritize the major pollutant sources of the two major tributaries of the river, Gab-cheon and Miho-cheon. PCA identified three major pollutant sources for each tributary. For Gab-cheon, wastewater treatment plant, urban, and agricultural pollution are identified as significant pollutant sources. For Miho-cheon, agricultural, urban, and forest land are identified as major pollutant sources. PCA seems to be effective in identifying water pollutant sources for the Geum River and its tributaries in detail and thus can be used to develop water quality management strategies.

Determination of Flood Risk Considering Flood Control Ability and Urban Environment Risk (수방능력 및 재해위험을 고려한 침수위험도 결정)

  • Lee, Eui Hoon;Choi, Hyeon Seok;Kim, Joong Hoon
    • Journal of Korea Water Resources Association / v.48 no.9 / pp.757-768 / 2015
  • Recently, climate change has increased short-duration, locally concentrated rainfall and unexpected heavy rain, which increasingly cause loss of life and property. In this research, arithmetic average analysis, weighted average analysis, and principal component analysis are used to predict flood risk. This research lays a foundation for predicting flood risk based on disaster annals and the status of urban planning. The results obtained by the three methods, using many factors that affect flooding, are compared. Arithmetic average analysis is simple, but it gives every factor the same weight. Weighted average analysis assigns different weights, but the many variables make the correlations among factors complex and cause a multicollinearity problem. To solve these problems, principal component analysis (PCA) is used, because each factor receives a different weight and combining variables yields fewer variables than the other methods. Finally, flood risk is predicted, taking into account the flood control ability and urban environment risk assessed in previous research.
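The contrast drawn here between equal weights and PCA-derived weights can be sketched as follows. The abstract does not state how weights are taken from the PCA, so this sketch assumes one common convention: the squared loadings of the first principal component, which sum to 1. It also assumes every factor is oriented so that larger values mean higher risk. All names and data are hypothetical.

```python
import numpy as np

def standardize(X):
    """Scale each factor to zero mean and unit variance."""
    X = np.asarray(X, dtype=float)
    return (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)

def arithmetic_index(Z):
    """Arithmetic average: every factor gets the same weight 1/p."""
    return Z.mean(axis=1)

def pca_weights(Z):
    """Assumed convention: squared loadings of the first principal component."""
    corr = np.corrcoef(Z, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)  # eigenvalues in ascending order
    first = eigvecs[:, -1]                   # unit-length first component
    return first ** 2                        # non-negative and sums to 1

# Synthetic stand-in: 30 districts x 5 flood-related factors (random numbers).
rng = np.random.default_rng(3)
X = rng.normal(size=(30, 5))
X[:, 1] += X[:, 0]  # two correlated factors, as in a multicollinear setting

Z = standardize(X)
w = pca_weights(Z)
risk_equal = arithmetic_index(Z)
risk_pca = Z @ w
print(np.round(w, 3))
```

Correlated factors load together on the first component, so their shared signal is weighted once through that component rather than counted separately, which is the multicollinearity argument the abstract makes.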

Analysis of Behavioral Traits in Violation related to LPG Accidents (LPG 관련 산재사고의 위반행동 특성 분석)

  • Seung Eon Ham;Hyeon Kyo Lim
    • Journal of the Korean Society of Safety / v.38 no.4 / pp.15-22 / 2023
  • LPG-related accidents, which account for half of all gas accidents in Korea, have not shown any sign of decrease over the past decade, partially owing to the lack of effective safety improvement measures. The purpose of this study was to identify the effectiveness of improvement measures by analyzing the traits of accidents in terms of human factors, and to seek more effective accident prevention strategies. In this study, 108 accident cases were collected and analyzed in the aspect of accident characteristics such as violation type, human factors, and so on. The results showed that the work procedures of suppliers and engineers related to LPG accidents seemed to be similar in outward appearance; however, specific accident causes and unsafe behaviors were different. Particularly, type and target of violations were different, which could be visually confirmed by the Principal Component Analysis (PCA) and the Quantification Techniques (QT). Furthermore, for engineers, insufficient supervision was a major influencing factor. In conclusion, because the accident characteristics of suppliers and engineers are different, differentiated accident prevention strategies should be implemented, which was discussed in this study.