• Title/Summary/Keyword: Principal Component Factor Analysis

Search Result 347, Processing Time 0.031 seconds

Independent Component Biplot (독립성분 행렬도)

  • Lee, Su Jin;Choi, Yong-Seok
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.1
    • /
    • pp.31-41
    • /
    • 2014
  • Biplot is a useful graphical method to simultaneously explore the rows and columns of a two-way data matrix. In particular, principal component factor biplot is a graphical method to describe the interrelationship among many variables in terms of a few underlying but unobservable random variables called factors. If we consider the unobservable variables (which are mutually independent and also non-Gaussian), we can apply the independent component analysis decomposing a mixture of non-Gaussian in its independent components. In this case, if we apply the principal component factor analysis, we cannot clearly describe the interrelationship among many variables. Therefore, in this study, we apply the independent component analysis of Jutten and Herault (1991) decomposing a mixture of non-Gaussian in its independent components. We suggest an independent component biplot to interpret the independent component analysis graphically.

A Study on the Shapes of the Neck and the Shoulder in Dressmaking; young wonen age group (의복원형설계를 위한 성인여성 두.견부의 형태분류 -20대 여성을 중심으로-)

  • 김희숙
    • Journal of the Korean Home Economics Association
    • /
    • v.36 no.12
    • /
    • pp.43-54
    • /
    • 1998
  • From the viewpoint of clothing construction, it is necessary to grasp exactly the shapes of the neck and the shouder, such as the line of the neck base, the neck gradient, the shoulder gradient, the shape of the scapular, and the shape of the breast. In this report, factor analysis was applied to 39 items of neck & shoulder level measurements, including stature, weight, but grith, waist girth, to demonstrate the most relevant measurements for collar and bodice pattern designing, and to classify the neck and shoulder level shapes. The subjects investigated were 126 women of the age 20-29. The main results are follows : 1. For factors of body form were extracted by the factor analysis. The 1st principal component can be interpreted as "size" component, the 2nd-3th principal component is "shape" component relating to neck and shoulder level, and the 4th principal component is "shoulder shape" component. 2. With regard to factor loadings, we were able to extract the most relevant measurements for collar and bodice pattern designing. M16, M22, S26, S30, S34, S35, S36, C37, C38, C39.

  • PDF

Comparison of hydrochemical informations of groundwater obtained from two different underground storage systems

  • Lee, Jeonghoon;Kim, Jun-Mo;Chang, Ho-Wan
    • Proceedings of the Korean Society of Soil and Groundwater Environment Conference
    • /
    • 2002.04a
    • /
    • pp.110-113
    • /
    • 2002
  • Statistical- based, principal component analysis (PCA) was applied to chemical data from two underground storage systems containing LPG to assess the usefulness of such technique at the initial stage (Pyeongtaek) or middle stage (Ulsan) of hydrochemical studies. For the first case, both natural and anthropogenic contamination characterize regional groundwater. Saline water buffered by Namyang lake affects as a natural factor, whereas cement grouting influence as an artificial factor. For the second study area, contaminations due to operation of LPG caverns, such as disinfection activity and cement grouting effect, deteriorate groundwater quality. This study indicates that principal component analysis would be particularly useful for summarizing large data set for the purpose of subsurface characterization, assessing their vulnerability to contamination and protecting recharge zones.

  • PDF

Assessment of Water Quality using Multivariate Statistical Techniques: A Case Study of the Nakdong River Basin, Korea

  • Park, Seongmook;Kazama, Futaba;Lee, Shunhwa
    • Environmental Engineering Research
    • /
    • v.19 no.3
    • /
    • pp.197-203
    • /
    • 2014
  • This study estimated spatial and seasonal variation of water quality to understand characteristics of Nakdong river basin, Korea. All together 11 parameters (discharge, water temperature, dissolved oxygen, 5-day biochemical oxygen demand, chemical oxygen demand, pH, suspended solids, electrical conductivity, total nitrogen, total phosphorus, and total organic carbon) at 22 different sites for the period of 2003-2011 were analyzed using multivariate statistical techniques (cluster analysis, principal component analysis and factor analysis). Hierarchical cluster analysis grouped whole river basin into three zones, i.e., relatively less polluted (LP), medium polluted (MP) and highly polluted (HP) based on similarity of water quality characteristics. The results of factor analysis/principal component analysis explained up to 83.0%, 81.7% and 82.7% of total variance in water quality data of LP, MP, and HP zones, respectively. The rotated components of PCA obtained from factor analysis indicate that the parameters responsible for water quality variations were mainly related to discharge and total pollution loads (non-point pollution source) in LP, MP and HP areas; organic and nutrient pollution in LP and HP zones; and temperature, DO and TN in LP zone. This study demonstrates the usefulness of multivariate statistical techniques for analysis and interpretation of multi-parameter, multi-location and multi-year data sets.

Evaluation of the Geum River by Multivariate Analysis: Principal Component Analysis and Factor Analysis (다변량분석법을 이용한 금강 유역의 수질오염특성 연구)

  • Kim, Mi-Ah;Lee, Jae-kwan;Zoh, Kyung-Duk
    • Journal of Korean Society on Water Environment
    • /
    • v.23 no.1
    • /
    • pp.161-168
    • /
    • 2007
  • The main aim of this work is focus on the Geum river water quality evaluation of pollution data obtained by monitoring measurement during the period 2001-2005. The complex data matrix 19 (entire monitoring stations)*13 (parameters), 60 (month)*13 (parameters) and 20 (season)*13 (parameters) were treated with different multivariate techniques such as factor analysis/principal component analysis (FA/PCA). FA/PCA identified two factor (19*13) classified pollutant Loading factor (BOD, COD, pH, Cond, T-N, T-P, $NH_3$-N, $NO_3$-N, $PO_4$-P, Chl-a), seasonal factor (water temp, SS) and three Factor (60*13, 20*13) classified pollutant Loading factor (BOD, COD, Cond, T-N, T-P, $NH_3$-N, $NO_3$-N, $PO_4$-P), seasonal factor (water temp, SS) and metabolic factor (Chl-a, pH). Loadings of pollutant factor is potent influence main factor in the Geum river which is explained by loadings of pollutant factor at whole sampling stations (71.16%), month (52.75%) and season (56.57%) of main water quality stations. Result of this study is that pollutant loading factor is affected at Gongju 1, 2, Buyeo 1, 2, Gangkyeong, Yeongi stations by entire stations and entire month (Gongju 1, Cheongwon stations), April, May, July and August (buyeo 1) by month. Also the pollutant Loading factor is season gives an influence in winter (Gongju 1, buyeo 1) from main sampling stations, but Cheongwon characteristic is non-seasonal influenced. This study presents necessity and usefulness of multivariate statistic techniques for evaluation and interpretation of large complex data set with a view to get better information data effective management of water sources.

Application of Regression Analysis Model to TOC Concentration Estimation - Osu Stream Watershed - (회귀분석에 의한 TOC 농도 추정 - 오수천 유역을 대상으로 -)

  • Park, Jinhwan;Moon, Myungjin;Han, Sungwook;Lee, Hyungjin;Jung, Soojung;Hwang, Kyungsup;Kim, Kapsoon
    • Journal of Environmental Impact Assessment
    • /
    • v.23 no.3
    • /
    • pp.187-196
    • /
    • 2014
  • The objective of this study is to evaluate and analyze Osu stream watershed water environment system. The data were collected from January 2009 to December 2011 including water temperature, pH, DO, EC, BOD, COD, TOC, SS, T-N, T-P and discharge. The data were used for principle component analysis and factor analysis. The results are as followes. The primary factors obtained from both the principal component analysis and the factor analysis were BOD, COD, TOC, SS and T-P. Once principal component analysis and factor analysis have been performed with the collected data and then the results will be applied to both simple regression model and multiple regression model. The regression model was developed into case 1 using concentrations of water quality parameters and case 2 using delivery loads. The value of the coefficient of determination on case 1 fell between 0.629 and 0.866; this was lower than case 2 value which fell between 0.946 and 0.998. Therefore, case 2 model would be a reliable choice.The coefficient of determination between the estimated figure using data which was developed to the regression model in 2012 and the actual measurement value was over 0.6, overall. It can be safely deduced that the correlation value between the two findings was high. The same model can be applied to get TOC concentrations in future.

Cluster Analysis with Air Pollutants and Meteorological Factors in Seoul

  • Kim, Jae-Hee;Lim, Ji-Won
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.4
    • /
    • pp.773-787
    • /
    • 2003
  • Principal component analysis, factor analysis and cluster analysis have been performed to analyze the relationship between air pollutants and meteorological variables measured in 1999 in Seoul. In principal analysis, the first principal has been shown the contrast effect between $O_3$ and the other pollutants, the second principal has been shown the contrast effect between CO, $SO_2$, $NO_2$ and $O_3$, PM10, TSP. In factor analysis, the first factor has been found as PM10, TSP, $NO_2$ concentrations which are related with suspended particulates. As a result of cluster analysis, three clusters respectively have represented different air pollution levels, seasonal characteristics of air pollutants and meteorological situations.

  • PDF

Assessment of water quality variations under non-rainy and rainy conditions by principal component analysis techniques in Lake Doam watershed, Korea

  • Bhattrai, Bal Dev;Kwak, Sungjin;Heo, Woomyung
    • Journal of Ecology and Environment
    • /
    • v.38 no.2
    • /
    • pp.145-156
    • /
    • 2015
  • This study was based on water quality data of the Lake Doam watershed, monitored from 2010 to 2013 at eight different sites with multiple physiochemical parameters. The dataset was divided into two sub-datasets, namely, non-rainy and rainy. Principal component analysis (PCA) and factor analysis (FA) techniques were applied to evaluate seasonal correlations of water quality parameters and extract the most significant parameters influencing stream water quality. The first five principal components identified by PCA techniques explained greater than 80% of the total variance for both datasets. PCA and FA results indicated that total nitrogen, nitrate nitrogen, total phosphorus, and dissolved inorganic phosphorus were the most significant parameters under the non-rainy condition. This indicates that organic and inorganic pollutants loads in the streams can be related to discharges from point sources (domestic discharges) and non-point sources (agriculture, forest) of pollution. During the rainy period, turbidity, suspended solids, nitrate nitrogen, and dissolved inorganic phosphorus were identified as the most significant parameters. Physical parameters, suspended solids, and turbidity, are related to soil erosion and runoff from the basin. Organic and inorganic pollutants during the rainy period can be linked to decayed matters, manure, and inorganic fertilizers used in farming. Thus, the results of this study suggest that principal component analysis techniques are useful for analysis and interpretation of data and identification of pollution factors, which are valuable for understanding seasonal variations in water quality for effective management.

Varietal Classification by Multivariate Analysis on Quantitative Traits in Pecan

  • Shin, Dong-Young;Nou, Ill-Sup
    • Plant Resources
    • /
    • v.2 no.2
    • /
    • pp.75-80
    • /
    • 1999
  • Twenty two varieties of pecan including wild types were classified based on 6 characters measured by principal component analysis score distance. The results are summarized as fellow. Twenty two varieties were classified into 5 groups based in PCA score distance. Five groups were distinctly characterized by many morphological characters. Total variation could be explained by 51%, 95%, 99% with first, third and fifth principal components respectively. Varimax rotation of the factor loading of the first factors indicated that the first component was highly loaded with leaf characters, the second component with fruit characters, but fruit length was negative loaded. The second, the third and the fourths groups of cultivars had very close genetic parentage similarity.

  • PDF

THE ANALYSIS AND DIAGNOSIS OF SOWN PASTURE VEGETATION 2. GROUPING AND CHARACTERIZATION THE SOWN AND WEED SPECIES BY MEANS OF PRINCIPAL COMPONENT ANALYSIS

  • Kawanabe, S.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.4 no.3
    • /
    • pp.245-250
    • /
    • 1991
  • Analysis of the characteristics and the grouping of the species of sown and weeds in artificial pastures was studied applying the principal component analysis method. Presency and coverage of six sown species and fifteen weed species which occurred in pastures of under-grazing and optimumgrazing were subject to analysis. From field survey, species were divided into three groups: the group A included five species such as Festuca arundinacea, Lolium perenne and Dactylis glomerata, etc., the group B included eleven species such as Polygonum longisetum, Agrostis alba and Rumex obtusifolius, etc., and the group C included five species such as Miscanthus sinensis, Rubus palmatus and Artemisia princeps, etc. The group A species corresponded to good pasture conditions and management. On the contrary, the group C species occurred in poor pasture conditions with inadequate management. The group B species corresponded to intermediate pasture conditions and management. Interrelated pair species co-existing and species non-co-existing were discovered. Factor loading as negative for the group A species. positive for the group C species and positive but lower than the group C species for the group B species. From these results it is concluded that the principal component analysis seems to one of the useful tools for the analysis of characteristics of species and the diagnosis of sown pasture vegetation, although further studies are required to get more general information about species characteristics.