• Title/Summary/Keyword: 주성분 요인

Search Result 277, Processing Time 0.028 seconds

Extract the main factors related to ground subsidence near abandoned underground coal mine using PCA (PCA 기법을 이용한 폐탄광 지역의 지반침하 관련 요인 추출)

  • Choi, Jong-Kuk;Kim, Ki-Dong
    • Proceedings of the KSRS Conference
    • /
    • 2007.03a
    • /
    • pp.301-304
    • /
    • 2007
  • 본 연구에서는 폐탄광 지역에서 발생하는 지반침하에 영향을 주는 주요 요인들을 추출하기 위하여 다변량 통계분석 방법의 하나인 주성분분석(Principle Component Analysis : PCA)기법과 지리정보시스템 (Geographic Information System : GIS)을 이용하였다. 이를 위해 연구지역에서 수행한 지표지질조사, 정밀조사, 실내암석시험 등으로부터 취득된 자료를 데이터베이스로 구축하고, 지반침하 위험지역 분포를 공간적으로 해석할 수 있는 지질, 토지이용, 경사도, 지표로부터 지하 갱도까지의 심도, 갱도의 지표상 위치로부터의 수평거리, 지하수심도, 투수계수, RMR(Rock Mass Rating) 값을 분석대상으로 선정하였다. 각 요인들이 연구지역 전체에 걸쳐 분포하도록 GIS의 공간분석 기법의 하나인 표면분석(Surface Analysis), 버퍼링기법(Buffering) 및 내삽법(Interpolation)을 이용하여 래스터 데이터베이스로 구축하고 이로부터 추출된 자료들을 입력값으로 하는 주성분분석을 수행하였다. 주성분분석 결과 폐탄광 지역의 지반침하에 영향을 주는 주요인을 추출하는 것이 가능하였으며, 연구지역은 지질 및 지반강도 관련 요인이 침하발생의 가장 큰 요인인 것으로 분석되었다.

  • PDF

A Comparative Study on Factor Recovery of Principal Component Analysis and Common Factor Analysis (주성분분석과 공통요인분석에 대한 비교연구: 요인구조 복원 관점에서)

  • Jung, Sunho;Seo, Sangyun
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.6
    • /
    • pp.933-942
    • /
    • 2013
  • Common factor analysis and principal component analysis represent two technically distinctive approaches to exploratory factor analysis. Much of the psychometric literature recommends the use of common factor analysis instead of principal component analysis. Nonetheless, factor analysts use principal component analysis more frequently because they believe that principal component analysis could yield (relatively) less accurate estimates of factor loadings compared to common factor analysis but most often produce similar pattern of factor loadings, leading to essentially the same factor interpretations. A simulation study is conducted to evaluate the relative performance of these two approaches in terms of factor pattern recovery under different experimental conditions of sample size, overdetermination, and communality.The results show that principal component analysis performs better in factor recovery with small sample sizes (below 200). It was further shown that this tendency is more prominent when there are a small number of variables per factor. The present results are of practical use for factor analysts in the field of marketing and the social sciences.

Estimation of S&T Knowledge Production Function Using Principal Component Regression Model (주성분 회귀모형을 이용한 과학기술 지식생산함수 추정)

  • Park, Su-Dong;Sung, Oong-Hyun
    • Journal of Korea Technology Innovation Society
    • /
    • v.13 no.2
    • /
    • pp.231-251
    • /
    • 2010
  • The numbers of SCI paper or patent in science and technology are expected to be related with the number of researcher and knowledge stock (R&D stock, paper stock, patent stock). The results of the regression model showed that severe multicollinearity existed and errors were made in the estimation and testing of regression coefficients. To solve the problem of multicollinearity and estimate the effect of the independent variable properly, principal component regression model were applied for three cases with S&T knowledge production. The estimated principal component regression function was transformed into original independent variables to interpret properly its effect. The analysis indicated that the principal component regression model was useful to estimate the effect of the highly correlate production factors and showed that the number of researcher, R&D stock, paper or patent stock had all positive effect on the production of paper or patent.

  • PDF

A dimensional reduction method in cluster analysis for multidimensional data: principal component analysis and factor analysis comparison (다차원 데이터의 군집분석을 위한 차원축소 방법: 주성분분석 및 요인분석 비교)

  • Hong, Jun-Ho;Oh, Min-Ji;Cho, Yong-Been;Lee, Kyung-Hee;Cho, Wan-Sup
    • The Journal of Bigdata
    • /
    • v.5 no.2
    • /
    • pp.135-143
    • /
    • 2020
  • This paper proposes a pre-processing method and a dimensional reduction method in the analysis of shopping carts where there are many correlations between variables when dividing the types of consumers in the agri-food consumer panel data. Cluster analysis is a widely used method for dividing observational objects into several clusters in multivariate data. However, cluster analysis through dimensional reduction may be more effective when several variables are related. In this paper, the food consumption data surveyed of 1,987 households was clustered using the K-means method, and 17 variables were re-selected to divide it into the clusters. Principal component analysis and factor analysis were compared as the solution for multicollinearity problems and as the way to reduce dimensions for clustering. In this study, both principal component analysis and factor analysis reduced the dataset into two dimensions. Although the principal component analysis divided the dataset into three clusters, it did not seem that the difference among the characteristics of the cluster appeared well. However, the characteristics of the clusters in the consumption pattern were well distinguished under the factor analysis method.

Principal Component Analysis on Marine Casualties Occurred at Korean Littoral Sea in Recent 5 Years (최근 5년간 국내 연근해에서 발생한 해양사고에 대한 주성분분석)

  • KIM, Yeong-Sik
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.28 no.2
    • /
    • pp.465-472
    • /
    • 2016
  • Principal Component Analysis (PCA) is useful statistical technique for finding patterns in data, and expressing the data in such a way as to highlight their similarities and differences. In this paper, 1417 marine casualties occurred in Korean littoral sea in recent 5 years, were examined by the PCA. The main results obtained were as follows : 1. Most of marine casualties resulted from the human factors such as careless operation and insufficient engine maintenance. 2. Collision and standing mainly resulted from steering room-related human factors such as careless guard, inadequate ship-handling, however engine damage and fire explosion mainly resulted from engine room-related human factor such as bad handling of engine system. 3. No. 1 principal component represents accident frequency, No. 2 principal component represents the cause and No. 3 principal component represents the pattern of marine casualties, respectively.

Study on Influencing Factors of Traffic Accidents in Urban Tunnel Using Quantification Theory (In Busan Metropolitan City) (수량화 이론을 이용한 도시부 터널 내 교통사고 영향요인에 관한 연구 - 부산광역시를 중심으로 -)

  • Lim, Chang Sik;Choi, Yang Won
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.1
    • /
    • pp.173-185
    • /
    • 2015
  • This study aims to investigate the characteristics and types of car accidents and establish a prediction model by analyzing 456 car accidents having occurred in the 11 tunnels in Busan, through statistical analysis techniques. The results of this study can be summarized as below. As a result of analyzing the characteristics of car accidents, it was found that 64.9% of all the car accidents took place in the tunnels between 08:00 and 18:00, which was higher than 45.8 to 46.1% of the car accidents in common roads. As a result of analyzing the types of car accidents, the car-to-car accident type was the majority, and the sole-car accident type in the tunnels was relatively high, compared to that in common roads. Besides, people at the age between 21 and 40 were most involved in car accidents, and in the vehicle type of the first party to car accidents, trucks showed a high proportion, and in the cloud cover, rainy days or cloudy days showed a high proportion unlike clear days. As a result of analyzing the principal components of car accident influence factors, it was found that the first principal components were road, tunnel structure and traffic flow-related factors, the second principal components lighting facility and road structure-related factors, the third principal factors stand-by and lighting facility-related factors, the fourth principal components human and time series-related factors, the fifth principal components human-related factors, the sixth principal components vehicle and traffic flow-related factors, and the seventh principal components meteorological factors. As a result of classifying car accident spots, there were 5 optimized groups classified, and as a result of analyzing each group based on Quantification Theory Type I, it was found that the first group showed low explanation power for the prediction model, while the fourth group showed a middle explanation power and the second, third and fifth groups showed high explanation power for the prediction model. Out of all the items(principal components) over 0.2(a weak correlation) in the partial correlation coefficient absolute value of the prediction model, this study analyzed variables including road environment variables. As a result, main examination items were summarized as proper traffic flow processing, cross-section composition(the width of a road), tunnel structure(the length of a tunnel), the lineal of a road, ventilation facilities and lighting facilities.

Analysis of Air Temperature Factors Related to Difference of Fruit Characteristics According to Cultivating Areas of Persimmon (Diospyros kaki Thunb.) (감 재배지 간 과실 품질 차이에 관계한 기온요인 분석)

  • Kim, Ho-Cheol;Jeon, Kyung-Soo;Kim, Tae-Choon
    • Journal of Bio-Environment Control
    • /
    • v.17 no.2
    • /
    • pp.124-131
    • /
    • 2008
  • To investigate main air temperature factors correlated to difference of fruit characteristics according to cultivating areas, fruit and air temperature characteristics of eight cultivating areas of 'Fuyu' persimmon were analyzed by principle components and multiple regression analysis. The first principal components extracted from 16 air temperature factors was annual mean temperature, mean temperature during October, annual mean minimum extreme temperature, mean temperature during growing period, and so forth. The second principal components was mean temperature during May and June and so forth. And cumulative contribution was 91.4%. The five of eight cultivating area had clearly the difference of main factors or the correlated direction among cultivating areas. In multiple regression analysis between the extracted main factors and fruit characteristics, fruit hight were highly correlated with mean temperature during growing period ($X_8$) and cumulative temperature ($X_6$), and the regression equation was $Y=150.55-5.375X_8+ 0.014X_6(r^2=0.843)$. Also this regression equation was affected by mean minimum temperature during growing period, cumulative temperature, and mean temperature during August. Fruit diameter was negatively correlated with mean temperature during growing period, flesh browning rate and Hunter a value of peel color were positively correlated with mean minimum temperature during growing period and annual minimum air temperature, respectively.

Local Linear Logistic Classification of Microarray Data Using Orthogonal Components (직교요인을 이용한 국소선형 로지스틱 마이크로어레이 자료의 판별분석)

  • Baek, Jang-Sun;Son, Young-Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.587-598
    • /
    • 2006
  • The number of variables exceeds the number of samples in microarray data. We propose a nonparametric local linear logistic classification procedure using orthogonal components for classifying high-dimensional microarray data. The proposed method is based on the local likelihood and can be applied to multi-class classification. We applied the local linear logistic classification method using PCA, PLS, and factor analysis components as new features to Leukemia data and colon data, and compare the performance of the proposed method with the conventional statistical classification procedures. The proposed method outperforms the conventional ones for each component, and PLS has shown best performance when it is embedded in the proposed method among the three orthogonal components.

Procedure for the Selection of Principal Components in Principal Components Regression (주성분회귀분석에서 주성분선정을 위한 새로운 방법)

  • Kim, Bu-Yong;Shin, Myung-Hee
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.5
    • /
    • pp.967-975
    • /
    • 2010
  • Since the least squares estimation is not appropriate when multicollinearity exists among the regressors of the linear regression model, the principal components regression is used to deal with the multicollinearity problem. This article suggests a new procedure for the selection of suitable principal components. The procedure is based on the condition index instead of the eigenvalue. The principal components corresponding to the indices are removed from the model if any condition indices are larger than the upper limit of the cutoff value. On the other hand, the corresponding principal components are included if any condition indices are smaller than the lower limit. The forward inclusion method is employed to select proper principal components if any condition indices are between the upper limit and the lower limit. The limits are obtained from the linear model which is constructed on the basis of the conjoint analysis. The procedure is evaluated by Monte Carlo simulation in terms of the mean square error of estimator. The simulation results indicate that the proposed procedure is superior to the existing methods.