• Title/Summary/Keyword: Principal Component Factor Analysis

Search Result 347, Processing Time 0.032 seconds

Factor Analysis for Exploratory Research in the Distribution Science Field (유통과학분야에서 탐색적 연구를 위한 요인분석)

  • Yim, Myung-Seong
    • Journal of Distribution Science
    • /
    • v.13 no.9
    • /
    • pp.103-112
    • /
    • 2015
  • Purpose - This paper aims to provide a step-by-step approach to factor analytic procedures, such as principal component analysis (PCA) and exploratory factor analysis (EFA), and to offer a guideline for factor analysis. Authors have argued that the results of PCA and EFA are substantially similar. Additionally, they assert that PCA is a more appropriate technique for factor analysis because PCA produces easily interpreted results that are likely to be the basis of better decisions. For these reasons, many researchers have used PCA as a technique instead of EFA. However, these techniques are clearly different. PCA should be used for data reduction. On the other hand, EFA has been tailored to identify any underlying factor structure, a set of measured variables that cause the manifest variables to covary. Thus, it is needed for a guideline and for procedures to use in factor analysis. To date, however, these two techniques have been indiscriminately misused. Research design, data, and methodology - This research conducted a literature review. For this, we summarized the meaningful and consistent arguments and drew up guidelines and suggested procedures for rigorous EFA. Results - PCA can be used instead of common factor analysis when all measured variables have high communality. However, common factor analysis is recommended for EFA. First, researchers should evaluate the sample size and check for sampling adequacy before conducting factor analysis. If these conditions are not satisfied, then the next steps cannot be followed. Sample size must be at least 100 with communality above 0.5 and a minimum subject to item ratio of at least 5:1, with a minimum of five items in EFA. Next, Bartlett's sphericity test and the Kaiser-Mayer-Olkin (KMO) measure should be assessed for sampling adequacy. The chi-square value for Bartlett's test should be significant. In addition, a KMO of more than 0.8 is recommended. The next step is to conduct a factor analysis. The analysis is composed of three stages. The first stage determines a rotation technique. Generally, ML or PAF will suggest to researchers the best results. Selection of one of the two techniques heavily hinges on data normality. ML requires normally distributed data; on the other hand, PAF does not. The second step is associated with determining the number of factors to retain in the EFA. The best way to determine the number of factors to retain is to apply three methods including eigenvalues greater than 1.0, the scree plot test, and the variance extracted. The last step is to select one of two rotation methods: orthogonal or oblique. If the research suggests some variables that are correlated to each other, then the oblique method should be selected for factor rotation because the method assumes all factors are correlated in the research. If not, the orthogonal method is possible for factor rotation. Conclusions - Recommendations are offered for the best factor analytic practice for empirical research.

A Classification Method Using Data Reduction

  • Uhm, Daiho;Jun, Sung-Hae;Lee, Seung-Joo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.1
    • /
    • pp.1-5
    • /
    • 2012
  • Data reduction has been used widely in data mining for convenient analysis. Principal component analysis (PCA) and factor analysis (FA) methods are popular techniques. The PCA and FA reduce the number of variables to avoid the curse of dimensionality. The curse of dimensionality is to increase the computing time exponentially in proportion to the number of variables. So, many methods have been published for dimension reduction. Also, data augmentation is another approach to analyze data efficiently. Support vector machine (SVM) algorithm is a representative technique for dimension augmentation. The SVM maps original data to a feature space with high dimension to get the optimal decision plane. Both data reduction and augmentation have been used to solve diverse problems in data analysis. In this paper, we compare the strengths and weaknesses of dimension reduction and augmentation for classification and propose a classification method using data reduction for classification. We will carry out experiments for comparative studies to verify the performance of this research.

Novel assessment method of heavy metal pollution in surface water: A case study of Yangping River in Lingbao City, China

  • Liu, Yingran;Yu, Hongming;Sun, Yu;Chen, Juan
    • Environmental Engineering Research
    • /
    • v.22 no.1
    • /
    • pp.31-39
    • /
    • 2017
  • The primary purpose of this research is to understand those elements that define heavy metals contamination and to propose a novel assessment method based on principal component analysis (PCA) in the Yangping River region of Lingbao City, China. This paper makes detailed calculations regarding such factors the single-factor assessment ($P_i$) and Nemerow's multi-factor index ($P_N$) of heavy metals found in the surface water of the Yangping River. The maximum values of $P_i$ (Cd) and $P_i$ (Pb) were determined to be 892.000 and 113.800 respectively. The maximum value of $P_N$ was calculated to be 639.836. The results of Pearson's correlation analysis, hierarchical cluster analysis, and PCA indicated heavy metal groupings as follows: Cu, Pb, Zn and As, Hg, Cd. The PCA-based pollution index ($P_{an}$) of samplings was subsequently calculated. The relative coefficient square was valued at 0.996 between $P_{an}$ and $P_N$, which indicated that $P_{an}$ is able to serve as a new heavy metal pollution index; not only this index able to eliminate the influence of the maximum value of $P_i$, but further, this index contains the principal component elements needed to evaluate heavy metal pollution levels.

The Application of SVD for Feature Extraction (특징추출을 위한 특이값 분할법의 응용)

  • Lee Hyun-Seung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.43 no.2 s.308
    • /
    • pp.82-86
    • /
    • 2006
  • The design of a pattern recognition system generally involves the three aspects: preprocessing, feature extraction, and decision making. Among them, a feature extraction method determines an appropriate subspace of dimensionality in the original feature space of dimensionality so that it can reduce the complexity of the system and help to improve successful recognition rates. Linear transforms, such as principal component analysis, factor analysis, and linear discriminant analysis have been widely used in pattern recognition for feature extraction. This paper shows that singular value decomposition (SVD) can be applied usefully in feature extraction stage of pattern recognition. As an application, a remote sensing problem is applied to verify the usefulness of SVD. The experimental result indicates that the feature extraction using SVD can improve the recognition rate about 25% compared with that of PCA.

A Study on the Classification of Islands by PCA ( I ) (PCA에 의한 도서분류에 관한 연구( I ))

  • 이강우
    • The Journal of Fisheries Business Administration
    • /
    • v.14 no.2
    • /
    • pp.1-14
    • /
    • 1983
  • This paper considers a classification of the 88 islands located at Kyong-nam area in Korea, using by examples of 12 components of the islands. By means of principal component analysis 2 principle components were extracted, which explained a total of 73.7% of the variance. Using an eigen variable criterion (λ>1), no further principle components were discussed. Principal component 1 and 2 explained 63.4% and 10.3% of the total variance respectively, The representation of the unrelated factor scores along the first and second principal axes produced a new information with respect to the classification of the islands. Based upon the representation, 88 islands were classified into 6 groups i. e. A, B, C, D, E, and F according to similarity of the components among them in this paper. The "Group F" belongs to a miscellaneous assortment that does not fit into the logical category. category.

  • PDF

The Factor Clustering of Growing Stock Changes by Forest Policy using Principal Component Analysis (주성분 분석을 이용한 산림정책별 입목축적변화의 요인 군집)

  • Shin, Hye-Jin;Kim, Eui-Gyeong;Kim, Dong-Hyeon;Kim, Hyeon-Guen
    • Journal of agriculture & life science
    • /
    • v.46 no.2
    • /
    • pp.1-8
    • /
    • 2012
  • This study is a precedent study for deriving transfer function model between growing stock and forest management policies. Its goal is to solve the multicollinearity between forest works inducing growing stock changes through principal component analysis using annual time series data from 1997 to 2008. As the results, the total explanatory power showed 91.4% on the summarized 3 principal components. They were renamed 'good forest management' 'pest & insets management' 'forest fires' for conceptualization on the derived each component.

Analysis of the Impact of Trade Facilitation on China's Trade - Focused on APEC countries - (무역원활화가 중국 수출입에 미치는 영향 분석 - APEC 국가 중심으로 -)

  • Xuan Zhou;Chang-Hwan Choi
    • Korea Trade Review
    • /
    • v.47 no.4
    • /
    • pp.1-14
    • /
    • 2022
  • This study examines the impact of trade facilitation on China's trade for the period 2010-2017 using a gravity model with a measurement of APEC trade facilitation through principal component analysis. The empirical results confirmed that trade facilitation was a key factor to have a positive effect on Chinese exports and that the higher the level of trade facilitation in APEC countries, the more positive the increase in exports and quantities with China. Further, the size of the economy, the total population, and the border between the trading partner had a positive effect on Chinese trade volume. To promote economic growth through increase in trade volume, countries should actively improve trade facilitation and participate in global trade facilitation reform through continuous cooperation with trading partners.

Treatability Evaluation of $A_{2}O$ System by Principal Component Analysis (주성분분석에 의한 $A_{2}O$공법의 처리성 평가)

  • 김복현;이재형;이수환;윤조희
    • Journal of Environmental Health Sciences
    • /
    • v.18 no.2
    • /
    • pp.67-74
    • /
    • 1992
  • The lab-scale biological A$_{2}$O system was applied from treating piggery wastewater highly polluted organic material which nitrogen and phosphorous are much contained relatively in conversion with other wastewater. The objective of this study was to investigate the effect of variance parameters on the treatability of this system according to operation conditions. An obtained experimental data were analysed by using principal component analysis (PCA) method. The results are summarized as follows: 1. From Varimax rotated factor loading in raw wastewater, variance of factor 1 was 36.8% and cumulative percentage of variance from factor 1 to factor 4 was 81.5% and of these was related to BOD, TKN and BOD loading. 2. In anaerobic process, variance of factor 1 was 33.5% and cumulative percentage of variance from factor I to factor 4 was 81.8% and of these was related to PO$_{4}$-P, BOD, DO and Temperature. 3. In anoxic process, variance of factor 1 was 30.1% and cumulative percentage of variance from factor i to factor 4 was 84.3% and of these was related to pH, DO, TKN and temperature. 4. In aerobic process, variance of factor 1 was 43.8% and cumulative percentage of variance from factor 1 to factor 4 was 81.5% and of these was highly related to DO, PO$_{4}$-P and BOD. 5. It was better to be operated below 0.30 kg/kg$\cdot$day F/M ratio to keep over 90% of BOD and SS, 80% of TKN, and 60% of PO$_{4}$-P in treatment efficiencies. 6. Treatment efficiencies was over 93% of BOD and SS, 81% of TKN and 60% of PO$_{4}$-P at over 20$^{\circ}$C, respectively.

  • PDF

A Preliminary Study for Development of a Pain Questionnaire (통증 평가도구 개발을 위한 기초조사)

  • Yi Chung-hwi
    • The Journal of Korean Physical Therapy
    • /
    • v.1 no.1
    • /
    • pp.63-72
    • /
    • 1989
  • The present study was designed to investigate the general characteristics of pain patients and to analyze the properties of Korean pain expression terms as a preliminary step in the development of a pain questionnaire. Questionnaires were administered to 73 adult patients (53 males, 20 females) with knee, ankle, neck, low back, and shoulder pain. The mean duration of pain was 16.2 months (SE=3.3). The results were as fellows : 1. The data show that there are over 30 words in the Korean language to describe the varieties of pain experience even within this small sample. 2, There was low significant relationship between present pain intensity using visual analogue scale and the selected numbers of pain words from the pain questionnaire (p<.01). 3. In order to separate basic factors, a principal component analysis with varimax rotation was performed. The principal component analysis produced 8 factors. The proportion of variance explained by these factors was $71.0\%$. The first factor accounting $26.8\%$ of the variance was labeled 'cruelty and fear related pain' ; second 'pain produced from deep tissue' : third 'skin-punctuating related pain' ; and fourth 'miscellaneous and complicated pain'. Results of this study might be utilitzed in developing a pain questionnaire for pain patients.

  • PDF

Effects of Environmental Factors on Aeromonas spp. Population in Naktong Estuary (낙동강 하구 생태계의 환경요인과 Aeromonas spp. 분포와의 관계)

  • 전도용;권오섭;하영칠
    • Korean Journal of Microbiology
    • /
    • v.27 no.4
    • /
    • pp.391-397
    • /
    • 1989
  • Population of Aeromonas and environmental parameters were investigated at three sites from August 1986, to December, 1986 in Naktong Estuary. The variation range of Aeromonas was $4.3\times10^{2}-4.6\times 10^{4}$ MPN/100ml. The result of ANOVA indicates significant differences among the populations of Aeromonas in each site. The highest population of Aeromonas occurred at site 2, and the lowest at site 3-B. To scrutinize the effects of environmental parameters on the distribution of Aeromonas spp, principal component analysis and multiple stepwise regression were used. The results showed that distribution of Aeromonas spp. was mainly influenced by outflow of freshwater and inflow of inorganic nutrients and correlated with heterotrophic bacteria, available nitrogen, fecal coliform bacteria, and temperature.

  • PDF