• 제목/요약/키워드: Dimension Reduction Technique

검색결과 60건 처리시간 0.177초

A Classification Method Using Data Reduction

  • Uhm, Daiho;Jun, Sung-Hae;Lee, Seung-Joo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제12권1호
    • /
    • pp.1-5
    • /
    • 2012
  • Data reduction has been used widely in data mining for convenient analysis. Principal component analysis (PCA) and factor analysis (FA) methods are popular techniques. The PCA and FA reduce the number of variables to avoid the curse of dimensionality. The curse of dimensionality is to increase the computing time exponentially in proportion to the number of variables. So, many methods have been published for dimension reduction. Also, data augmentation is another approach to analyze data efficiently. Support vector machine (SVM) algorithm is a representative technique for dimension augmentation. The SVM maps original data to a feature space with high dimension to get the optimal decision plane. Both data reduction and augmentation have been used to solve diverse problems in data analysis. In this paper, we compare the strengths and weaknesses of dimension reduction and augmentation for classification and propose a classification method using data reduction for classification. We will carry out experiments for comparative studies to verify the performance of this research.

A Note on Bootstrapping in Sufficient Dimension Reduction

  • Yoo, Jae Keun;Jeong, Sun
    • Communications for Statistical Applications and Methods
    • /
    • 제22권3호
    • /
    • pp.285-294
    • /
    • 2015
  • A permutation test is the popular and attractive alternative to derive asymptotic distributions of dimension test statistics in sufficient dimension reduction methodologies; however, recent studies show that a bootstrapping technique also can be used. We consider two types of bootstrapping dimension determination, which are partial and whole bootstrapping procedures. Numerical studies compare the permutation test and the two bootstrapping procedures; subsequently, real data application is presented. Considering two additional bootstrapping procedures to the existing permutation test, one has more supporting evidence for the dimension estimation of the central subspace that allow it to be determined more convincingly.

Progression-Preserving Dimension Reduction for High-Dimensional Sensor Data Visualization

  • Yoon, Hyunjin;Shahabi, Cyrus;Winstein, Carolee J.;Jang, Jong-Hyun
    • ETRI Journal
    • /
    • 제35권5호
    • /
    • pp.911-914
    • /
    • 2013
  • This letter presents Progression-Preserving Projection, a dimension reduction technique that finds a linear projection that maps a high-dimensional sensor dataset into a two- or three-dimensional subspace with a particularly useful property for visual exploration. As a demonstration of its effectiveness as a visual exploration and diagnostic means, we empirically evaluate the proposed technique over a dataset acquired from our own virtual-reality-enhanced ball-intercepting training system designed to promote the upper extremity movement skills of individuals recovering from stroke-related hemiparesis.

부분-수량화를 통한 시계열 자료 분석에서의 차원축소 (Dimension Reduction in Time Series via Partially Quanti ed Principal Componen)

  • 박진아;황선영
    • 응용통계연구
    • /
    • 제23권5호
    • /
    • pp.813-822
    • /
    • 2010
  • 차원 축소(dimension reduction) 기법은 주로 횡단면 자료 분석에서 널리 이용되어 왔으며 시계열 분석 분야에서의 적용은 상대적으로 미진한 실정이다. 본 논문에서는 부분-수량화를 통한 주성분분석 방법을 계절형 시계열에 적용시켜 시계열 자료의 차원 축소를 시도하고자 한다. 분석 방법론을 단계별로 제시하였으며 월별 실업률 자료 분석을 통해 설명하였다.

스프링 설계문제의 신뢰도 해석을 위한 크리깅 기반 차원감소법의 활용 (Kriging Dimension Reduction Method for Reliability Analysis in Spring Design)

  • 강진혁;안다운;원준호;최주호
    • 한국전산구조공학회:학술대회논문집
    • /
    • 한국전산구조공학회 2008년도 정기 학술대회
    • /
    • pp.422-427
    • /
    • 2008
  • This study is to illustrate the usefulness of Kriging Dimension Reduction Method(KDRM), which is to construct probability distribution of response function in the presence of the physical uncertainty of input variables. DRM has recently received increased attention due to its sensitivity-free nature and efficiency that considerable accuracy is obtained with only a few number of analyses. However, the DRM has a number of drawbacks such as instability and inaccuracy for functions with increased nonlinearity. As a remedy, Kriging interpolation technique is incorporated which is known as more accurate for nonlinear functions. The KDRM is applied and compared with MCS methods in a compression coil spring design problem. The effectiveness and accuracy of this method is verified.

  • PDF

가변적인 샘플링을 이용한 차원 감소법에 의한 신뢰도 해석 기법 (Reliability Analysis Using Dimension Reduction Method with Variable Sampling Points)

  • 육순민;민준홍;김동호;최동훈
    • 대한기계학회논문집A
    • /
    • 제33권9호
    • /
    • pp.870-877
    • /
    • 2009
  • This study provides how the Dimension Reduction (DR) method as an efficient technique for reliability analysis can acquire its increased efficiency when it is applied to highly nonlinear problems. In the highly nonlinear engineering systems, 4N+1 (N: number of random variables) sampling is generally recognized to be appropriate. However, there exists uncertainty concerning the standard for judgment of non-linearity of the system as well as possibility of diverse degrees of non-linearity according to each of the random variables. In this regard, this study judged the linearity individually on each random variable after 2N+1 sampling. If high non-linearity appeared, 2 additional sampling was administered on each random variable to apply the DR method. The applications of the proposed sampling to the examples produced the constant results with increased efficiency.

Face recognition by PLS

  • Baek, Jang-Sun
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2003년도 추계 학술발표회 논문집
    • /
    • pp.69-72
    • /
    • 2003
  • The paper considers partial least squares (PLS) as a new dimension reduction technique for the feature vector to overcome the small sample size problem in face recognition. Principal component analysis (PCA), a conventional dimension reduction method, selects the components with maximum variability, irrespective of the class information. So PCA does not necessarily extract features that are important for the discrimination of classes. PLS, on the other hand, constructs the components so that the correlation between the class variable and themselves is maximized. Therefore PLS components are more predictive than PCA components in classification. The experimental results on Manchester and ORL databases show that PLS is to be preferred over PCA when classification is the goal and dimension reduction is needed.

  • PDF

Comparison of Methods for Reducing the Dimension of Compositional Data with Zero Values

  • Song, Taeg-Youn;Choi, Byung-Jin
    • Communications for Statistical Applications and Methods
    • /
    • 제19권4호
    • /
    • pp.559-569
    • /
    • 2012
  • Compositional data consist of compositions that are non-negative vectors of proportions with the unit-sum constraint. In disciplines such as petrology and archaeometry, it is fundamental to statistically analyze this type of data. Aitchison (1983) introduced a log-contrast principal component analysis that involves logratio transformed data, as a dimension-reduction technique to understand and interpret the structure of compositional data. However, the analysis is not usable when zero values are present in the data. In this paper, we introduce 4 possible methods to reduce the dimension of compositional data with zero values. Two real data sets are analyzed using the methods and the obtained results are compared.

차원축소를 통한 다변량 시계열의 변동성 분석 및 응용 (Volatility Analysis for Multivariate Time Series via Dimension Reduction)

  • 송유진;최문선;황선영
    • Communications for Statistical Applications and Methods
    • /
    • 제15권6호
    • /
    • pp.825-835
    • /
    • 2008
  • 계량경제학 분야에서 널리 쓰이는 MGARCH(multivariate GARCH)모형은 여러개의 시계열자료들의 변동성을 함께 모형화한다. 그러나 변수가 많아질수록 추정해야 할 모수의 수가 급격하게 늘어나는 문제점이 있다. 본 연구에서는 인자 모형을 통해 자료의 차원을 축소시킴로써 이러한 문제를 해결하고자 하였다. 국내의 주가수익률 자료에 통계적 인자 모형과 fundamental factor model을 적용하여 각각의 의미 있는 인자들을 얻은 후 이를 MGARCH모형에 적합시켰다. 또한 두 인자모형을 바탕으로 얻어진 최종 모형들의 MSE, MAD와 VaR(Value at Risk)를 계산하여 예측력을 비교하고자 한다.

위성 영상데이터의 주성분변환 및 주성분 기반 영상분류 (Principal Component Transformation of the Satellite Image Data and Principal-Components-Based Image Classification)

  • 서용수
    • 한국지리정보학회지
    • /
    • 제7권4호
    • /
    • pp.24-33
    • /
    • 2004
  • 원격탐사(remote sensing) 기술의 비약적인 발전과 함께 위성 영상데이터의 분광대역수가 급속히 증가하고 있다. 대역수의 증가로 영상데이터량이 급격히 증가하게 되고, 이에 따라 이들 데이터를 처리하기 위해서는 처리속도가 빠른 영상처리 기술이 필요하게 되었다. 분광 대역 수를 줄여 빠르게 처리하는 한가지 방법으로 널리 사용되고 있는 것이 주성분 변환법이다. 본 논문에서는 주성분 변환법에 대한 처리과정에 대해 논하였으며, 위성 영상데이터를 주성분 변환한 결과인 주성분 영상데이터를 분석하였다. 분석결과 실험 영상데이터의 경우, 3개의 주성분($PC_1$, $PC_2$, $PC_3$)의 누적 백분율 분산 값이 99.1%로 이는 3개의 주성분이 거의 대부분의 정보를 가지고 있음을 알 수 있었다. 3개의 주성분 영상데이터만을 사용한다면 데이터 저장을 위한 메모리 용량이나 데이터 전송시간 및 처리시간을 크게 감소시킬 수 있다. 또한 본 논문에서는 주성분 영상데이터를 최대유사분류법과 신경회로망을 이용한 다층 퍼셉트론 분류법으로 분류하고 결과를 평가한 후, 주성분 변환법이 갖는 차원축소 효과를 분석하였다. 분석결과 주성분 3개를 사용한 분류결과와 주성분 6개를 사용한 분류결과간의 분류정답률이 크게 차이가 나지 않았다. 이는 분류에 사용하는 영상데이터 수를 6개 차원에서 3개 차원으로 줄여도 비슷한 분류성능을 얻을 수 있음을 확인할 수 있었다.

  • PDF