• 제목/요약/키워드: Discriminant analysis

검색결과 1,577건 처리시간 0.025초

Some Diagnostic Results in Discriminant Analysis

  • Bae, Whasoo;Hwang, Soonyoung
    • Journal of the Korean Statistical Society
    • /
    • 제30권1호
    • /
    • pp.139-151
    • /
    • 2001
  • Although lots of works are done in influence diagnostics, results in the multivariate analysis are quite rare. One of recent works done by Fung(1995) is about the single case influence diagnostics in the linear discriminant analysis. In this paper we extend Fung's results to the multiple cases diagnostics which are necessary in the linear discriminant analysis for two reasons among others; First, the masking effect cannot be detected by single case diagnostics and secondly two populations are concerned in the discriminant analysis, i.e., influential cases can occur in one or both populations.

  • PDF

CART의 예측 성능:은행 및 보험 회사 데이터 사용 (The Prediction Performance of the CART Using Bank and Insurance Company Data)

  • 박정선
    • 한국정보처리학회논문지
    • /
    • 제3권6호
    • /
    • pp.1468-1472
    • /
    • 1996
  • 본 연구에서는 CART(Classification and Regression Tree)가 예측을 함에 있어 통계적인 기법인 discriminant analysis와 비교된다. 은행 데이터를 사용하는 경우 discriminant analysis가 더 나은 성능을 보여줬으며, 보험 회사 데이터를 사용한 경 우 CART가 더 나은 성능을 보여줬다. 이러한 모순된 결과가 데이터의 성격을 분석함 으로 해석된다. 본 연구에서는 두가지 모델 모두 사용된 매개변수들인 사전 확률, 데 이터, 타입 I/II오류 코스트, 검증 방법에 의해 성능의 차이를 보여줬다.

  • PDF

관능특성 및 판별함수를 이용한 한우고기 맛 등급 분석 (Palatability Grading Analysis of Hanwoo Beef using Sensory Properties and Discriminant Analysis)

  • 조수현;서그러운달님;김동훈;김재희
    • 한국축산식품학회지
    • /
    • 제29권1호
    • /
    • pp.132-139
    • /
    • 2009
  • 본 연구에서는 1,300명의 소비자들이 직접 먹어보고 평가한 한우고기 데이터를 이용하여 쇠고기 맛 등급을 구분 해 내기 위한 판별분석 방법들을 비교하였다. 한우 관능평가의 주요 세 변수인 연도, 다즙성, 향미를 포함한 정준 판별분석과 대표적인 맛 변수로 여겨지는 전반적인 기호도 만을 이용하여 선형판별분석과 비모수 판별분석을 하였다. 전반적인 기호도와 같은 한 개의 변수만을 사용할 경우 두 가지 모두 비슷한 분류율을 나타내지만 선형판별 함수는 이해와 사용 측면에서 장점이 있었던 반면에 비모수적 방법은 커널함수와 띠폭에 대한 선택이 불편하지만 잘 선택하면 정확한 분류율을 높일 수 있는 장점이 있었다. 그러나 다른 정보를 가진 변수들이 있음에도 불구하고 한 개의 변수만을 이용한 판별 분석은 판별에 영향을 미치는 다른 중요한 변수들의 정보를 활용하지 못한다는 문제점이 있다. 한편, 정준판별분석의 경우 정준판별함수의 오분류율이 일변량 선형 판별함수와 비모수 판별함수의 오분류율에 비해 크게 떨어지지 않으면서 분포에 대한 특별한 가정이 필요하지 않아 통계적 가정이 까다롭지 않고 또한 맛에 중요한 요인인 연도, 다즙성, 향미의 세 개변수를 모두 사용하므로 맛 정보를 최대로 활용한다는 장점이 있었다. 따라서 본 연구결과 연도, 다즙성, 향미의 세가지 변수 정보를 모두 포함한 다변량 정준판별분석법을 이용하는 것이 맛 등급을 구분하는데 가장 적절할 것으로 판단되었다.

Principal Discriminant Variate (PDV) Method for Classification of Multicollinear Data: Application to Diagnosis of Mastitic Cows Using Near-Infrared Spectra of Plasma Samples

  • Jiang, Jian-Hui;Tsenkova, Roumiana;Yu, Ru-Qin;Ozaki, Yukihiro
    • 한국근적외분광분석학회:학술대회논문집
    • /
    • 한국근적외분광분석학회 2001년도 NIR-2001
    • /
    • pp.1244-1244
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noises present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i. e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess a good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between the separability and the stability. This is of special significance for multivariate spectroscopy-based classification where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for handling effectively multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method. Near-infrared (NIR) spectra of blood plasma samples from mastitic and healthy cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from mastitic and healthy cows are clearly discriminated between by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference, thereby providing a useful means for spectroscopy-based clinic applications.

  • PDF

PRINCIPAL DISCRIMINANT VARIATE (PDV) METHOD FOR CLASSIFICATION OF MULTICOLLINEAR DATA WITH APPLICATION TO NEAR-INFRARED SPECTRA OF COW PLASMA SAMPLES

  • Jiang, Jian-Hui;Yuqing Wu;Yu, Ru-Qin;Yukihiro Ozaki
    • 한국근적외분광분석학회:학술대회논문집
    • /
    • 한국근적외분광분석학회 2001년도 NIR-2001
    • /
    • pp.1042-1042
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noises present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i. e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposed to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess a good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between the separability and the stability. This is of special significance for multivariate spectroscopy-based classification where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for handling effectively multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method. Near-infrared (NIR) spectra of blood plasma samples from daily monitoring of two Japanese cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from two cows are clearly discriminated between by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA md FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference.

  • PDF

판별분석을 통한 패밀리레스토랑의 고객 분류와 마케팅전략에 관한 연구 (A Multiple Discriminant Approach to Identifying Frequent Users of Eating out at Family Restaurant)

  • 강종헌
    • 한국식품조리과학회지
    • /
    • 제18권1호
    • /
    • pp.109-118
    • /
    • 2002
  • The purpose of this study was to identify the behavioral, attitudinal, and demographic correlates of light, medium, and heavy users of eating out at family restaurants. Among 358 reponses from the subjects, 224 responses were utilized for the analysis, and 134 responses were reserved for validating the discriminant function. Descriptive statistics, reliability analysis, stepwise discriminant analysis, canonical discriminant analysis, and anova analysis were used for this study. The findings from this study were as follows: First, He behavioral characteristics were found to discriminate among the three usage groups. Second, it was found that heavy users expressed greater difference between perception and expectation on the quantity of food that are appropriately served and the consistent quality of food at every visit. Third, the usage rate of eating out was not dependent on the sex, but dependent on the companion, average expenditure, and the time of eating out in chi-square test. Finally, the results of the study provide some insight into the pattern of marketing strategies that can be successfully used by the managers of family restaurants.

일반화된 판별분석 기법을 이용한 능동소나 표적 식별 (Sonar Target Classification using Generalized Discriminant Analysis)

  • 김동욱;김태환;석종원;배건성
    • 한국정보통신학회논문지
    • /
    • 제22권1호
    • /
    • pp.125-130
    • /
    • 2018
  • 선형판별분석(LDA) 기법은 특징벡터의 차원을 줄이거나 클래스 식별에 이용되는 통계적 분석 방법이다. 그러나 선형 분리가 불가능한 데이터 집합의 경우에는 비선형 함수를 이용하여 특징벡터를 고차원의 공간으로 사상(mapping) 시켜줌으로써 선형 분리가 가능하도록 만들 수 있는데, 이러한 기법을 일반화된 판별분석(GDA) 또는 커널판별분석(KDA) 기법이라고 한다. 본 연구에서는 인터넷에 공개되어 있는 능동소나 표적신호에 LDA 및 GDA 기법을 이용하여 표적식별 실험을 수행하고, 그 결과를 비교/분석하였다. 실험 결과 104개의 테스트 데이터에 대해 LDA 기법으로는 73.08% 인식률을 얻었으나 GDA 기법으로는 95.19%로 기존의 MLP 또는 커널 기반 SVM에 비해 나은 성능을 보였다.

기술금융을 위한 부실 가능성 예측 최적 판별모형에 대한 연구 (A Study on the Optimal Discriminant Model Predicting the likelihood of Insolvency for Technology Financing)

  • 성웅현
    • 기술혁신학회지
    • /
    • 제10권2호
    • /
    • pp.183-205
    • /
    • 2007
  • 본 연구는 기술력평가에 근거해서 중소기업 부실예측 가능성을 사전에 예측할 수 있는 최적 판별 모형을 개발 제안하였다. 판별모형에 포함될 설명변수는 요인분석과 판별모형의 단계별 선택방법에 의하여 선정되었다. 분석결과 선형판별모형이 로지스틱판별모형보다 임계확률 관점에서 적절한 것으로 나타났다. 최적 선형판별모형의 분류 정분류율은 70.4%, 분류 예측력은 67.5%로 나타났다. 최적 선형판별모형의 활용도를 높이기 위해서 확실 범주와 유보범주를 구분할 수 있는 경계값을 설정하였다. 분석결과를 활용하면 기술금융 취급기관은 부실위험 평가와 더불어 기술금융 신청기업의 순위를 부여할 때 유용하게 사용할 수 있을 것으로 기대된다.

  • PDF

판별분석을 이용한 토지이용별 토양 특성 변화 연구

  • 고경석;김재곤;이진수;김탁현;이규호;조춘희;오인숙;정영욱
    • 한국지하수토양환경학회:학술대회논문집
    • /
    • 한국지하수토양환경학회 2005년도 총회 및 춘계학술발표회
    • /
    • pp.237-241
    • /
    • 2005
  • The physical and chemical characteristics of soils in a small watershed were investigated and the effect of geology and land use on soil quality were examined by using multivariate statistical methods, principal components analysis and discriminant analysis. It was considered that the accumulation of salts in the farmland soils indicated by electrical conductivity, contents of cations and anions and pH was caused by fertilizer input during cultivation. The contents of inorganic components are increased as following order: upland > orchard > paddy field > forest. The results of two discriminant analyses using water extractable inorganic components and their ratios by land use were also clearly classified by discriminant function 1 and 2. In discriminant analysis by components, discriminant function 1 indicated the effect of fertilizer application and increased as following order: upland > orchard > paddy field > forest soil.

  • PDF