Fuzzy Clustering Model using Principal Components Analysis and Naive Bayesian Classifier

Jun, Sung-Hae;

doi:10.3745/KIPSTB.2004.11B.4.485

The KIPS Transactions:PartB (정보처리학회논문지B)

Volume 11B Issue 4
/
Pages.485-490
/
2004
/
1598-284X(pISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

Fuzzy Clustering Model using Principal Components Analysis and Naive Bayesian Classifier

주성분 분석과 나이브 베이지안 분류기를 이용한 퍼지 군집화 모형

Jun, Sung-Hae

전성해 (청주대학교 통계학과)

Published : 2004.08.01

https://doi.org/10.3745/KIPSTB.2004.11B.4.485 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In data representation, the clustering performs a grouping process which combines given data into some similar clusters. The various similarity measures have been used in many researches. But, the validity of clustering results is subjective and ambiguous, because of difficulty and shortage about objective criterion of clustering. The fuzzy clustering provides a good method for subjective clustering problems. It performs clustering through the similarity matrix which has fuzzy membership value for assigning each object. In this paper, for objective fuzzy clustering, the clustering algorithm which joins principal components analysis as a dimension reduction model with bayesian learning as a statistical learning theory. For performance evaluation of proposed algorithm, Iris and Glass identification data from UCI Machine Learning repository are used. The experimental results shows a happy outcome of proposed model.

자조의 표현에서 군집화는 주어진 데이터를 서로 유사한 개체들끼리 몇 개의 집단으로 묶는 작업을 수행한다. 군집화의 유사도 결정 측도는 맡은 연구들에서 매우 다양한 것들이 사용되었다. 하지만 군집화 결과의 성능 측정에 대한 객관적인 기준 설정이 어렵기 때문에 군집화 결과에 대한 해석은 매우 주관적이고, 애매한 경우가 많다. 퍼지 군집화는 이러한 주관적인 군집화 문제에 있어서 객관성 있는 군집 결정 방안을 제시하여 준다. 각 개체들이 특정 군집에 속하게 될 퍼지 멤버 함수값을 원소로 하는 유사도 행렬을 통하여 군집화를 수행한다. 본 논문에서는 차원 축소기법의 하나인 주성분 분석과 강력한 통계적 학습 이론인 베이지안 학습을 결합한 군집화 모형을 제안하여, 객관적인 퍼지 군집화를 수행하였다. 제안 알고리즘의 성능 평가를 위하여 UCI Machine Loaming Repository의 Iris와 Glass Identification 데이터를 이용한 실험 결과를 제시하였다.

Keywords

References

김기영, 전명식, 다변량 통계자료 분석, 자유아카데미, 1994
박민재, 전성해, 오경환, '붓스트랩 기법과 유전자 알고리즘을 이용한 최적 군집수 결정', 퍼지및지능시스템학회논문지, 제13권 제1호, pp.12-17, 2003
전성해, 오경환, 'MCMC 결측치 대체와 주성분 산점도 기반의 SOM을 이용한 희소한 웹 데이터 분석', 정보처리학회논문지D, Vol.10-D, No.2, pp.277-282, April, 2003 https://doi.org/10.3745/KIPSTD.2003.10D.2.277
한진우, 전성해, 오경환, '군집화를 위한 베이지안 학습 기반의 퍼지 규칙 추출', 한국정보과학회 2003 춘계학술발표논문집(II), April, 2003
J. S. Liu, J. L. Zhang, M. L. Palumbo, C. E. Lawrence, 'Bayesian Clustering with Variable and Transformation Selections,' Bayesian Statistics 7, Oxford University Press, 2003
J. C. Bezdek, 'Pattern Recognition with Fuzzy Objective Function Algorithms,' Plenum Press, 1987
C. M. Bishop, 'Neural Networks for Pattern Recognition,' Clarendon Press : Oxford, 1998
D. Dumitrescu, B. Lazzerini, L. C. Jain, 'Fuzzy Sets and their application to Clustering and Training,' The CRC Press, 2000
J. Han, M. Kamber, 'Data Mining : Concepts and Techniques,' Morgan Kaufmann, 2000
R. J. Hathaway, J. C. Bezdek, 'Switching Regression Models and Fuzzy Clustering,' IEEE Trans. Fuzzy Sets, Vol.1, pp.195-204, 1993 https://doi.org/10.1109/91.236552
S. J. Press, 'Bayesian Statistics : Principles, Models, and Applications,' John Wiley & Sons, 1989
H. J. Zimmermann, 'Fuzzy Set Theory and Its Application,' Kluwer Academic Publishers Group, 2001
C. P. Robert, G. Casella, 'Monte Carlo Statistical Methods,' Springer, 1999
http://www.ics.uci.edu/~mlearn/

The KIPS Transactions:PartB (정보처리학회논문지B)

Fuzzy Clustering Model using Principal Components Analysis and Naive Bayesian Classifier

주성분 분석과 나이브 베이지안 분류기를 이용한 퍼지 군집화 모형

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)