K-means Clustering using Grid-based Representatives

Park, Hee-Chang;Lee, Sun-Myung;

Journal of the Korean Data and Information Science Society

Volume 16 Issue 4
/
Pages.759-768
/
2005
/
1598-9402(pISSN)

The Korean Data and Information Science Society (한국데이터정보과학회)

K-means Clustering using Grid-based Representatives

Park, Hee-Chang (Department of Statistics, Changwon National University) ;
Lee, Sun-Myung (Department of Statistics, Changwon National University)

Published : 2005.11.30

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

K-means clustering has been widely used in many applications, such that pattern analysis, data analysis, market research and so on. It can identify dense and sparse regions among data attributes or object attributes. But k-means algorithm requires many hours to get k clusters, because it is more primitive and explorative. In this paper we propose a new method of k-means clustering using the grid-based representative value(arithmetic and trimmed mean) for sample. It is more fast than any traditional clustering method and maintains its accuracy.

Keywords

References

Proceedings of the fifth Berkeley symposium on mathematical statistics and probability v.1 Some methods for classification and analysis of multivariate observations MacQueen, J.
Finding Groups in Data: An Introduction to Cluster Analysis Kaufman, L.;Rousseeuw, P.J.
Proceedings of the 20th Very Large Data Bases Conference Efficient and effective clustering method for spatial data mining Ng, R.;Han, J.
Proceedings of The First Pacific-Asia Conference on Knowledge Discovery and Data Mining Clustering Large Data Sets with Mixed Numeric and Categorical Values Huang, Z.
Proceedings of SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining Huang, Z.
Proceedings of Workshop on Mining Data for CRM Efficient k-medoids algorithms using multi-centroids with multi-runs sampling scheme Chu, S.C.;Roddick, J.F.;Pan, J.S.
Proceedings of Second International Conference on Knowledge Discovery and Data Mining An Incremental Multi-Centroid, Multi-Run Sampling Scheme for k-medoids-based Algorithms-Extended Report Chu, S.C.;Roddick, J.F.;Pan, J.S.

Journal of the Korean Data and Information Science Society

K-means Clustering using Grid-based Representatives

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)