Browse > Article
http://dx.doi.org/10.5351/KJAS.2005.18.2.329

A Comparative Study of Determining the Number of Clusters with a Method Proposed  

Chae, Seong-San (Department of Information and Statistics, Daejeon University)
Lim, Nam-Kyoo (Department of Information and Statistics, Daejeon University)
Publication Information
The Korean Journal of Applied Statistics / v.18, no.2, 2005 , pp. 329-341 More about this Journal
Abstract
A method of determining the number of clusters is proposed based on some asymptotic results on the Rand's(1971} $C_k$, k = 2, 3, . . ., N - 1, statistic. Simulation is conducted to compare the proposed method with Chae and Warde(1991), and Huh and Lee(2004).
Keywords
Agglomerative clustering algorithms; Rand's Ck statistics;
Citations & Related Records
연도 인용수 순위
  • Reference
1 DuBien, J. L., Warde, W. D. and Chae, S. S. (2004). Moments of Rand's C statistic in cluster analysis, Statistics & Probability Letters, 69, 243-252   DOI   ScienceOn
2 Fowlkes, E. B. and Mallows, C. L. (1983). A method for comparing two hierarchical clusterings, Journal of American Statistical Association, 78, 553-569   DOI   ScienceOn
3 Idrissi, A. (2000). Contribution it l'Unification de Criteres d'Association pour variables qualitatives, Ph.D., Paris: Universite Pierre et Marie Curie
4 Lance, G. N. and Williams, W. T. (1967). A general theory of classificatory sorting strategies. 1. Hierarchical systems, The Computer Journal, 9, 373-380   DOI
5 Lengyel, T. (1984). On a recurrence involving Stirling numbers, European Journal of Combinatorics, 5, 313-321   DOI
6 Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods, Journal of American Statistical Association, 66, 846-850   DOI   ScienceOn
7 이석훈, 박래현, 김응환 (1995). 쿨롱네트워크를 이용한 집락분석, <응용통계연구>, 8 , 39-50
8 채성산 (1997). 재표본추출 및 검정을 통한 집락수의 예측, 대전대, <자연과학>, 8, 73-88
9 허명회, 이용구 (2004). K-평균 군집화의 재현성 평가 및 응용, <응용통계연구>, 17 , 135-144
10 Becker, H. W., and Riodan, J. (1934). The arithmetic of bell and Stirling numbers, American Journal of Mathematics, 70, 385-394   DOI   ScienceOn
11 Chae, S. S., DuBien, J. L. and Warde W. D. (2004). A method of predicting the number of clusters using asymptotic results on $C_k$, Computational Statistics & Data Analysis, (in revise)
12 Chae, S. S. and Warde W. D. (1991). A method to predict the number of clusters, Journal of the Korean Statistical Society, 20, 162-176
13 Chae, S. S. and Warde W. D. (2005). Effect of using principal coordinates and principal components on retrieval of clusters, Computational Statistics & Data Analysis, (in press)
14 DuBien J. L. and Warde, W. D. (1987). A comparison of agglomerative clustering method with respect to noise, Communication in Statistics, Theory and Method, 16, 1433-1460   DOI   ScienceOn