Selection of Cluster Hierarchy Depth in Hierarchical Clustering using K-Means Algorithm  

Lee, Won-Hee (Dept. of Electronics & Information Engineering, Chonbuk National University)
Lee, Shin-Won (Dept. of Electronics & Information Engineering, Chonbuk National University)
Chung, Sung-Jong (Dept. of Electronics & Information Engineering, Chonbuk National University)
An, Dong-Un (Dept. of Electronics & Information Engineering, Chonbuk National University)
Abstract
Many papers have shown that hierarchical clustering performs well but is limited by its quadratic time complexity. In contrast, K-means has lower time complexity, even with a large number of variables. Considering simplicity, quality, and efficiency, we combine the two approaches in a new system, named CONDOR, that builds a hierarchical structure over documents clustered with the K-means algorithm. We evaluate its performance at different hierarchy depths and with an initial number of centroids that is not fixed but varies with the number of documents relevant to a given query. Compared with the conventional method, in which the initial centroids are established in advance, our method performs considerably better.
Keywords
K-means; Document clustering; Information retrieval; Hierarchical clustering
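The abstract describes building a cluster hierarchy of a chosen depth by applying K-means to documents. The sketch below illustrates one way this could work: a top-down scheme that splits each cluster with K-means until a depth limit is reached. It is not the CONDOR implementation, whose details are not given here. All names (`ClusterNode`, `build_hierarchy`, `tree_depth`), the toy 2-D vectors, and the fixed `k` at every level are assumptions made for illustration; the paper instead varies the number of initial centroids.

```python
import random
from dataclasses import dataclass, field


@dataclass
class ClusterNode:
    members: list                  # indices into the document-vector list
    centroid: list                 # mean vector of the members
    children: list = field(default_factory=list)


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(vectors):
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def kmeans(vectors, k, rng, iters=20):
    """Plain Lloyd-style K-means; returns k lists of positions into `vectors`."""
    centroids = rng.sample(vectors, k)   # assumption: random initial centroids
    groups = []
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for pos, v in enumerate(vectors):
            nearest = min(range(k), key=lambda c: sq_dist(v, centroids[c]))
            groups[nearest].append(pos)
        # Recompute each centroid; keep the old one if its group is empty.
        updated = [mean([vectors[p] for p in g]) if g else centroids[j]
                   for j, g in enumerate(groups)]
        if updated == centroids:         # assignments have stabilized
            break
        centroids = updated
    return groups


def build_hierarchy(vectors, indices, depth, k, rng):
    """Split `indices` with K-means, recursing until `depth` levels are built."""
    subset = [vectors[i] for i in indices]
    node = ClusterNode(members=list(indices), centroid=mean(subset))
    if depth == 0 or len(indices) <= k:  # too deep or too small to split
        return node
    for group in kmeans(subset, k, rng):
        if group:
            child_indices = [indices[p] for p in group]
            node.children.append(
                build_hierarchy(vectors, child_indices, depth - 1, k, rng))
    return node


def tree_depth(node):
    """Number of levels below `node` (0 for a leaf)."""
    if not node.children:
        return 0
    return 1 + max(tree_depth(child) for child in node.children)


# Toy document vectors (hypothetical data, two well-separated groups).
docs = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
        [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]]
root = build_hierarchy(docs, list(range(len(docs))), depth=2, k=2,
                       rng=random.Random(0))
print(tree_depth(root), [child.members for child in root.children])
```

Here `depth` caps the height of the hierarchy, which is the parameter the paper studies. Each level costs one K-means pass over the documents in that branch, so no pairwise distance matrix over all documents is ever built.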