Analysis of Document Clustering Varying Cluster Centroid Decisions

Performance Analysis of Document Clustering According to the Cluster Centroid Determination Method

  • Published : 2002.06.01

Abstract

K-means is a popular clustering algorithm that is widely used in information retrieval. In this paper, we examine K-means from the standpoint of how its centroids are created, and we propose a method for determining new centroids that reflects document features and takes the context of each document into account. For the experiments, we used an automatic document summarizer to summarize the Reuters-21578 newswire test collection and obtained a 20% improvement in recall.
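For orientation, the centroid step that the abstract refers to can be sketched as follows. This is only a minimal illustration of the standard K-means baseline, in which each new centroid is the plain mean of its member vectors; it is not the centroid method proposed in the paper, which the abstract does not specify in detail. The function names (`cosine`, `mean_centroid`, `kmeans`), the use of cosine similarity, and the toy term-frequency vectors are all assumptions made for the example.

```python
import math

def cosine(a, b):
    """Cosine similarity between two term-weight vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def mean_centroid(vectors):
    """Baseline centroid decision: component-wise mean of the member vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def kmeans(docs, centroids, iterations=10):
    """Standard K-means loop: assign each document, then recompute centroids."""
    assignment = []
    for _ in range(iterations):
        # Assignment step: each document joins its most similar centroid.
        assignment = [
            max(range(len(centroids)), key=lambda k: cosine(d, centroids[k]))
            for d in docs
        ]
        # Centroid step: this is the decision that a different method would replace.
        new_centroids = []
        for k in range(len(centroids)):
            members = [d for d, a in zip(docs, assignment) if a == k]
            # Keep the old centroid if a cluster ends up empty.
            new_centroids.append(mean_centroid(members) if members else centroids[k])
        if new_centroids == centroids:
            break  # converged: centroids no longer change
        centroids = new_centroids
    return assignment, centroids

# Hypothetical toy data: four documents over a three-term vocabulary.
docs = [[2.0, 0.0, 1.0], [3.0, 0.0, 0.0], [0.0, 2.0, 1.0], [0.0, 3.0, 0.0]]
assignment, centroids = kmeans(docs, centroids=[docs[0], docs[2]])
print(assignment)
print(centroids)
```

Swapping `mean_centroid` for another function is the single point at which an alternative centroid decision, such as one weighted by document features, would plug into this loop.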

Keywords