Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2003.10B.3.297

The Study on Improvement of Cohesion of Clustering in Incremental Concept Learning  

Baek, Hey-Jung (숭실대학교 대학원 컴퓨터학과)
Park, Young-Tack (숭실대학교 컴퓨터학부)
Abstract
Nowdays, with the explosive growth of the web information, web users Increase requests of systems which collect and analyze web pages that are relevant. The systems which were develop to solve the request were used clustering methods to improve the duality of information. Clustering is defining inter relationship of unordered data and grouping data systematically. The systems using clustering provide the grouped information to the users. So, they understand the information efficiently. We proposed a hybrid clustering method to cluster a large quantity of data efficiently. By that method, We generate initial clusters using COBWEB Algorithm and refine them using Ezioni Algorithm. This paper adds two ideas in prior hybrid clustering method to increment accuracy and efficiency of clusters. Firstly, we propose the clustering method considering weight of attributes of data. Second, we redefine evaluation functions which generate initial clusters to increase efficiency in clustering. Clustering method proposed in this paper processes a large quantity of data and diminish of dependancy on sequence of input of data. So the clusters are useful to make user profiles in high quality. Ultimately, we will show that the proposed clustering method outperforms the pervious clustering method in the aspect of precision and execution speed.
Keywords
Hybrid Clustering; Weight; Evaluation Function; COBWEB; Etzioni;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Mark Devaney, Ashwin Ram, 'Efficient Feature Selection in Conceptual Clustring,' Machine Learning: Proceeding of the Fourteenth International Conference, Nashville, 1997
2 Oren Zamir, Oren Etzioni, Omid Madani and Richard M. Karp, 'Fast and Intuitive Clustering of Web Documents,' KDD'97, 1997
3 Hartigan, J. A., 'Clustering Algorighms,' Wiley, New York, 1975
4 Doug Fisher, 'Interative Opgimization and Simplification of Hierarchical Clusterings,' AI Access foundation and Morgan Kaufmann Publishers, 1996
5 Robert R. Korfhage, 'Information Storage and Retrieval,' Wiley Computer Publishing, 1997
6 Gennari, J. H., Langley, P. & Fisher, D. H., 'Models of incremental concept formation,' Artificial Intelligence, 40, pp.11-61, 1989   DOI   ScienceOn
7 Gluck, M & Corter, J., 'Information, uncertainty and the utility of categories,' Proceeding of the Seventh Annual Conference of the Cognitive Science Society, pp.283-287, Irvine,CA : Lawrence Erlbaum, 1985
8 양찬범, '웹 에이전트를 위한 통합방식 문서 클리스터링' 숭실대학교 석사학위논문, 1999
9 T. M. Mitchell, 'Machine Learning,' McGraw Hill, 1997
10 Kathleen Mckusick, Kevin Thompson, 'COBWEB/3:A Portable Implementation,' NASA Ames Reserch , Technical Report FIA-90-6-18-2, 1990
11 Richard C. Dubes and Anil K. Jain., 'Algorithms for Clustering Data,' Prentice Hall, 1988
12 Wettschereck, D., Aha, D. W. & Mohri,T. 'A review and empirical evalution of feature weighting methods for a class of lazy learning algorithms,' Artificial Intelligence Review, 11, pp.273-314, 1997   DOI