The Journal of Society for e-Business Studies (한국전자거래학회지)
- Volume 7 Issue 1
- /
- Pages.75-86
- /
- 2002
- /
- 2288-3908(pISSN)
- /
- 2765-3846(eISSN)
Development of A Web Mining System Based On Document Similarity
문서 유사도 기반의 웹 마이닝 시스템 개발
Abstract
In this study, we proposed design issues and structure of a web mining system and develop a system for the purpose of knowledge integration under world wide web environments resulted from our developing experiences. The developed system consists of three main functions: 1) gathering documents utilizing a search agent; 2) determining similarity coefficients between any two documents from term frequencies; 3) clustering documents based on similarity coefficients. It is believed that the developed system can be utilized for discovery of knowledge in relatively narrow domains such as news classification, index term generation in knowledge management.