The Journal of Information Technology and Database (정보기술과데이타베이스저널)
- Volume 8 Issue 1 / Pages 117-128 / 2001 / 1226-3559 (pISSN)
A Feature Selection Technique for an Efficient Document Automatic Classification
효율적인 문서 자동 분류를 위한 대표 색인어 추출 기법
Abstract
Recently, much text mining research has sought to find interesting patterns or association rules in large collections of textual documents. However, words extracted from informal documents tend to be irregular and include too many general words, so existing methods have difficulty retrieving knowledge effectively. In this paper, we propose a new feature extraction method for classifying large document collections that uses association rules and is based on an unsupervised learning technique. In experiments, we demonstrate the efficiency of the proposed method by extracting features and classifying documents.
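As a rough illustration of what association rules over document terms can look like, the sketch below mines one-to-one rules between co-occurring terms using the standard support and confidence measures. It is a generic example and not the method proposed in the paper: the function name `term_association_rules`, the thresholds `min_support` and `min_confidence`, and the toy documents are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def term_association_rules(docs, min_support=0.5, min_confidence=0.8):
    """Mine one-to-one term rules (lhs -> rhs) from tokenized documents.

    Hypothetical helper for illustration only, not the paper's algorithm.
    support(a, b)     = fraction of documents containing both a and b
    confidence(a -> b) = support(a, b) / support(a)
    """
    n_docs = len(docs)
    term_sets = [set(doc) for doc in docs]  # ignore repeated terms within a document

    # Document frequency of each term and of each unordered term pair.
    term_count = Counter(t for s in term_sets for t in s)
    pair_count = Counter(
        pair for s in term_sets for pair in combinations(sorted(s), 2)
    )

    rules = []
    for (a, b), n_ab in pair_count.items():
        support = n_ab / n_docs
        if support < min_support:
            continue
        # Check the rule in both directions, since confidence is asymmetric.
        for lhs, rhs in ((a, b), (b, a)):
            confidence = n_ab / term_count[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return rules


# Toy corpus (assumed data): each document is a list of extracted index terms.
docs = [
    ["text", "mining", "rule"],
    ["text", "mining"],
    ["text", "database"],
    ["mining", "text", "pattern"],
]
rules = term_association_rules(docs)
```

In this toy corpus only the rule from "mining" to "text" passes both thresholds. Terms that take part in strong rules could then serve as candidate features for a classifier, although how the paper selects its representative index terms is not specified in the abstract.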
Keywords