Representative Keyword Extraction from Few Documents through Fuzzy Inference

퍼지 추론을 이용한 소수 문서의 대표 키워드 추출

  • 노순억 (금오공과대학교 대학원 컴퓨터공학과) ;
  • 김병만 (금오공과대학교 대학원 컴퓨터공학과) ;
  • 허남철 (대구미래대학 컴퓨터정보처리학과)
  • Published : 2001.12.01

Abstract

In this work, we propose a new method of extracting and weighting representative keywords(RKs) from a few documents that might interest a user. In order to extract RKs, we first extract candidate terms and then choose a number of terms called initial representative keywords (IRKS) from them through fuzzy inference. Then, by expanding and reweighting IRKS using term co-occurrence similarity, the final RKs are obtained. Performance of our approach is heavily influenced by effectiveness of selection method of IRKS so that we choose fuzzy inference because it is more effective in handling the uncertainty inherent in selecting representative keywords of documents. The problem addressed in this paper can be viewed as the one of calculating center of document vectors. So, to show the usefulness of our approach, we compare with two famous methods - Rocchio and Widrow-Hoff - on a number of documents collections. The results show that our approach outperforms the other approaches.

Keywords