Domain-Specific Annotation of Digital Documents through Keyphrase Extraction

  • Published: 2011.04.30

Abstract

In this paper, we propose a methodology for annotating digital documents through keyphrase extraction guided by a domain-specific taxonomy. A limitation of existing keyphrase extraction algorithms is that their output may contain irrelevant keyphrases alongside relevant ones, so the quality of the generated keyphrases does not meet the required level of accuracy. Our proposed approach exploits the semantic relationships and hierarchical structure of the classification scheme to filter out irrelevant keyphrases suggested by the Keyphrase Extraction Algorithm (KEA++). Our experimental results show that the proposed algorithm achieves high precision, although with low recall.
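The abstract does not spell out the filtering rule, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a hypothetical toy taxonomy (`PARENT`) and a hypothetical relatedness rule: a candidate keyphrase is kept only if it is an ancestor, descendant, or sibling of at least one other candidate. It also shows how such a filter can raise precision while lowering recall.

```python
# Hypothetical toy taxonomy: child term -> parent term (illustrative only,
# not taken from the paper).
PARENT = {
    "machine learning": "artificial intelligence",
    "neural networks": "machine learning",
    "text mining": "artificial intelligence",
    "artificial intelligence": "computer science",
    "databases": "computer science",
    "soil chemistry": "agriculture",
}


def ancestors(term):
    """Return the set of all terms above `term` in the toy taxonomy."""
    seen = set()
    while term in PARENT and PARENT[term] not in seen:
        term = PARENT[term]
        seen.add(term)
    return seen


def related(a, b):
    """Assumed rule: a and b are related if one is an ancestor of the
    other, or if they share the same direct parent (siblings)."""
    if a in ancestors(b) or b in ancestors(a):
        return True
    return PARENT.get(a) is not None and PARENT.get(a) == PARENT.get(b)


def filter_keyphrases(candidates):
    """Keep only candidates related to at least one other candidate."""
    return [
        c for c in candidates
        if any(related(c, other) for other in candidates if other != c)
    ]


def precision_recall(predicted, gold):
    """Standard set-based precision and recall."""
    predicted, gold = set(predicted), set(gold)
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    return precision, recall


# Candidates as an extractor such as KEA++ might suggest them (made-up data).
candidates = ["machine learning", "neural networks", "text mining",
              "soil chemistry", "databases"]
# Made-up gold-standard annotations for the same document.
gold = ["machine learning", "neural networks", "text mining", "databases"]

filtered = filter_keyphrases(candidates)
print(precision_recall(candidates, gold))  # before filtering
print(precision_recall(filtered, gold))    # after filtering
```

In this made-up example the filter drops the off-topic "soil chemistry" but also the isolated yet correct "databases", which is the kind of trade-off between precision and recall that the abstract reports.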

Keywords