Implementation of Text Summarize Automation Using Document Length Normalization

문서 길이 정규화를 이용한 문서 요약 자동화 시스템 구현

  • 이재훈 (조선대학교 전자계산학과) ;
  • 김영천 (조선대학교 전자계산학과) ;
  • 이성주 (조선대학교 전자계산학과)
  • Published : 2001.12.01

Abstract

With the rapid growth of the World Wide Web and electronic information services, information is becoming available on-Line at an incredible rate. One result is the oft-decried information overload. No one has time to read everything, yet we often have to make critical decisions based on what we are able to assimilate. The technology of automatic text summarization is becoming indispensable for dealing with this problem. Text summarization is the process of distilling the most important information from a source to produce an abridged version for a particular user or task. Information retrieval(IR) is the task of searching a set of documents for some query-relevant documents. On the other hand, text summarization is considered to be the task of searching a document, a set of sentences, for some topic-relevant sentences. In this paper, we show that document information, that is more reliable and suitable for query, using document length normalization of which is gained through information retrieval . Experimental results of this system in newspaper articles show that document length normalization method superior to other methods use query itself.

Keywords