
Korean Summarization System using Automatic Paragraphing  

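김계성 (School of General Education, Kyungil University)
이현주 (Department of Computer Engineering, Kyungpook National University)
이상조 (Department of Computer Engineering, Kyungpook National University)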
Abstract
In this paper, we describe a system that extracts important sentences from Korean newspaper articles using automatic paragraphing. First, the system detects words repeated between sentences. From these repeated words, it computes the Closeness Degree between Sentences (CDS), based on the degree of morphological agreement and the change in grammatical role. It then automatically divides the document into meaningful paragraphs, with the number of paragraphs set by the user. Finally, it selects one representative sentence from each paragraph and generates a summary from these representative sentences. Although our system does not use features such as the title, sentence position, or rhetorical structure, it is able to extract meaningful sentences for inclusion in the summary.
Keywords
Closeness Degree between Sentences (CDS); Automatic Paragraphing; Text Summarization; Sentence Extraction
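
The pipeline outlined in the abstract (repeated-word detection, a closeness score between sentences, paragraphing into a user-chosen number of paragraphs, and one representative sentence per paragraph) can be illustrated with a minimal Python sketch. The abstract does not give the CDS formula, so everything concrete here is an assumption: the (word, role) token format, the weights 1.0 and 0.5, the rule of cutting at the weakest links between adjacent sentences, and the rule for choosing a representative sentence are all hypothetical stand-ins, not the authors' method.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical input format: each token is (word, grammatical role).
Token = Tuple[str, str]
Sentence = List[Token]


def closeness(a: Sentence, b: Sentence) -> float:
    """Toy stand-in for CDS: score words of `a` that are repeated in `b`.

    Assumed weights: 1.0 if the word keeps its grammatical role,
    0.5 if the word is repeated with a different role.
    """
    roles_in_b: Dict[str, Set[str]] = {}
    for word, role in b:
        roles_in_b.setdefault(word, set()).add(role)
    score = 0.0
    for word, role in a:
        if word in roles_in_b:
            score += 1.0 if role in roles_in_b[word] else 0.5
    return score


def split_paragraphs(sents: List[Sentence], k: int) -> List[List[int]]:
    """Split into at most k paragraphs by cutting the k-1 weakest links
    between adjacent sentences (an assumed segmentation rule)."""
    links = [closeness(sents[i], sents[i + 1]) for i in range(len(sents) - 1)]
    order = sorted(range(len(links)), key=lambda i: (links[i], i))
    cuts = set(order[: max(k - 1, 0)])  # a cut at i ends a paragraph after sentence i
    paragraphs: List[List[int]] = []
    current: List[int] = []
    for i in range(len(sents)):
        current.append(i)
        if i in cuts:
            paragraphs.append(current)
            current = []
    if current:
        paragraphs.append(current)
    return paragraphs


def representative(sents: List[Sentence], paragraph: List[int]) -> int:
    """Pick the sentence with the highest total closeness to the rest of
    its paragraph; ties go to the earliest sentence."""
    def total(i: int) -> float:
        return sum(closeness(sents[i], sents[j]) for j in paragraph if j != i)
    return max(paragraph, key=total)


def summarize(sents: List[Sentence], k: int) -> List[int]:
    """Return the indices of one representative sentence per paragraph."""
    return [representative(sents, p) for p in split_paragraphs(sents, k)]


# Made-up example data: two sentences about a bank, two about rain.
example = [
    [("bank", "subj"), ("rate", "obj")],
    [("bank", "subj"), ("loan", "obj")],
    [("weather", "subj"), ("rain", "obj")],
    [("rain", "subj"), ("flood", "obj")],
]
print(split_paragraphs(example, 2))
print(summarize(example, 2))
```

In this sketch the number of paragraphs `k` plays the role of the user-defined paragraph count mentioned in the abstract. A real system would obtain the words and grammatical roles from a Korean morphological analyzer and parser rather than from hand-labelled tuples.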