Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2006.13B.2.149

Document Summarization Based on Sentence Clustering Using Graph Division  

Lee Il-Joo (동원대학 모바일컨텐츠과)
Kim Min-Koo (아주대학교 정보 및 컴퓨터공학부)
Abstract
The main purpose of document summarization is to reduce the complexity of documents that are consisted of sub-themes. Also it is to create summarization which includes the sub-themes. This paper proposes a summarization system which could extract any salient sentences in accordance with sub-themes by using graph division. A document can be represented in graphs by using chosen representative terms through term relativity analysis based on co-occurrence information. This graph, then, is subdivided to represent sub-themes through connected information. The divided graphs are types of sentence clustering which shows a close relationship. When salient sentences are extracted from the divided graphs, summarization consisted of core elements of sentences from the sub-themes can be produced. As a result, the summarization quality will be improved.
Keywords
Document Summarization; Term Relation; Sentence Clustering; Graph Division;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 김철언, 그래프론과 알고리듬, POSTEC PRESS, 1997
2 Skorochodko,E.F., 'Adaptive method of automatic abstracting and indexing,' Information Processing 71: Processing of the IFIP Congress 71, ed. by Freiman, pp.1179-1182, NorthHolland Publishing Company, 1972
3 C.J.van Rijsbergen., 'A Theoritical Basis for the Use of Co-occurrence Data in Information Retrieval,' Journal of Documentation.Vol.33:106-119,1977   DOI   ScienceOn
4 김재훈, 김준홍, '도합유사도를 이용한 한국어 문서요약 시스템' 한국 인지과학회 논문지 제12권 제1.2호, pp.35-42, 2001   과학기술학회마을
5 Salton.G., Singhal.A., Mitra.M., and Buckly.C., 'Automatic text structuring and summarization,' Information Processing and Management, Vol.33, No.2, 1997   DOI   ScienceOn
6 박성배, 장병탁, 'Co-Trained Support Vector Machines을 이용한 문서분류' 한국정보과학회 봄 학술발표 논문집 (B), 제29권 1호, pp. 259-261, 2002   과학기술학회마을
7 Julian Kupiec, Jan Pedersen, and Francine Chen, 'A Trainable Document Summarizer,' In Proceedings of ACM-SIGIR'95, pp.68-73,1995   DOI
8 Barzilay, Regina and Michael Elhadad, 'Lexical Chains for Text Summarization', Master's thesis, Ben-Gurion University, 1997
9 류제, '단어의 공기 관계 그래프를 이용한 문서의 핵심 문장 ?추출에 관한 연구' 호서대학교 벤처전문대학원 석사학위논문, 2000
10 정영미, 최상희, '문장 클러스터링에 기반한 자동요약 모형' 한국정보관리학회지, 제18권 3호, pp.159-178, 2001   과학기술학회마을
11 류동원, 이종혁, '단어공기정보를 이용한 자동화 문서요약' 한국정보과학회학술논문발표지 27권 1호, pp.345-347, 2000   과학기술학회마을
12 Inderjeet Mani, Automatic Summarization, John Benjarnins Publishing Co., 2001
13 Marti A Hearst, 'Multi-paragraph segmentation of expository text,' In Proceedings of the 32nd Annual Meeting of the ACL, June, 1994   DOI
14 Morris. A.H., Kasper and G.M, Adams. D.A., 'The effects and limitations of automated text condensing on reading comprehension performance,' Information systems Research, 3(1), pp.17-35, 1992   DOI
15 http://www.itl.nistgov/iaui/894.02/Irelated_projects/tipster_sumnac
16 http://www.isi.edu/-cyl/ROUGE/
17 Mary McKenna, Elizabeth D.Liddy, 'Evaluation of Automatic Text Summarization Across Multiple Documents,' MAl Symposium, 1998
18 Sparck Jones, K., 'Automatic summarizing.factors and directions,' Advances in Automatic Text Summarization, pp.1-12, The MIT Press. 1999
19 H.P.Edmundson, 'New Methods in Automatic Extracting,' Journal of the ACM, 16(2), 1969   DOI