Browse > Article
http://dx.doi.org/10.5391/JKIIS.2012.22.5.656

Document Summarization Using Mutual Recommendation with LSA and Sense Analysis  

Lee, Dong-Wook (성균관대학교 컴퓨터공학과)
Baek, Seo-Hyeon (성균관대학교 컴퓨터공학과)
Park, Min-Ji (경희대학교 컴퓨터공학과)
Park, Jin-Hee (성균관대학교 컴퓨터공학과)
Jung, Hye-Wuk (성균관대학교 컴퓨터공학과)
Lee, Jee-Hyong (성균관대학교 컴퓨터공학과)
Publication Information
Journal of the Korean Institute of Intelligent Systems / v.22, no.5, 2012 , pp. 656-662 More about this Journal
Abstract
In this paper, we describe a new summarizing method based on a graph-based and a sense-based analysis. In the graph-based analysis, we convert sentences in a document into word vectors and calculate the similarity between each sentence using LSA. We reflect this similarity of sentences and the rarity scores of words in sentences to define weights of edges in the graph. Meanwhile, in the sense-based analysis, in order to determine the sense of words, subjectivity or objectivity, we built a database which is extended from the golden standards using Wordnet. We calculate the subjectivity of sentences from the sense of words, and select more subjective sentences. Lastly, we combine the results of these two methods. We evaluate the performance of the proposed method using classification games, which are usually used to measure the performances of summarization methods. We compare our method with the MS-Word auto-summarization, and verify the effectiveness of ours.
Keywords
Document Summarization; Mutual Recommendation; Latent Semantic Analysis; Sense Analysis;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Mihalcea, "Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization," In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004.
2 Jade Goldsteiny, Summarizing Text Documents: Sentence Selection and Evaluation Metrics, Language Technologies Institute Carnegie Mellon University, 1999.
3 Scott Deerwester, "Indexing by Latent Semantic Analysis," Journal of the American Society for Information Science, 1990.
4 G. A. Miller, "WordNet: An online lexical database," Int. J . Lexicograph, 1990.
5 Word frequency list based on Project Gutenberg available at : http://en.wiktionary.org/wiki/Wiktionary:Frequency_lists, 2012.
6 F. Su, K. Markert, From Words to Senses: A case Study of Subjectivity Recognition, School of Computing University of Leeds, 2008.
7 김영택 외, 자연언어처리, 생능출판사, 2003.