Browse > Article
http://dx.doi.org/10.6109/jkiice.2017.21.11.2133

XML Document Keyword Weight Analysis based Paragraph Extraction Model  

Lee, Jongwon (Department of Computer Engineering, Paichai University)
Kang, Inshik (Korea University of Media Arts)
Jung, Hoekyung (Department of Computer Engineering, Paichai University)
Abstract
The analysis of existing XML documents and other documents was centered on words. It can be implemented using a morpheme analyzer, but it can classify many words in the document and cannot grasp the core contents of the document. In order for a user to efficiently understand a document, a paragraph containing a main word must be extracted and presented to the user. The proposed system retrieves keyword in the normalized XML document. Then, the user extracts the paragraphs containing the keyword inputted for searching and displays them to the user. In addition, the frequency and weight of the keyword used in the search are informed to the user, and the order of the extracted paragraphs and the redundancy elimination function are minimized so that the user can understand the document. The proposed system can minimize the time and effort required to understand the document by allowing the user to understand the document without reading the whole document.
Keywords
Compression; Document Analysis; Keyword Frequency; Keyword Weight; Paragraph Extraction;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 B. J. Noh, Z. S. Xu, J. G. Lee, D. H. Park, Y. H. Chung, "Keyword Network Based Repercussion Effect Analysis of Foot-and-Mouth Disease Using Online News," Korean Institute of Information Technology, vol. 14, no. 9, pp. 143-152, Sep. 2016.
2 S. J. Choi, J. W. Lee, "A Morphological Analysis Method of Prediction place-Event Performance by Online News Titles," Korea Association of Community Welfare Studies, vol. 21, no. 1, pp. 15-32, Feb. 2016.
3 H. S. Ha, B. Y. Hwang, "Keyword Filtering about Disaster and the Method of Detecting Area in Detecting Real-Time Event Using Twitter," Korea Information Processing Society, vol. 5, no. 7, pp. 345-350, Jul. 2016.
4 J. C. Shin, C. Y. Ock, "A Korean Morphological Analyzer using a Pre-analyzed Partial Word-phrase Dictionary," Korean Institute of Information Scientists and Engineering, vol. 39, no. 5, pp. 415-424, May 2012.
5 S. H. Na, J. I. Kim, E. J. Lee, P. K. Kim, "A Study on the Short Text Categorization using SNS Feature Informations," Korean Institute of Information Technology, vol. 14, no. 6, pp. 159-165, Jun. 2016.
6 H. Y. Lee, J. S. Lee, B. D. Kang, S. W. Yang, "Functional Expansion of Morphological Analyzer Based on Longest Phrase Matching For Efficient Korean Parsing," Digital Contents Society, vol. 17, no. 3, pp. 203-210, Jun. 2016.   DOI
7 J. Y. Lee, J. H. Lee, Y. H. Park, "A design and implementation of the management system for number of keyword searching results using Google searching engine," The Korea Institute of Information and Communication Engineering, vol. 20, no. 5, pp. 880-886, May 2016.   DOI