Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2006.13D.3.309

Concept Extraction Technique from Documents Using Domain Ontology  

Mun Hyeon-Jeong (창원대학교 컴퓨터공학과)
Woo Yong-Tae (창원대학교 컴퓨터공학과)
Abstract
We propose a novel technique to categorize XML documents and extract a concept efficiently using domain ontology. First, we create domain ontology that use text mining technique and statistical technique. We propose a DScore technique to classify XML documents by using the structural characteristic of XML document. We also present TScore technique to extract a concept by comparing the association term set of domain ontology and the terms in the XML document. To verify the efficiency of the proposed technique, we perform experiment for 295 papers in the computer science area. The results of experiment show that the proposed technique using the structural information in the XML documents is more efficient than the existing technique. Especially, the TScore technique effectively extract the concept of documents although frequency of term is few. Hence, the proposed concept-based retrieval techniques can be expected to contribute to the development of an efficient ontology-based knowledge management system.
Keywords
Ontology; Knowledge Management System; KDD; Concept-based Retrieval;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 S. Decker, M. Erdmann, D. Fensel, and R. Studer, Ontobroker: Ontology Based Access to Distributed and Semi-Structured Information, In R. Meersman et al., editors, Database Semantics: Semantic Issues in Multimedia Systems, Kluwer Academic Publisher, pp.351-369, 1999
2 K. Knight and S. Luk, 'Building a Large-Scale Knowledge Base for Machine Translation,' Proc. of the AAAI, 1994
3 Cycorp, 'Cyc Knowledge Server,' http://www.cyc.com, 2002
4 H. J. Mun, J. Y. Lee and Y. T. Woo, 'A Domain Ontology Creation Method for Ontology-based Knowledge Management Model,' Int'l Journal of ACIS, pp.99-108, 2005
5 goRank,.com, 'Google Ontology Analysis,' http://www.gorank.com/researeh/google_ontology_analysis.php, 2004
6 D. L. McGuinness, 'Ontological Issues for Knowledge-Enhanced Search,' Proc. of the Formal Ontology in Information Systems, pp.302-316, 1998
7 최옥경, 한상용, '자동화된 통합 프레임워크를 위한 시맨틱 웹 기반의 정보 검색 시스템,' 한국정보처리학회 논문지, Vol.13, No.1, pp.129-136, 2006   과학기술학회마을   DOI
8 E. Hyvonen and et al 'Finish Museum on the Semantic Web User's Perspective,' Proc. of the Museums and the Web, 2004
9 A. Maedche, 'A Machine Learning Perspective for the Semantic Web,' Proc. of the Semantic Web Working Symposium, 2001
10 김명숙, 공용해, '온톨로지-DTD 정합에 의한 XML 질의 확장,' 한국정보처리학회 논문지, Vol.12, No.5 pp.773-780, 2005   과학기술학회마을   DOI
11 이무훈, 조현규, 조현성, 조성훈, 장창복, 최의인, '웹 문서의 의미적 연관성 기술을 위한 온톨로지 에디터,' 한국정보처리학회 논문지, Vol.12, No.5, pp.881-888, 2005   과학기술학회마을   DOI
12 오삼균, 'Web Ontology Languages와 그 활용에 관한 고찰,' 데이터베이스연구학회지, Vol.18, No.3, pp.63-79, 2002
13 Y. Sure and et al., On-To-Knowledge: Semantic Web Enabled Knowledge Management, J. Wiley and Sons, 2002
14 G. A. Miller, 'WordNet : A Lexical Database for English,' Communication of the ACM, Vol.38, No.11, pp.39-41, 1995   DOI   ScienceOn
15 E. Hyvonen, S. Saarela, K. Viljanen, 'Ontogator: combining view- and ontology-based search with semantic browsing,' Proc. of the XML Finland Conference, 2003
16 P. V. Benjamins, D. Fensel, and A. G. Perez, 'Knowledge Management through Ontologies,' Proc. of the Practical Aspects of Knowledge Management, 1998
17 R. Benjamins and D. Fensel, 'The Ontological Engineering Initiative$(KA)^{2″}$,' Proc. of Formal Ontologies in Information Systems, pp.287-301, 1998