Browse > Article
http://dx.doi.org/10.3745/KTSDE.2018.7.8.281

A Method for Extracting Relationships Between Terms Using Pattern-Based Technique  

Kim, Young Tae ((주)케이엔씨 기업부설연구소)
Kim, Chi Su (공주대학교 컴퓨터공학부)
Publication Information
KIPS Transactions on Software and Data Engineering / v.7, no.8, 2018 , pp. 281-286 More about this Journal
Abstract
With recent increase in complexity and variety of information and massively available information, interest in and necessity of ontology has been on the rise as a method of extracting a meaningful search result from massive data. Although there have been proposed many methods of extracting the ontology from a given text of a natural language, the extraction based on most of the current methods is not consistent with the structure of the ontology. In this paper, we propose a method of automatically creating ontology by distinguishing a term needed for establishing the ontology from a text given in a specific domain and extracting various relationships between the terms based on the pattern-based method. To extract the relationship between the terms, there is proposed a method of reducing the size of a searching space by taking a matching set of patterns into account and connecting a join-set concept and a pattern array. The result is that this method reduces the size of the search space by 50-95% without removing any useful patterns from the search space.
Keywords
Ontology; Terms; Relationship; Extraction; Join-Set; Pattern;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Y. T. Kim, J. H. Lim, and C. S. Kim, "UML changes for efficient ontology development," Journal of the Korea Academia-Industrial Cooperation Society, Vol.9, No2, pp.415-421, 2008.   DOI
2 B. Smith, and C. Fellbaum, "Medical WordNet: a new methodology for the construction and validation of information resources for consumer health," In Proceedings of the 20th International Conference on Computational Linguistics, pp.371, 2004.
3 C. F. Baker, C. J. Fillmore, and J. B. Lowe, "The Berkeley FrameNet Project," In Proceedings of the 17th International Conference on Computational Linguistics, pp.86-90, 1998.
4 J. Gonzalo, F. Verdejo, I. Chugur, and J. Cigarran, "Indexing with WordNet synsets can improve Text Retrieval," arXiv preprint cmp-lg/9808002., 1998.
5 M. Q. Stearns, C. Price, K. A. Spackman, and A. Y. Wang, "SNOMED clinical terms: Overview of the development process and project status," In Proceedings of AMIA Symposium, pp.662-666, 2001.
6 X. Yang and J. Su, "Coreference resolution using semantic relatedness information from automatically discovered patterns," In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.528-535, 2007.
7 O. Bodenreider, A. Burgun and T. C. Rindflesch, "Lexically suggested hyponymic relations among medical terms and their representation in the UMLS," In TIA'2001: Proceedings of Terminology and Artificial Intelligence, pp.11-21, 2001.
8 M. A. Hearst, "Automatic acquisition of hyponyms from large text corpora," In Proceedings of the 14th Conference on Computational Linguistics, pp.539-545, 1992.
9 P. Cimiano and S. Staab, "Learning by Googling," ACM SIGKDD Explorations Newsletter, Vol.6, No.2, pp.24-33, 2004.   DOI
10 S. Soderland, "Learning information extraction rules for semi-structured and free text," Machine Learning, Vol.34, No.3, pp.233-272, 1999.   DOI