DOI QR코드

DOI QR Code

워드넷과 구글에 기반한 온톨로지 개체의 일반화

Generalization of Ontology Instances Based on WordNet and Google

  • 강신재 (대구대학교 정보통신대학 컴퓨터.IT공학부) ;
  • 강인수 (경성대학교 멀티미디어대학 컴퓨터정보학부)
  • 투고 : 2008.11.14
  • 심사 : 2009.05.28
  • 발행 : 2009.06.25

초록

본 논문은 온톨로지의 지식을 확장하기 위하여 웹 페이지 등 텍스트에서 추출된 온톨로지 개체(ontology instances)를 일반화하는 방법을 제시한다. 이를 위해서는 단어 의미 중의성 해소 과정이 필수적인데, 구글, 워드넷과 같은 오픈 API와 어휘 리소스를 이용하여 비교사학습 방법으로 해결하는 방법을 제안한다. 실험 결과 기존 연구에 비해 15.8%의 성능 향상을 얻을 수 있었다.

In order to populate ontology, this paper presents a generalization method of ontology instances, extracted from texts and web pages, by using unsupervised learning techniques for word sense disambiguation, which uses open APIs and lexical resources such as Google and WordNet. According to the experimental results, our method achieved a 15.8% improvement over the previous research.

키워드

참고문헌

  1. M. A. Hearst, 'Automatic acquisition of hyponyms from large text corpora,' In Proceedings of the 14th International Conference on Computational Linguistics, 1992
  2. R. Snow, D. Jurafsky, and A. Y. Ng, 'Learning syntactic patterns for automatic hypernym discovery,' In Proceedings of Advances in Neural Information Processing Systems, 2004
  3. D. Faure, and C. Nedellec, 'Knowledge acquisition of predicate argument structures from technical texts using machine learning: the system ASIUM,' In Proceedings of the European Knowledge Acquisition Workshop (EKAW), 1999
  4. P. Cimiano, and S. Staab, 'Learning concept hierarchies from text with a guided agglomerative clustering algorithm,' In Proceedings of ICML-2005 Workshop on Learning and Extending Ontologies by using Machine Learning Methods, 2005
  5. P. Buitelaar, and P. Cimiano, 'Ontology learning from text,' Tutorial Notes at 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-2006), 2006
  6. I. P. Klapaftis, and S. Manandhar, 'Google & WordNet based word sense disambiguation,' In Proceedings of ICML-2005 Workshop on Learning and Extending Ontologies by using Machine Learning Methods, 2005
  7. P. M. Ryu, and K. S. Choi, 'An Information-theoretic approach to taxonomy extraction for ontology learning,' In Ontology Learning from Text: Methods, Evaluation and Applications, Frontiers in Artificial Intelligence and Applications, IOS Press, Amsterdam, Vol. 123, July 2005
  8. A. Wierzbicka, Semantic Primitives. Frankfurt a. M.: Athen$\ddot{u}$um-Verl, 1972
  9. C. Fellbaum, WordNet: An Electronic Lexical Database (Language, Speech, Communication), MIT Press, May 1998
  10. E. Agirre, and P. Edmonds, Word Sense Disambiguation: Algorithms and Applications, Springer, 2006
  11. G. Pirr$\grave{o}$, N. Seco, 'Design, Implementation and Evaluation of a New Similarity Metric Combining Feature and Intrinsic Information Content'. ODBASE 2008, LNCS, Springer Verlag, 2008
  12. J. Jiang, and D. Conrath, 'Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy,' In Proc. ROCLING X, 1997
  13. D. Lin, 'An Information-Theoretic Definition of Similarity,' In Proc. of Conf. on Machine Learning, pp. 296-304, 1998
  14. P. Resnik, 'Information Content to Evaluate Semantic Similarity in a Taxonomy,' In Proc. of IJCAI 1995, pp. 448-453, 1995
  15. A. Tversky, Features of similarity, Psychological Review, Vol. 84, No. 2, pp. 327-352, 1977 https://doi.org/10.1037/0033-295X.84.4.327

피인용 문헌

  1. Conflict Resolution of Patterns for Generating Linked Data From Tables vol.24, pp.3, 2014, https://doi.org/10.5391/JKIIS.2014.24.3.285