Range Detection of Wa/Kwa Parallel Noun Phrase by Alignment method

정렬기법을 활용한 와/과 병렬명사구 범위 결정

  • 최용석 (한국표준과학연구원 지식정보팀) ;
  • 신지애 (정보통신대학교 공학부) ;
  • 최기선 (한국과학기술원 전산학과) ;
  • 김기태 (한국표준과학연구원 지식정보팀) ;
  • 이상태 (한국표준과학연구원 지식정보팀)
  • Published : 2008.10.24

Abstract

In natural language, it is common that repetitive constituents in an expression are to be left out and it is necessary to figure out the constituents omitted at analyzing the meaning of the sentence. This paper is on recognition of boundaries of parallel noun phrases by figuring out constituents omitted. Recognition of parallel noun phrases can greatly reduce complexity at the phase of sentence parsing. Moreover, in natural language information retrieval, recognition of noun with modifiers can play an important role in making indexes. We propose an unsupervised probabilistic model that identifies parallel cores as well as boundaries of parallel noun phrases conjoined by a conjunctive particle. It is based on the idea of swapping constituents, utilizing symmetry (two or more identical constituents are repeated) and reversibility (the order of constituents is changeable) in parallel structure. Semantic features of the modifiers around parallel noun phrase, are also used the probabilistic swapping model. The model is language-independent and in this paper presented on parallel noun phrases in Korean language. Experiment shows that our probabilistic model outperforms symmetry-based model and supervised machine learning based approaches.

Keywords