• Title/Summary/Keyword: Lexical Ambiguity

Noun Sense Disambiguation Based-on Corpus and Conceptual Information (말뭉치와 개념정보를 이용한 명사 중의성 해소 방법)

  • 이휘봉;허남원;문경희;이종혁
    • Korean Journal of Cognitive Science / v.10 no.2 / pp.1-10 / 1999
  • This paper proposes a noun sense disambiguation method based on corpus and conceptual information. Previous research has restricted the use of linguistic knowledge to the lexical level. Because the knowledge extracted from a corpus is stored per word, such methods require a large amount of space and still suffer from a low recall rate. In contrast, we resolve noun sense ambiguity by using concept co-occurrence information extracted from an automatically sense-tagged corpus. In one experimental evaluation, the method achieved an average precision of 82.4%, an improvement of 14.6% over the baseline. Considering that the test corpus is completely unrelated to the learning corpus, this is a promising result.
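
The idea of scoring noun senses by concept co-occurrence can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy thesaurus, the concept codes, the training data, and the simple summed-count score are hypothetical and are not the scoring function or resources used in the paper.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy thesaurus: each word maps to its candidate concept codes.
THESAURUS = {
    "bank": ["FINANCE", "LANDFORM"],
    "loan": ["FINANCE"],
    "money": ["FINANCE"],
    "river": ["LANDFORM"],
    "water": ["LIQUID"],
}

def count_concept_pairs(tagged_sentences):
    """Count unordered concept pairs that co-occur within each sentence."""
    counts = Counter()
    for concepts in tagged_sentences:
        # Sorting makes every pair (a, b) satisfy a <= b, so order is irrelevant.
        for a, b in combinations(sorted(concepts), 2):
            counts[(a, b)] += 1
    return counts

def pair_count(counts, a, b):
    """Look up the co-occurrence count of two concepts in either order."""
    return counts[tuple(sorted((a, b)))]

def disambiguate(word, context, counts):
    """Pick the sense whose concept co-occurs most with the context concepts."""
    def score(sense):
        return sum(
            pair_count(counts, sense, concept)
            for ctx_word in context
            for concept in THESAURUS.get(ctx_word, [])
        )
    return max(THESAURUS[word], key=score)

# Hypothetical sense-tagged corpus, already reduced to concept codes.
TAGGED = [
    ["FINANCE", "FINANCE"],
    ["FINANCE", "FINANCE", "FINANCE"],
    ["LANDFORM", "LIQUID"],
    ["LANDFORM", "LANDFORM"],
]
COUNTS = count_concept_pairs(TAGGED)

river_sense = disambiguate("bank", ["river", "water"], COUNTS)
loan_sense = disambiguate("bank", ["loan", "money"], COUNTS)
```

Because the counts are kept per concept pair rather than per word pair, the table stays small and can still score context words that never appeared next to the target noun in training, which is the space and recall argument the abstract makes.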

Automatic Construction of Generalized Lexical Information for Syntactic Ambiguity Resolution (구문 분석에서의 중의성 해소를 위한 일반화된 어휘정보의 자동 구축 및 적용)

  • Chung, Hoo-Jung;Hwang, Young-Sook;Kwak, Yong-Jae;Park, So-Young;Rim, Hae-Chang
    • Annual Conference on Human and Language Technology / 1998.10c / pp.269-275 / 1998
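  • It is well known that lexical information is useful for resolving ambiguity in syntactic analysis. However, existing methods for building lexical information either require a great deal of manual work or, when the information is built automatically, suffer from a severe data sparseness problem because they use the words themselves as-is. This paper proposes a method that uses a raw corpus and a thesaurus to automatically construct generalized, conceptual-level predicate-argument lexical information for syntactic ambiguity resolution, and that applies this information to a parser. When the generalized lexical information built with the proposed method was applied, through a parser, to an experiment on determining the heads of noun phrases, accuracy improved from 85.9% to 91.5%. In addition, an experiment on determining unknown cases showed a case determination success rate of 86.32%.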

A Sentence Reduction Method using Part-of-Speech Information and Templates (품사 정보와 템플릿을 이용한 문장 축소 방법)

  • Lee, Seung-Soo;Yeom, Ki-Won;Park, Ji-Hyung;Cho, Sung-Bae
    • Journal of KIISE:Software and Applications / v.35 no.5 / pp.313-324 / 2008
  • Sentence reduction is an information compression process that removes extraneous words and phrases while retaining the basic meaning of the original sentence. Most research on sentence reduction has required a large number of lexical and syntactic resources and has focused on extracting or removing extraneous constituents of the sentence, such as words, phrases, and clauses, through a complicated parsing process. However, this research has some problems. First, the lexical resources that can be obtained from learning data are very limited. Second, it is difficult to reduce sentences in languages that lack a reliable syntactic parsing method, because of the ambiguity and exceptional expressions found in sentences. To solve these problems, we propose a sentence reduction method that uses templates and POS (part-of-speech) information without a parsing process. In the proposed method, we create a new sentence using both Sentence Reduction Templates, which decide the form of the reduced sentence, and Grammatical POS-based Reduction Rules, which compose the grammatical sentence structure. In addition, we use the Viterbi algorithm with an HMM (Hidden Markov Model) to avoid the exponential calculation problem that arises when applying the Sentence Reduction Templates. Finally, our experiments show that the proposed method achieves acceptable results in comparison with previous sentence reduction methods.
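
The role of the Viterbi algorithm in avoiding exponential enumeration can be shown with a generic sketch. The abstract does not specify the HMM, so the states (`KEEP`, `DROP`), the POS observations, and all probabilities below are hypothetical toy values, not the authors' model. The point is only that dynamic programming finds the best state sequence in time linear in sentence length instead of scoring every possible sequence.

```python
import math

def viterbi(observations, states, log_start, log_trans, log_emit):
    """Return the most likely hidden-state sequence for the observations."""
    # best[t][s]: highest log-probability of any path ending in state s at time t.
    best = [{s: log_start[s] + log_emit[s][observations[0]] for s in states}]
    # back[t][s]: predecessor state on that best path.
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p] + log_trans[p][s]) for p in states),
                key=lambda pair: pair[1],
            )
            best[t][s] = score + log_emit[s][observations[t]]
            back[t][s] = prev
    # Follow the back-pointers from the best final state.
    path = [max(states, key=lambda s: best[-1][s])]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1]

def logs(table):
    """Convert a nested dict of probabilities to log-probabilities."""
    return {k: {j: math.log(p) for j, p in row.items()} for k, row in table.items()}

# Hypothetical toy model: decide per word whether to keep or drop it.
STATES = ["KEEP", "DROP"]
LOG_START = {"KEEP": math.log(0.6), "DROP": math.log(0.4)}
LOG_TRANS = logs({
    "KEEP": {"KEEP": 0.7, "DROP": 0.3},
    "DROP": {"KEEP": 0.4, "DROP": 0.6},
})
LOG_EMIT = logs({
    "KEEP": {"NOUN": 0.5, "VERB": 0.4, "ADV": 0.1},
    "DROP": {"NOUN": 0.1, "VERB": 0.1, "ADV": 0.8},
})

decisions = viterbi(["NOUN", "ADV", "VERB"], STATES, LOG_START, LOG_TRANS, LOG_EMIT)
print(decisions)  # ['KEEP', 'DROP', 'KEEP']
```

For a sentence of n words and k states, this table-filling approach does on the order of n·k² work, whereas scoring every candidate sequence would require k^n evaluations.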

Generalized LR Parser with Conditional Action Model(CAM) using Surface Phrasal Types (표층 구문 타입을 사용한 조건부 연산 모델의 일반화 LR 파서)

  • 곽용재;박소영;황영숙;정후중;이상주;임해창
    • Journal of KIISE:Software and Applications / v.30 no.1_2 / pp.81-92 / 2003
  • Generalized LR (GLR) parsing is an enhanced LR parsing method that overcomes the limitation of the one-way linear stack of the traditional LR parser by using a graph-structured stack, and it has served as a firm starting point for other variants of natural language parsing equipped with various mechanisms. In this paper, we propose a Conditional Action Model that can solve the problems of conventional probabilistic GLR methods. Previous probabilistic GLR parsers have used relatively limited contextual information for disambiguation because of the high complexity of the internal GLR stack. Our proposed model uses Surface Phrasal Types, which represent the structural characteristics of the parse, as additional contextual information, so that more specific structural preferences can be reflected in the parser. Experimental results show that our GLR parser with the proposed Conditional Action Model outperforms the previous methods by about 6-7% without any lexical information, and that our model can utilize the rich stack information for syntactic disambiguation in a probabilistic LR parser.

Combinatory Categorial Grammar for the Syntactic, Semantic, and Discourse Analyses of Coordinate Constructions in Korean (한국어 병렬문의 통사, 의미, 문맥 분석을 위한 결합범주문법)

  • Cho, Hyung-Joon;Park, Jong-Cheol
    • Journal of KIISE:Software and Applications / v.27 no.4 / pp.448-462 / 2000
  • Coordinate constructions in natural language pose a number of difficulties to natural language processing units, due to the increased complexity of syntactic analysis, the syntactic ambiguity of the involved lexical items, and the apparent deletion of predicates in various places. In this paper, we address the syntactic characteristics of the coordinate constructions in Korean from the viewpoint of constructing a competence grammar, and present a version of combinatory categorial grammar for the analysis of coordinate constructions in Korean. We also show how to utilize a unified lexicon in the proposed grammar formalism in deriving the sentential semantics and associated information structures as well, in order to capture the discourse functions of coordinate constructions in Korean. The presented analysis conforms to the common wisdom that coordinate constructions are utilized in language not simply to reduce multiple sentences to a single sentence, but also to convey the information of contrast. Finally, we provide an analysis of sample corpora for the frequency of coordinate constructions in Korean and discuss some problematic cases.

Criticism of Landscape Urbanism - Focused on Internal Structures of the Discourse - (랜드스케이프 어바니즘의 비판적 견해에 대한 고찰 - 담론의 내재적 체계를 중심으로 -)

  • Kim, Youngmin
    • Journal of the Korean Institute of Landscape Architecture / v.43 no.2 / pp.87-104 / 2015
  • As the influence of Landscape Urbanism has grown, criticism of the discourse has also increased. A study of critical opinions on Landscape Urbanism is necessary to fully comprehend the theoretical structure of the discourse and its limitations. This study introduces the concepts of Intension and Extension, which are used in the fields of logic and semiotics, as an analytical tool for interpreting criticisms based on different views in a more objective and synthetic way. After examining the development of criticisms of Landscape Urbanism, 30 texts containing important critiques of the theory were selected and analyzed. Criticisms can be classified as internal or external according to the specific topics they engage with. The scope of this study is limited to internal criticism. The internal criticisms of Landscape Urbanism are re-categorized into the topics of theory, practice, and the relation between theory and practice. Vagueness of concepts and errors in concepts are the two types of criticism related to the issue of theory. Lexical Ambiguity and Intensional Vagueness are the main causes of conceptual vagueness in Landscape Urbanism. Conceptual vagueness related to the problem of redefining an existing concept by expanding its meaning reveals a structural dilemma. Three types of criticism fall under the topic of practice: absence of practical results, form-oriented practice, and ambiguous identity in practical results. Ambiguous identity is caused by Extensional Vagueness, which allows borderline cases. Because these borderline cases overlap with the extension of landscape architecture, it is hard to differentiate projects of Landscape Urbanism from those of conventional landscape architecture. Most criticisms of the relation between theory and practice question the practical method. Two types of criticism engage with the topic of the practical method: errors in practical methods and absence of practical methods. The absence of practical methods is a fundamental problem of Landscape Urbanism that is hard to solve with the proposed solutions. However, these structural problems are not only a weak point but also a factor that can demonstrate the potential to expand the scope of Landscape Urbanism. Building on the results of this study, internal criticisms of Landscape Urbanism should be examined in future studies in order to predict the next direction of Landscape Urbanism.