• Title/Summary/Keyword: syntactic information

Search Result 422, Processing Time 0.028 seconds

Using Syntactic Unit of Morpheme for Reducing Morphological and Syntactic Ambiguity (형태소 및 구문 모호성 축소를 위한 구문단위 형태소의 이용)

  • Hwang, Yi-Gyu;Lee, Hyun-Young;Lee, Yong-Seok
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.7
    • /
    • pp.784-793
    • /
    • 2000
  • The conventional morphological analysis of Korean language presents various morphological ambiguities because of its agglutinative nature. These ambiguities cause syntactic ambiguities and they make it difficult to select the correct parse tree. This problem is mainly related to the auxiliary predicate or bound noun in Korean. They have a strong relationship with the surrounding morphemes which are mostly functional morphemes that cannot stand alone. The combined morphemes have a syntactic or semantic role in the sentence. We extracted these morphemes from 0.2 million tagged words and classified these morphemes into three types. We call these morphemes a syntactic morpheme and regard them as an input unit of the syntactic analysis. This paper presents the syntactic morpheme is an efficient method for solving the following problems: 1) reduction of morphological ambiguities, 2) elimination of unnecessary partial parse trees during the parsing, and 3) reduction of syntactic ambiguity. Finally, the experimental results show that the syntactic morpheme is an essential unit for reducing morphological and syntactic ambiguity.

  • PDF

A Study of Disambiguation Method To Improve The Syntactic Analysis System (구문 분석의 결과로 나타나는 구조의 모호성을 해결하기 위한 방법 연구)

  • Park, Yong Uk
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.4
    • /
    • pp.2764-2769
    • /
    • 2015
  • In this paper, we present a Korean syntactic analysis system which can generate all possible syntactic trees in a given sentence. Therefore, the number of syntactic trees by this syntactic analysis system can be increased exponentially. To solve this problem, we suggest a segmentation method and maximum connected unit in a segmentation. Maximum connected unit is a combined unit which contains all morphemes in a segmentation. According to the input sentence, it is possible one or more maximum connected unit in a segmentation. We extract 516 sentences to experiment randomly from the text book of Korean middle school. We could reduce about 28% of the number of syntactic trees.

Robust Syntactic Annotation of Corpora and Memory-Based Parsing

  • Hinrichs, Erhard W.
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 2002.02a
    • /
    • pp.1-1
    • /
    • 2002
  • This talk provides an overview of current work in my research group on the syntactic annotation of the T bingen corpus of spoken German and of the German Reference Corpus (Deutsches Referenzkorpus: DEREKO) of written texts. Morpho-syntactic and syntactic annotation as well as annotation of function-argument structure for these corpora is performed automatically by a hybrid architecture that combines robust symbolic parsing with finite-state methods ("chunk parsing" in the sense Abney) with memory-based parsing (in the sense of Daelemans). The resulting robust annotations can be used by theoretical linguists, who lire interested in large-scale, empirical data, and by computational linguists, who are in need of training material for a wide range of language technology applications. To aid retrieval of annotated trees from the treebank, a query tool VIQTORYA with a graphical user interface and a logic-based query language has been developed. VIQTORYA allows users to query the treebanks for linguistic structures at the word level, at the level of individual phrases, and at the clausal level.

  • PDF

An analysis and correction of the phonological and syntactic errors in korean dialogues for a robust dialogue system (견고한 대화시스템을 위한 한국어 대화체의 음운론적, 구문론적 오류 분석 및 복구)

  • 김영길;김한우;최병욱
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.34C no.5
    • /
    • pp.55-65
    • /
    • 1997
  • In many cases, a dialogue system can't extract the correct analysis information of a user's spoken utterance, because of its own ungrammatical components. Therefore, in order to perform a correct before it performs the syntactic processing. In this paper, we use a real dialogue corpus and classify these ungrammatical errors as 4 categories : phonological, syntactic, semantic errors that consist of speech reparis and inversions, and propose an algorithm to detect and correct the errors. In short, this paper proposes a method to detect and correct the speech repairs and inversions that are classified as the phonological and syntactic errors to implement a robust dialogue system. And, through the test of real dialogue data, this paper shows an efficiency of the proposed algorithm.

  • PDF

A Constraint-based Approach to English Gerunds

  • Kim, Yong-Beom
    • Language and Information
    • /
    • v.7 no.2
    • /
    • pp.117-137
    • /
    • 2003
  • This paper attempts to provide an alternative analysis involving categorical issues related to English gerunds. Especially, this paper rejects Maulof's approach that creates a new syntactic category gerund by mixing nominal and verbal categories. This paper identifies two syntactic structures in English gerunds: nominal gerunds and verbal gerunds. This distinction is based on syntactic and semantic characteristics of each type and is intended to account for the external distribution and endocentricity of the construction. Treating verbal gerunds syntactically as verbal categories, this paper proposes that English verbal gerunds act like other verbal categories such as infinitives whereas nominal gerunds behaves much like derived nominals. This paper proposes a few lexical rules that can take care of the two types of gerunds. The proposal can be extended to prepositional complements as well as sentential subject positions. This proposal not only resolves the issues involving distributional properties of the gerund construction but also captures syntactic parallelism observable between gerunds and other verbal constructions in English.

  • PDF

Interface between Morphology and Syntax: A Constraint-Based and Lexicalist Approach

  • Kim, Jong-Bok
    • Language and Information
    • /
    • v.2 no.1
    • /
    • pp.177-213
    • /
    • 1998
  • conflicting criterial used in identifying words have called the lexical integrity principle into question. That is, cases where the morphological word does not coincide with the syntactic word have notivated the syntactic view of word derivation, as pointed out by Bresnan and Mchombo(1995). Further, the implicit desire to make the clausal structure of Korean parallel to those posited for English(Chomsky 1991) and French(Pollock 1989) has also led most of the current literature on Korean morphology to claim that Korean verbal inflections head their own functional projections such as AgrP, TP, and MP im syntax. In this paper, I will first argue against such a syntactic view. After reviewing some basic properties of Korean verbal inflections, I will show that the evidence from mismatch phenomena supports the lexical integrity principle over the head-movement theories of word derivation. Then, I will propose a theory of lexical grammar which maintains the lexical integrity principle while retaining the effects of functional projections and syntactic movement.

  • PDF

Eojeol Syntactic Tag Prediction of Korean Text using Entropy Guided CRF (엔트로피 지도 CRF를 이용한 한국어 어절 구문태그 예측)

  • Oh, Jin-Young;Cha, Jeong-Won
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.15 no.5
    • /
    • pp.395-399
    • /
    • 2009
  • In this work, we describe the syntactic tag prediction system for Korean using the decision tree and CRFs. Generally they select features by their intuition. It depends on their prior knowledge. In this works, we combine features systematically using the decision tree. We also analyze errors and optimize features for the best performance. From the result of experiments, we can see that the proposed method is effective for the syntactic tag estimation and will be helpful for the syntactic analysis.

The Role of Distributional Cues in the Acquisition of Verb Argument Structures

  • Kim, Mee-Sook
    • Language and Information
    • /
    • v.7 no.1
    • /
    • pp.87-99
    • /
    • 2003
  • This paper investigates the role of input frequency in the acquisition of verb argument structures based on distributional information of a corpus of utterances derived from the English CHILDES database (MacWhinney 1993). It has been widely accepted that children successfully learn verb argument structures by innate language mechanisms, such as linking rules which connect verb meanings and its syntactic structures. In contrast, an approach to language acquisition called “statistical language learning” has currently claimed that children could succeed in acquiring syntactic structures in the absence of innate language mechanisms, making use of distributional properties of the input. In this paper, I evaluate the feasibility of the statistical learning in acquiring verb argument structures, based on distributional information about locative verbs in parental input. The naturalistic data allow us to investigate to what extent the statistical learning approach can and cannot help children succeed in learning the syntax of locative verbs. Based on the results of English database analysis, I show that there is rich statistical information for learning the syntactic possibilities of locative verbs in parental input, despite some limitations in the statistical learning approach.

  • PDF

Automatic Construction of Syntactic Relation in Lexical Network(U-WIN) (어휘망(U-WIN)의 구문관계 자동구축)

  • Im, Ji-Hui;Choe, Ho-Seop;Ock, Cheol-Young
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.10
    • /
    • pp.627-635
    • /
    • 2008
  • An extended form of lexical network is explored by presenting U-WIN, which applies lexical relations that include not only semantic relations but also conceptual relations, morphological relations and syntactic relations, in a way different with existing lexical networks that have been centered around linking structures with semantic relations. So, This study introduces the new methodology for constructing a syntactic relation automatically. First of all, we extract probable nouns which related to verb based on verb's sentence type. However we should decided the extracted noun's meaning because extracted noun has many meanings. So in this study, we propose that noun's meaning is decided by the example matching rule/syntactic pattern/semantic similarity, frequency information. In addition, syntactic pattern is expanded using nouns which have high frequency in corpora.