• Title/Summary/Keyword: Syntactic Analysis


Syntactic Category Prediction for Improving Parsing Accuracy in English-Korean Machine Translation (영한 기계번역에서 구문 분석 정확성 향상을 위한 구문 범주 예측)

  • Kim Sung-Dong
    • The KIPS Transactions:PartB / v.13B no.3 s.106 / pp.345-352 / 2006
  • A practical English-Korean machine translation system should translate long sentences quickly and accurately. An intra-sentence segmentation method has been proposed and has helped speed up syntactic analysis. This paper proposes a syntactic category prediction method using decision trees to obtain more accurate parsing results. In parsing with segmentation, each segment is parsed separately and the results are combined to generate the sentence structure. Syntactic category prediction helps select more accurate analysis structures after partial parsing, and thus can improve parsing accuracy. We construct features for predicting syntactic categories from the parsed Wall Street Journal corpus and generate decision trees. In the experiments, we compare performance against predictions made by human-built rules, trigram probabilities, and neural networks. We also show how much category prediction contributes to improving translation quality.

High Speed Korean Dependency Analysis Using Cascaded Chunking (다단계 구단위화를 이용한 고속 한국어 의존구조 분석)

  • Oh, Jin-Young;Cha, Jeong-Won
    • Journal of the Korea Society for Simulation / v.19 no.1 / pp.103-111 / 2010
  • Syntactic analysis is an important step in natural language processing. However, existing Korean syntactic analyzers are hard to use in practice because of their low performance and lack of robustness. We propose a new robust, high-speed, high-performance Korean syntactic analyzer using conditional random fields (CRFs). We treat parsing as a labeling problem and use cascaded chunking for Korean parsing. At each step, we label each Eojeol with syntactic information using CRFs. The CRFs use part-of-speech tags and Eojeol syntactic tags as features. Our experimental results using 10-fold cross validation show significant improvements in robustness, speed, and performance on long Korean sentences.

Connectivity Effects and Questions as Specificational Subjects

  • Yoo, Eun-Jung
    • Language and Information / v.10 no.2 / pp.21-45 / 2006
  • Connectivity effects have been central issues in dealing with specificational pseudoclefts (SPCs). While syntactic approaches motivate their analysis in order to explain connectivity effects in terms of a connected clause, these accounts have numerous problems, including a wide range of anti-connectivity effects that constitute crucial counterevidence. On the other hand, semantic accounts of connectivity effects treat BV and BT connectivity by independent interpretive mechanisms, providing a more fundamental explanation for connectivity effects. Yet existing semantic accounts have limitations in explaining syntactic properties and syntactic connectivity effects in SPCs, and in accounting for BV anti-connectivity effects in English. Focusing on BV connectivity, this paper explores how the relevant (anti-)connectivity facts can be accounted for by an analysis that provides both an elaborate syntactic analysis of SPCs and a semantic mechanism for bound anaphora. Based on Yoo's (2005) non-deletion-based, question-answer pair analysis of SPCs, this paper shows that a functional question analysis of a specificational subject, when combined with a theory of operator scope and a non-configurational condition on bound anaphora, can explain various BV (anti-)connectivity patterns in SPCs and related constructions.


An analysis and correction of the phonological and syntactic errors in Korean dialogues for a robust dialogue system (견고한 대화시스템을 위한 한국어 대화체의 음운론적, 구문론적 오류 분석 및 복구)

  • 김영길;김한우;최병욱
    • Journal of the Korean Institute of Telematics and Electronics C / v.34C no.5 / pp.55-65 / 1997
  • In many cases, a dialogue system cannot extract correct analysis information from a user's spoken utterance because the utterance contains ungrammatical components. Therefore, in order to perform a correct analysis, the system must detect and correct these errors before it performs syntactic processing. In this paper, we use a real dialogue corpus and classify these ungrammatical errors into 4 categories, including phonological, syntactic, and semantic errors, and propose an algorithm to detect and correct the errors. In short, this paper proposes a method to detect and correct the speech repairs and inversions that are classified as phonological and syntactic errors, in order to implement a robust dialogue system. Tests on real dialogue data show the efficiency of the proposed algorithm.


Eojeol Syntactic Tag Prediction of Korean Text using Entropy Guided CRF (엔트로피 지도 CRF를 이용한 한국어 어절 구문태그 예측)

  • Oh, Jin-Young;Cha, Jeong-Won
    • Journal of KIISE:Computing Practices and Letters / v.15 no.5 / pp.395-399 / 2009
  • In this work, we describe a syntactic tag prediction system for Korean that uses a decision tree and CRFs. Features are generally selected by intuition, which depends on the designer's prior knowledge. In this work, we combine features systematically using the decision tree. We also analyze errors and optimize the features for the best performance. The experimental results show that the proposed method is effective for syntactic tag estimation and will be helpful for syntactic analysis.

An Analysis of Noun-modifying Adverbs for Structural Disambiguation (구조적 중의성 해결을 위한 명사 수식 부사 연구)

  • Hwang, Seon Yeong;Lee, Gong Ju
    • Korean Journal of Cognitive Science / v.13 no.4 / pp.42-42 / 2002
  • In Korean, an adverb has generally been defined as a word that modifies verbs or adjectives, but some adverbs can also modify nouns. Such adverbs complicate structural analysis and therefore require special handling by a syntactic parser. In this paper, we categorize noun-modifying adverbs and characterize them from the standpoint of syntactic analysis. We also propose a method for handling noun-modifying adverbs to improve the accuracy of syntactic analysis. With the proposed method, the parser's accuracy increases from 81.9% to 83.6% on the test corpus.

An Analysis of Noun-modifying Adverbs for Structural Disambiguation (구조적 중의성 해결을 위한 명사 수식 부사 연구)

  • 황선영;이공주
    • Korean Journal of Cognitive Science / v.13 no.4 / pp.43-53 / 2002
  • In Korean, an adverb has generally been defined as a word that modifies verbs or adjectives, but some adverbs can also modify nouns. Such adverbs complicate structural analysis and therefore require special handling by a syntactic parser. In this paper, we categorize noun-modifying adverbs and characterize them from the standpoint of syntactic analysis. We also propose a method for handling noun-modifying adverbs to improve the accuracy of syntactic analysis. With the proposed method, the parser's accuracy increases from 81.9% to 83.6% on the test corpus.


Application of Natural Language Processing(1) : Understanding of the Hangul Sentences for Simple Computer Manipulation (자연어 활용(1) : 간편한 컴퓨터 조작을 위한 한글 문장 이해에 관한 연구)

  • 장덕성;이동애
    • Korean Journal of Cognitive Science / v.3 no.1 / pp.41-60 / 1991
  • Most PC users operate the computer with a few commands that are familiar to them. However, by using Hangul sentences instead of DOS commands, optimal commands can be generated and flexibility can be provided. For this purpose, we study a method for converting an input sentence into DOS commands by means of morphological analysis, syntactic analysis, semantic analysis, and conceptual analysis. Tabular parsing is used in morphological analysis. Case grammar is used in syntactic and semantic analysis. The meaning of a sentence is represented by a semantic network, from which we can generate a sequence of DOS commands.

Continuous Speech Recognition using Syntactic Analysis and One-Stage DMS/DP (구문 분석과 One-Stage DMS/DP를 이용한 연속음 인식)

  • 안태옥
    • Journal of the Institute of Electronics Engineers of Korea SP / v.41 no.3 / pp.201-207 / 2004
  • This paper studies continuous speech recognition using syntactic analysis and one-stage DMS/DP. To perform speech recognition, we first build a DMS model with a section division algorithm and then recognize continuous speech data with the one-stage DMS/DP method using syntactic analysis. In addition to the speech recognition experiments with the proposed method, we run the conventional one-stage DP method on the same data and under the same conditions. The recognition experiments show that one-stage DMS/DP using syntactic analysis is superior to the conventional method.

A Study of ECG Pattern Classification of Using Syntactic Pattern Recognition (신택틱 패턴 인식 알고리즘에 의한 심전도 신호의 패턴 분류에 관한 연구)

  • 남승우;이명호
    • Journal of Biomedical Engineering Research / v.12 no.4 / pp.267-276 / 1991
  • This paper describes a syntactic pattern recognition algorithm for pattern recognition and diagnostic parameter extraction of ECG signals. The ECG signal, represented as a linguistic string, is evaluated by a pattern grammar and its interpreter, an LALR(1) parser, for pattern recognition. The proposed pattern grammar performs syntactic analysis and semantic evaluation simultaneously. The performance of the proposed algorithm has been evaluated using the CSE database.
