• Title/Summary/Keyword: syntactic information

Search Result 422, Processing Time 0.028 seconds

Prediction of Prosodic Break Using Syntactic Relations and Prosodic Features (구문 관계와 운율 특성을 이용한 한국어 운율구 경계 예측)

  • Jung, Young-Im;Cho, Sun-Ho;Yoon, Ae-Sun;Kwon, Hyuk-Chul
    • Korean Journal of Cognitive Science
    • /
    • v.19 no.1
    • /
    • pp.89-105
    • /
    • 2008
  • In this paper, we suggest a rule-based system for the prediction of natural prosodic phrase breaks from Korean texts. For the implementation of the rule-based system, (1) sentence constituents are sub-categorized according to their syntactic functions, (2) syntactic phrases are recognized using the dependency relations among sub-categorized constituents, (3) rules for predicting prosodic phrase breaks are created. In addition, (4) the length of syntactic phrases and sentences, the position of syntactic phrases in a sentence, sense information of contextual words have been considered as to determine the variable prosodic phrase breaks. Based on these rules and features, we obtained the accuracy over 90% in predicting the position of major break and no break which have high correlation with the syntactic structure of the sentence. As for the overall accuracy in predicting the whole prosodic phrase breaks, the suggested system shows Break_Correct of 87.18% and Juncture Correct of 89.27% which is higher than that of other models.

  • PDF

Ordering a Left-branching Language: Heaviness vs. Givenness

  • Choi, Hye-Won
    • Language and Information
    • /
    • v.13 no.1
    • /
    • pp.39-56
    • /
    • 2009
  • This paper investigates ordering alternation phenomena in Korean using the dative construction data from Sejong Corpus of Modern Korean (Kim, 2000). The paper first shows that syntactic weight and information structure are distinct and independent factors that influence word order in Korean. Moreover, it reveals that heaviness and givenness compete each other and exert diverging effects on word order, which contrasts the converging effects of these factors shown in word orders of right-branching languages like English. The typological variation of syntactic weight effect poses interesting theoretical and empirical questions, which are discussed in relation to processing efficiency in ordering.

  • PDF

The Study on the Principles of Selecting Korean Particle 'Ka' and 'Nun' Using Korean-English Parallel Corpus (한영 병렬 말뭉치를 이용한 한국어 조사 '가'와 '는'의 선택 원리 연구)

  • Yoo, Hyun-Kyung;An, Ye-Ri;Yang, Su-Hyang
    • Language and Information
    • /
    • v.11 no.1
    • /
    • pp.1-23
    • /
    • 2007
  • This study aims to research into the meaning of Korean particle 'ka' and 'nun' inductively by examining the correspondences of those particles and English articles on the Korean-English parallel corpus. The correspondences were checked in three ways: semantically, syntactically and pragmatically. This study found that when the semantic or syntactic tier is not salient, the pragmatic tier is activated and particles are selected according to the pragmatic elements such as the amount of information or the change of topic. However, if the meaning of the particles is salient or if there is any syntactic motive, particles are selected in accordance with the semantic or syntactic elements. Former studies which focused on one of those three tiers cannot properly explain such correspondences on the Korean-English parallel corpus. This study shows that semantic, syntactic and pragmatic tiers hierarchically affect the selection of a particle and that the selection process is also related to speaker's intention. This dimensional analysis of particles is expected to contribute to theoretical studies and applied studies like Korean language education as well.

  • PDF

Syntactic Analysis and Keyword Expansion for Performance Enhancement of Information Retrieval System (정보 검색 시스템의 성능 향상을 위한 구문 분석과 검색어 확장)

  • 윤성희
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.5 no.4
    • /
    • pp.303-308
    • /
    • 2004
  • Natural language query is the best user interface for the users of information retrieval systems. This paper Proposes a retrieval system with expanded keyword from syntactically-analyzed structures of user's natural language query based on natural language processing technique. Through the steps combining or splitting the compound nouns based on syntactic tree traversal, and expanding the other-formed or shorten-formed keyword into multiple keyword, the system performance was enhanced up to 11.3% precision and 4.7% correctness.

  • PDF

Processing Nominal Suffixes in Korean: Evidence from Priming Experiments

  • Ahn, Hee-Don;An, Duk-Ho;Choi, Jung-Yun;Hwang, Jong-Bai;Jeon, Moon-Gee;Kim, Ji-Hyon
    • Language and Information
    • /
    • v.15 no.1
    • /
    • pp.1-12
    • /
    • 2011
  • This study investigates morphologically complex nouns in Korean through a series of priming studies. Two experiments examined whether morphological affixes on Korean nouns were decomposed or processed as a whole. Two types of morphological affixes were examined: morpho-syntactic case markers and the plural marker '-tul'. Results showed that priming occurred for the plural marker with SOAs of 80 ms and 160 ms, but no priming occurred for the morpho-syntactic case markers. These results suggest that the morphological processing for these two types of affixes differ. We argue that Korean nouns with the plural suffix are decomposed into the stem and affix, supporting the Decomposition Model (Pinker & Ullman, 2002). We suggest that while plural markers are truly morphological affixes, case markers in Korean are morpho-syntactic, and thus presuppose the existence of other syntactic elements, such as the matrix verb, hence the lack of priming effects.

  • PDF

Connectivity Effects and Questions as Specificational Subjects

  • Yoo, Eun-Jung
    • Language and Information
    • /
    • v.10 no.2
    • /
    • pp.21-45
    • /
    • 2006
  • Connectivity effects have been central issues in dealing with specificational pseudoclefts. While syntactic approaches motivate their analysis in order to explain connectivity effects in terms of a connected clause, these accounts have numerous problems including a wide range of anti-connectivity effects that constitute crucial counterevidence. On the other hand, semantic accounts of connectivity effects treat BV and BT connectivity by independent interpretive mechanisms providing a more fundamental explanation for connectivity effects. Yet existing semantic accounts have limitations in explaining syntactic properties and syntactic connectivity effects in SPCs, and in accounting for BV anti-connectivity effects in English. Focusing on BV connectivity, this paper explores how the relevant (anti-)connectivity facts can be accounted for by an analysis that provides both an elaborate syntactic analysis of SPCs and a semantic mechanism for bound anaphora. Based on Yoo's (2005) non-deletion based, question-answer pair analysis of SPCs, this paper shows that a functional question analysis of a specificational subject, when combined with a theory of operator scope and a non-configurational condition on bound anaphora, can explain various BV (anti-)connectivity patterns in SPCs and related constructions.

  • PDF

A Study on Definition Related to Passive and Causative in Korean Dictionary. (피동·사동과 관련한 국어사전의 뜻풀이에 대하여)

  • CHOE, Ho Chol
    • Korean Linguistics
    • /
    • v.48
    • /
    • pp.333-354
    • /
    • 2010
  • When defining the word related to passive and causative in Korean dictionary, the meaning of headword can be explained by linking them to other related words. The link could be expressed into two forms; the one is 'passive verb causative verb of A' and the other is 'passive form causative form of A.' Whichever the dictionary takes, the important thing is that the content to which it refer should be correct. However the format of 'passive verb causative verb of A' and 'passive form causative form of A' is problematic because the definition of headword does not contain semantic information but syntactic or morphological information. Generic concept 'passive form causative form' and 'passive verb causative verb' refers to respectively morphological and syntactic level but specific concept 'A' refers to semantic level. These morphological, syntactic and semantic level can not be a same dimension so the size of their denotation can not be compared. The way of transform syntactic dimension 'passive verb causative verb' and morphological dimension 'passive form causative form' into semantic dimension is removing 'verb' and 'form' from 'passive verb causative verb' and 'passive form causative form' respectively. Therefore the expression 'passive verb causative verb of A' or 'passive form causative form of A' ought be changed into 'passive causative of A.'

Syntactic Category Prediction for Improving Parsing Accuracy in English-Korean Machine Translation (영한 기계번역에서 구문 분석 정확성 향상을 위한 구문 범주 예측)

  • Kim Sung-Dong
    • The KIPS Transactions:PartB
    • /
    • v.13B no.3 s.106
    • /
    • pp.345-352
    • /
    • 2006
  • The practical English-Korean machine translation system should be able to translate long sentences quickly and accurately. The intra-sentence segmentation method has been proposed and contributed to speeding up the syntactic analysis. This paper proposes the syntactic category prediction method using decision trees for getting accurate parsing results. In parsing with segmentation, the segment is separately parsed and combined to generate the sentence structure. The syntactic category prediction would facilitate to select more accurate analysis structures after the partial parsing. Thus, we could improve the parsing accuracy by the prediction. We construct features for predicting syntactic categories from the parsed corpus of Wall Street Journal and generate decision trees. In the experiments, we show the performance comparisons with the predictions by human-built rules, trigram probability and neural networks. Also, we present how much the category prediction would contribute to improving the translation quality.

High Speed Korean Dependency Analysis Using Cascaded Chunking (다단계 구단위화를 이용한 고속 한국어 의존구조 분석)

  • Oh, Jin-Young;Cha, Jeong-Won
    • Journal of the Korea Society for Simulation
    • /
    • v.19 no.1
    • /
    • pp.103-111
    • /
    • 2010
  • Syntactic analysis is an important step in natural language processing. However, we cannot use the syntactic analyzer in Korean for low performance and without robustness. We propose new robust, high speed and high performance Korean syntactic analyzer using CRFs. We treat a parsing problem as a labeling problem. We use a cascaded chunking for Korean parsing. We label syntactic information to each Eojeol at each step using CRFs. CRFs use part-of-speech tag and Eojeol syntactic tag features. Our experimental results using 10-fold cross validation show significant improvement in the robustness, speed and performance of long Korea sentences.