• Title/Summary/Keyword: syntactic

Search Result 717, Processing Time 0.027 seconds

A Formal Specification and Accuracy Checking of 2+1 View Integrated Metamodel Using Z and Object-Z (Z/Object-Z 사용한 2+1 View 통합 메타모델의 정형 명세와 명확성 검사)

  • Song, Chee-Yang
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.1
    • /
    • pp.449-459
    • /
    • 2014
  • The proposed 2+1 view integrated metamodel defined formerly with a graphical class model can not be guaranteed the syntactic clarity and accuracy precisely for the metamodel due to the informal specification. This paper specifies the syntactic semantics formally for the 2+1 view integrated metamodel using Z and Object-Z and checks the accuracy of the metamodel with Z/Eves tool. The formal specification is expressed in Z and Object-Z schema separately for syntax and statics semantics of the 2+1 view integrated metamodel, which applying the converting rule between class model and Z/Object-Z. The accuracy of the Z specification for the metamodel is verified using Z/Eves tool, which can check the syntax, type, and domain of the Z specification. The transformation specification and checking of the 2+1 view integrated metamodel can help establish more accurate the syntactic semantics of its construct and check the accuracy of the metamodel.

A Study on the Textuality of Sijo Poetry (시조의 텍스트성(textuality) 연구)

  • Im Jong-Chan
    • Sijohaknonchong
    • /
    • v.21
    • /
    • pp.5-22
    • /
    • 2004
  • If Sijo poetry is referred to a text which is composed of sentences. its textuality can be explored in terms of 1) the figurative words used in each line. 2) the logical sequence between lines. 3) the syntactic composition of each line. With the ancient Sijo poem. 1) it is composed of logical sentences as a result of extremely restraining from using figurative words that could prevent the reader from grasping the logical sequence within the work; 2) there is a clear cohesion between lines that can make each work perfectly coherent; 3) each line has a balanced syntactic structure, so the entire structure of a Sijo poem is '6 phrases in lines'. With the modem Sijo peom. 1) it abounds in figurative words, which prevent the work from having a logical sequence, and sometimes even from having three lines; 2) there is a loose cohesion between lines which can't make each work coherent; 3) it sometimes destroys the syntactic structure, '6 phrases in 3 lines', unique to traditional Sijo poetry. I think that this trend of modern Sijo poetry can cause haphazard the existence of modern Sijo poetry.

  • PDF

Clustering-based Statistical Machine Translation Using Syntactic Structure and Word Similarity (문장구조 유사도와 단어 유사도를 이용한 클러스터링 기반의 통계기계번역)

  • Kim, Han-Kyong;Na, Hwi-Dong;Li, Jin-Ji;Lee, Jong-Hyeok
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.4
    • /
    • pp.297-304
    • /
    • 2010
  • Clustering method which based on sentence type or document genre is a technique used to improve translation quality of SMT(statistical machine translation) by domain-specific translation. But there is no previous research using sentence type and document genre information simultaneously. In this paper, we suggest an integrated clustering method that classifying sentence type by syntactic structure similarity and document genre by word similarity information. We interpolated domain-specific models from clusters with general models to improve translation quality of SMT system. Kernel function and cosine measures are applied to calculate structural similarity and word similarity. With these similarities, we used machine learning algorithms similar to K-means to clustering. In Japanese-English patent translation corpus, we got 2.5% point relative improvements of translation quality at optimal case.

The Language Change and Language Processing (언어 변화와 언어 처리 - '는게/는데' 문법 화와 자동 태깅 시스템-)

  • 최운호
    • Korean Journal of Cognitive Science
    • /
    • v.10 no.2
    • /
    • pp.35-43
    • /
    • 1999
  • This paper aims to research the language changes in modern Korean and its effect to the language processing systems. In modern Korean. the syntactic constructions l like [Adnominal Ending + Bound Noun ( + Postposition)] are changing into the morphological constructions, and some of these constructions are reflected in the written language. For example. the syntactic construction [Ad nominal Ending + '-de (Bound N Noun)' (+ Postposition) ) co-exists with the mixed form '-neunde' and [Adnominal Ending + 'geot' (Bound Noun) + '-j' (Postposition)) does with ' neunge'. These constructions are used frequently in the spoken language. As like other verbal endings, these forms also participate in the construction of the complex sentence, and these forms have its own case function fused into themselves So, the analytic approach to these forms can make great effect on the automatic morphological analysis systems. automatic tagging systems. and the syntactic analysis systems. So. in the design phase of a language processing systems, the language change phenomena like these must be taken l into consideration.

  • PDF

Loaming Syntactic Constraints for Improving the Efficiency of Korean Parsing (한국어 구문분석의 효율성을 개선하기 위한 구문제약규칙의 학습)

  • Park, So-Young;Kwak, Yong-Jae;Chung, Hoo-Jung;Hwang, Young-Sook;Rim, Hae-Chang
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.10
    • /
    • pp.755-765
    • /
    • 2002
  • In this paper, we observe various syntactic information for Korean parsing and propose a method to learn constraints and improve the efficiency of a parsing model by using the constraints. The proposed method has the following three characteristics. First, it improves the parsing efficiency since we use constraints that can prevent the parser from generating unsuitable candidates. Second, it is robust on a given Korean sentence because the attributes for the constraints are selected based on the syntactic and lexical idiosyncrasy of Korean. Third, it is easy to acquire constraints automatically from a treebank by using a decision tree learning algorithm. The experimental results show that the parser using acquired constraints can reduce the number of overgenerated candidates up to 1/2~1/3 of candidates and it runs 2~3 times faster than the one without any constraints.

A comparative study of Entity-Grid and LSA models on Korean sentence ordering (한국어 텍스트 문장정렬을 위한 개체격자 접근법과 LSA 기반 접근법의 활용연구)

  • Kim, Youngsam;Kim, Hong-Gee;Shin, Hyopil
    • Korean Journal of Cognitive Science
    • /
    • v.24 no.4
    • /
    • pp.301-321
    • /
    • 2013
  • For the task of sentence ordering, this paper attempts to utilize the Entity-Grid model, a type of entity-based modeling approach, as well as Latent Semantic analysis, which is based on vector space modeling, The task is well known as one of the fundamental tools used to measure text coherence and to enhance text generation processes. For the implementation of the Entity-Grid model, we attempt to use the syntactic roles of the nouns in the Korean text for the ordering task, and measure its impact on the result, since its contribution has been discussed in previous research. Contrary to the case of German, it shows a positive result. In order to obtain the information on the syntactic roles, we use a strategy of using Korean case-markers for the nouns. As a result, it is revealed that the cues can be helpful to measure text coherence. In addition, we compare the results with the ones of the LSA-based model, discussing the advantages and disadvantages of the models, and options for future studies.

  • PDF

A Morpheme-unit Korean Feature-Based Brammer (KFG) with the X-bar Theoretic Notion of Headedness (X-바 이론의 중심어 개념을 도입한 형태소 단위의 한국어 자질 기반 문법)

  • Park, So-Yeong;Hwang, Yeong-Suk;Im, Hae-Chang
    • Journal of KIISE:Software and Applications
    • /
    • v.26 no.10
    • /
    • pp.1247-1259
    • /
    • 1999
  • 본 논문에서는 한국어 문장형성원리를 간결하게 제시할 수 있도록 X-바 이론의 중심어 개념을 도입한 한국어 자질기반 문법을 제안한다. 제안하는 문법은 어절에 관계없이 나타나는 한국어의 문법현상을 명확히 설명할 수 있도록 어절 대신 형태소를 기본단위로 한다. 그리고, 한국어의 구문범주가 지닌 의미정보와 기능정보를 자질을 이용하여 독립적으로 표현하며, 구문범주간의 결합관계를 바탕으로 하는 자질연산을 수행하여 문장을 분석한다. 또한, 한국어의 부분자유어순과 생략현상에 대해 견고하게 분석할 수 있도록 자질연산을 이진결합중심의 CNF(Chomsky Normal Form)로 제한한다. 이렇게 구성된 한국어 자질기반 문법은 규칙을 직관적이고도 간단하게 기술하며, 한국어의 다양한 문장들을 견고하게 분석한다. SERI Test Suites 97과 신문기사에서 746문장을 추출하여 실험한 결과 94%~99%의 적용율을 보였다.Abstract In this paper, we propose a Korean feature-based grammar(KFG) which adopts the X-bar theoretic notion of headedness for a precise representation of Korean syntactic structure. In order to explain various language phenomena in a given sentence, we use not the word but the morpheme as a constituent unit of KFG. We use features manifesting both the syntactic information and the semantic information of Korean syntactic categories, and feature operations based on the association relationship between two categories. In addition, we restrict feature operations to CNF(Chomsky Normal Form) binary form, which provides a robust representation for properties in Korean such as the frequent ellipsis and the partial free-order. The KFG is intuitive, simple, and versatile in representing most Korean sentences. The experimental result shows 94%~99% coverage on 746 sentences extracted from SERI Test Suites 97 and newspaper sentences.

Determination of Thematic Roles according to Syntactic Relations Using Rules and Statistical Models in Korean Language Processing (한국어 전산처리에서 규칙과 확률을 이용한 구문관계에 따른 의미역 결정)

  • 강신재;박정혜
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.8 no.1
    • /
    • pp.33-42
    • /
    • 2003
  • This paper presents an efficient determination method of thematic roles from syntactic relations using rules and statistical model in Korean language processing. This process is one of the main core of semantic analysis and an important issue to be solved in natural language processing. It is problematic to describe rules for determining thematic roles by only using general linguistic knowledge and experience, since the final result may be different according to the subjective views of researchers, and it is impossible to construct rules to cover all cases. However, our hybrid method is objective and efficient by considering large corpora, which contain practical usages of Korean language, and case frames in the Sejong Electronic Lexicon of Korean, which is being developed by dozens of Korean linguistic researchers. To determine thematic roles more correctly, our system uses syntactic relations, semantic classes, morpheme information, position of double subject. Especially by using semantic classes, we can increase the applicability of our system.

  • PDF

Generalized LR Parser with Conditional Action Model(CAM) using Surface Phrasal Types (표층 구문 타입을 사용한 조건부 연산 모델의 일반화 LR 파서)

  • 곽용재;박소영;황영숙;정후중;이상주;임해창
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.1_2
    • /
    • pp.81-92
    • /
    • 2003
  • Generalized LR parsing is one of the enhanced LR parsing methods so that it overcome the limit of one-way linear stack of the traditional LR parser using graph-structured stack, and it has been playing an important role of a firm starting point to generate other variations for NL parsing equipped with various mechanisms. In this paper, we propose a conditional Action Model that can solve the problems of conventional probabilistic GLR methods. Previous probabilistic GLR parsers have used relatively limited contextual information for disambiguation due to the high complexity of internal GLR stack. Our proposed model uses Surface Phrasal Types representing the structural characteristics of the parse for its additional contextual information, so that more specified structural preferences can be reflected into the parser. Experimental results show that our GLR parser with the proposed Conditional Action Model outperforms the previous methods by about 6-7% without any lexical information, and our model can utilize the rich stack information for syntactic disambiguation of probabilistic LR parser.