• Title/Summary/Keyword: syntactic

Search Result 717, Processing Time 0.023 seconds

A Program-Plagiarism Checker using Abstract Syntax Tree (구문트리 비고를 통한 프로그램 유형 복제 검사)

  • 김영철;김성근;염세훈;최종명;유재우
    • Journal of KIISE:Software and Applications
    • /
    • v.30 no.7_8
    • /
    • pp.792-802
    • /
    • 2003
  • Earlier program plagiarism check systems are performed by using simple text, attribute or token string base on match techniques. They have difficulties in checking program styles which have nothing to do with program syntax such as indentation, spacing and comments. This paper introduces a plagiarism check model which compares syntax-trees for the given programs. By using syntax-trees, this system can overcome the weekness of filtering program styles and have advantage of comparing the structure of programs by syntax and semantic analysis. Our study introduces syntactic tree creation, unparsing and similarity check algorithms about C/C++ program plagiarism checking for internet cyber education and estimate plagiarism pattern.

Semantic Aspects of Negation as Schema (부정 스키마의 의미론적 양상)

  • Tae, Kang-Soo
    • The KIPS Transactions:PartB
    • /
    • v.9B no.1
    • /
    • pp.23-28
    • /
    • 2002
  • A fundamental problem in building an intelligent agent is that an agent does not understand the meaning of its perception or its action. One reason that an agent cannot understand the world is partially caused by a syntactic approach that converts a semantic feature into a simple string. To solve this problem, Cohen introduces a semantic approach that an agent autonomously learns a meaningful representation of physical schemas, on which some advanced conceptual structures are built, from physically interacting with environment using its own sensors and effectors. However, Cohen does not deal with a meta level of conceptual primitive that makes recognizing a schema possible. We propose that negation is a meta schema that enables an agent to recognize a physical schema. We prove some semantic aspects of negation.

A Corpus-based Hybrid Translation System for Limited Domain (제한된 도메인을 위한 코퍼스 기반의 하이브리드 번역 시스템)

  • Kang, Un-Gu;Kim, Sung-Hyun;Lee, Byung-Mun;Lee, Young-Ho
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.11
    • /
    • pp.826-836
    • /
    • 2010
  • This paper proposes a hybrid machine translation system which integrates SMT, RBMT, and PBMT in serial manner. SMT in our project has been implemented as a Quasi-syntax-based system where monotone search is done, given a preprocessed string of foreign language. Preprocessing includes rule-based reordering, NE recognition, clausal splitting, and attaching pattern translation information at the end of the input text. For lengthy & complex sentences, clausal splitting turned out to generate better translation than normal input.

Restoring Functional Word and Noun-Verb Syntactic Relations for Korean Compound Noun Analysis (단위 명사간 보-술 관계를 이용한 한국어 복합 명사의 문장 복원)

  • Yang, Seong-Il;Kim, Young-Kil;Seo, Young-Ae;Park, Eun-Jin;Ra, Dong-Yul
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.694-695
    • /
    • 2007
  • 한국어 문장의 구성은 명사, 동사와 같은 내용어와 조사, 어미와 같은 기능어로 크게 나눌 수있다. 문장의 핵심적인 의미 전달은 내용어에 의해 이루어지며, 한국어 명사구의 경우 잦은 기능어의 생략으로 명사 나열에 의한 복합 명사가 발생된다. 이렇게 발생되는 복합 명사를 구성하는 단위 명사들은 일부 문장 성분을 생략시켜 발생된 것으로, 생략 성분의 복원에 의해 본래의 문장 형태를 추정할 수 있다. 한국어 복합 명사의 경우, 생략되는 문장 성분은 대부분 접사, 조사와 같은 기능어로 국한되며, 기능어의 복원은 단위 명사 간의 격 관계와 의미 관계를 분석하여 이루어질 수 있다. 본 논문에서는 단위 명사간의 보-술 관계를 이용하여 복합 명사를 구성하는 단위 명사 간의 의존 관계를 추정하고, 추정된 의존 관계에 의해 생략된 격조사와 용언화 접사를 복원하는 방법을 제안한다. 구조 분석에서 사용되는 의미 격틀에 의해 결정되는 격 관계는 격조사와 용언화 접사의 복원을 결정하며, 올바른 본래의 문장 표현 복원을 위해 관형격 조사와 관형격 어미를 비롯한 특별한 형태의 복원은 통계 정보와 휴리스틱 규칙으로 결정한다.

  • PDF

Automatic Construction of Generalized Lexical Information for Syntactic Ambiguity Resolution (구문 분석에서의 중의성 해소를 위한 일반화된 어휘정보의 자동 구축 및 적용)

  • Chung, Hoo-Jung;Hwang, Young-Sook;Kwak, Yong-Jae;Park, So-Young;Rim, Hae-Chang
    • Annual Conference on Human and Language Technology
    • /
    • 1998.10c
    • /
    • pp.269-275
    • /
    • 1998
  • 구문 분석에서의 중의성을 해결하는 데 어휘정보가 유용하다는 것은 잘 알려져 있다. 그러나 기존의 어휘정보 구축 방법들은 많은 수작업을 요구하거나, 자동으로 구축하는 경우에는 어휘 자체를 그대로 사용함에 따라 심각한 자료 회귀성 문제가 발생했다. 본 논문에서는 구문 분석에서의 중의성 해소를 위해 원시 코퍼스와 시소러스를 이용하여 개념 수준(conceptual-level)의 일반화된 술어-인자 어휘정보를 자동으로 구축하고, 이를 파서에 적용하는 방법을 제안하고자 한다. 제안한 방법으로 구축한 일반화된 어휘정보를 파서를 이용하여 명사구의 지배소 결정 실험에 적용하여 본 결과, 정확도가 85.9%에서 91.5%로 향상되었다. 또, 미지격 결정 실험에 대해서는 86.32 %의 격 결정 성공률을 보여주었다.

  • PDF

Boolean Formulation of Korean Natural Language Queries Using Syntactic Analysis (구문 분석에 기반한 자연어 질의로부터의 불리언 질의 생성)

  • Park, Mi-Hwa;Won, Hyung-Suk;Lee, Won-Il;Lee, Geun-Bae
    • Annual Conference on Human and Language Technology
    • /
    • 1998.10c
    • /
    • pp.73-80
    • /
    • 1998
  • 본 연구는 자연어 질의의 형태 및 구문 정보를 바탕으로 불리언 질의를 생성하는데 그 목적을 둔다. 일반적으로 대부분의 상용정보검색시스템은 입력형식을 검색성능이 종은 불리언 형태로 하고 있으나, 일반 사용자는 자신이 원하는 정보를 불리언 형태로 표현하는데 익숙하지 않다. 그러므로 본 정보검색시스템은 자연어 질의를 기본 입력형태로 하여 사용자의 편의성을 높이고, 이 질의를 범주문법에 기반한 구문분석 결과에 의해 복합명사를 고려한 불리언 형태로 변환하여 검색을 수행함으로써 시스템의 검색 성능의 향상을 도모하였다. 정보검색 실험용 데이터 모음인 KTSET2.0으로 실험한 결과 본 논문에서 제안한 자연어 질의로부터 자동 생성된 불리언 질의의 검객성능이 KTSET2.0에서 제공하는 수동으로 추출한 불리언 질의보다 8% 더 우수한 성능을 보였고, 기존 자연어질의 시스템이 수용해온 방법인 형태소 분석을 거쳐 불용어를 제거한 후 Vector 모델을 적용하여 검색을 수행한 경우보다는 23% 더 나은 성능을 보였다.

  • PDF

Comparison Shopping System Based on RSS with Ontology Matching (온톨로지 매칭을 이용한 RSS 기반의 비교쇼핑 시스템)

  • Park, Sang-Un
    • The Journal of Information Systems
    • /
    • v.20 no.3
    • /
    • pp.41-61
    • /
    • 2011
  • In order to buy products through the Internet, consumers dissipate much time and efforts in collecting and comparing product information from various online shopping malls. Consumers can save their efforts by using price comparison sites, but there are some shortcomings in comparison shopping. Firstly, comparison sites do not show the lowest price of some products that are selling in shopping malls. Secondly, the product information provided by comparison sites is sometimes wrong. Thirdly, there are too many results. In order to overcome the shortcomings, we suggested a comparison shopping system based on RSS by using ontology matching. We used the current RSS standard for syntactic interoperability instead of suggesting new standards. Moreover, we used ontology matching for semantic interoperability to compare product information with different ontologies. The suggested ontology matching consists of three steps. The first step is finding exact sense from WordNet for a given product category, and the second step is searching for matching product category candidates from the products of RSS feeds. The final step is calculating similarities of the candidates with the target product category. From the experiments, we could get better recall rates that are suitable for e-commerce environments and the results show that our system is effective in product comparison.

Phrasing Patterns before and after that in English: The Cases of Complement and Relative Clauses (영어 that 전/후의 구설정 패턴: 보문절과 관계대명사절을 중심으로)

  • Han, Hye-Seung;Lee, Joo-Kyeong
    • Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.53-64
    • /
    • 2007
  • This paper attempts to verify the theoretical claims in Syntax ($Bo{\check{s}}kovi{\acute{c}}$ & Lasnik, 2003; Kim, 1999, 2004) about the phrasing in English that-clauses, presenting an acoustic experiment conducted to observe the patterns of edge tones at the boundaries of that. In the experiment, two different that clauses, complement and relative clauses, were varied in forms (that-retention, that-deletion, adverb insertion before that) and length. Results showed that edge tones, if occurred, mostly showed up before the complement clauses in that-deletion sentences (67%), and that their positions polarized in adverb insertion sentences (56% before toot and 44% after toot). In the relative clauses, phrasing mostly occurred before toot as opposed to after toot in that-retention (73%) and adverb insertion sentences (87%). Additionally, phrasing tends to occur more frequently as the sentences get longer. The results suggest that the previous claims based on syntax are not consistent with the results of the current phonetic experiment. This may be interpreted as stating that syntactic boundaries do not always indicate phonetic phrasing, and that there may be some other factors to determine phrasing patterns, for example, rhythmic phrasing operating at the surface level of speech.

  • PDF

Visualization Techniques for Massive Source Code (대용량 소스코드 시각화기법 연구)

  • Seo, Dong-Su
    • The Journal of Korean Association of Computer Education
    • /
    • v.18 no.4
    • /
    • pp.63-70
    • /
    • 2015
  • Program source code is a set of complex syntactic information which are expressed in text forms, and contains complex logical structures. Structural and logical complexity inside source code become barriers in applying visualization techniques shown in traditional big-data approaches when the volume of source code become over ten-thousand lines of code. This paper suggests a procedure for making visualization of structural characteristics in source code. For this purpose, this paper defines internal data structures as well as inter-procedural relationships among functions. The paper also suggests a means of outlining the structural characteristics of source code by visualizing the source codes with network forms The result of the research work can be used as a means of controling and understanding the massive volume of source code.

Stability of Early Language Development of Verbally-Precocious Korean Children from 2 to 3 Year-old (조기언어발달 아동의 초기 언어능력의 안정성)

  • Lee, Kwee-Ock
    • The Korean Journal of Community Living Science
    • /
    • v.19 no.4
    • /
    • pp.673-684
    • /
    • 2008
  • The purpose of this study is to compare the complexity of language level between verbally-precocious and typically-developing children from 2 to 3 years-old. Participants were 15 children classified as verbally-precocious were scored at the mean 56.85(expressive language) and 88.82(receptive language), and another 15 children classified as typically developing did at the mean 33.51(expressive language) and 58.01(receptive language) on MCDI-K. Each child's spontaneous utterances in interaction with her caregiver were collected at three different times with 6 months interval. All of the utterances were transcribed and analyzed for the use of MLU and lexical diversity by using KCLA. Summarizing the overall results, verbally-precocious children had significantly higher language abilities than typically-developing children at each time, and there were significant differences between two groups in syntactic and semantic language development, showing that verbally-precocious children indicated distinctive MLU and lexical diversity. These results suggest a high degree of stability in precocious verbal status, with variations in language complexity during conversations contributing to later differences in their language ability.

  • PDF