• Title/Summary/Keyword: lexical/semantic features


A Study of Intrinsic and Extrinsic Semantic Features of Korean Nouns: Focusing on the Categories of Grains, Fruits and Vegetables (한국어 명사의 내재적/외재적 의미특징 연구: 곡식, 과일, 채소 범주를 중심으로)

  • 정영철;이정모
    • Korean Journal of Cognitive Science
    • /
    • v.15 no.1
    • /
    • pp.43-67
    • /
    • 2004
  • Using qualitative research methodology, this study investigates the semantic features of 39 nouns classified into the categories of grains, fruits, and vegetables. A survey was conducted with a substantial number of undergraduate students, who were asked to describe any semantic features they associated with the lexical items in the three categories. Analysis of the survey data shows that the concepts for fruit examples are defined predominantly by intrinsic semantic features, while those for grains and vegetables are defined more noticeably by extrinsic semantic features than by intrinsic ones. Intrinsic semantic features are properties inherent in an object itself; extrinsic semantic features are properties constructed by association with other objects or with personal experiences in a certain situation. However, the study does not claim that either intrinsic or extrinsic semantic features alone define the concepts in the three categories. Instead, it concludes that both kinds of semantic features are involved in representing the concepts of these words, with intrinsic features salient in the category of fruits and extrinsic features salient in the categories of grains and vegetables.


Korean Semantic Role Labeling Using Semantic Frames and Synonym Clusters (의미 프레임과 유의어 클러스터를 이용한 한국어 의미역 인식)

  • Lim, Soojong;Lim, Joon-Ho;Lee, Chung-Hee;Kim, Hyun-Ki
    • Journal of KIISE
    • /
    • v.43 no.7
    • /
    • pp.773-780
    • /
    • 2016
  • Semantic information and features are very important for Semantic Role Labeling (SRL), yet many machine-learning-based SRL systems mainly adopt lexical and syntactic features. Little previous SRL research has drawn on semantic information because its use has been very restricted. We propose an SRL system that adopts semantic information such as named entities, word sense disambiguation, sense-based filtering of adjunct roles, synonym clusters, frame extension based on a synonym dictionary, joint rules over syntactic and semantic information, and modified verb-specific numbered roles. In our experiments, the proposed method outperforms lexical-syntactic approaches by about 3.77 (Korean Propbank) to 8.05 (Exobrain Corpus) in F1 score.

Predicate Recognition Method using BiLSTM Model and Morpheme Features (BiLSTM 모델과 형태소 자질을 이용한 서술어 인식 방법)

  • Nam, Chung-Hyeon;Jang, Kyung-Sik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.1
    • /
    • pp.24-29
    • /
    • 2022
  • Semantic role labeling, which is used in various natural language processing applications such as information extraction and question answering systems, is the task of identifying the arguments for a given sentence and predicate. The predicates used as input to semantic role labeling are extracted from lexical analysis results such as POS tagging, but this approach cannot extract every predicate because Korean predicates take various patterns depending on the meaning of the sentence. In this paper, we propose a Korean predicate recognition method that uses a neural network model with pre-trained embedding models and lexical features. The experiments compare performance across model hyperparameters and with and without the embedding models and lexical features. As a result, we confirm that the proposed neural network model achieved a performance of 92.63%.

Relation Extraction based on Extended Composite Kernel using Flat Lexical Features (평면적 어휘 자질들을 활용한 확장 혼합 커널 기반 관계 추출)

  • Chai, Sung-Pil;Jeong, Chang-Hoo;Chai, Yun-Soo;Myaeng, Sung-Hyon
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.8
    • /
    • pp.642-652
    • /
    • 2009
  • To improve the performance of existing relation extraction approaches, we propose a method for combining two pivotal concepts that play an important role in classifying semantic relationships between entities in text. Having built a composite kernel-based relation extraction system that incorporates both entity features and syntactic structure information of relation instances, we define nine classes of lexical features and apply them to the system in combination. Evaluation on the ACE RDC corpus shows that our approach boosts the effectiveness of existing composite kernels in relation extraction. It also confirms that integrating the three important feature types (entity features, syntactic structures, and contextual lexical features) improves the performance of relation extraction.

The semantic structure of the Russian humor in the works of Michael Zadornov (자도르노프 작품 속에 나타난 러시아 유머의 의미구조)

  • 안병팔
    • Lingua Humanitatis
    • /
    • v.6
    • /
    • pp.321-357
    • /
    • 2004
  • This article analyzes the structure of modern Russian humor on the basis of several theories: bisociation theory (Koestler 1964) and the semantic script theory of verbal humor, using the concepts of semantic presupposition, pragmatic felicity conditions (Searle 1969; Levinson 1983), and grammatical rules (Chomsky 1965). Until now, studies of the structure of Russian humor (Исаева 1969; Задорнов 1991; 1992) have not examined these theories and have rarely analyzed humor in terms of its semantic structure. Kreps (1981), who analyzed the works of Zoshchenko, presented 21 types of humor using the term 'humoreme' (Крепс 1981, 36-37). These types are a list of the available means of humor, based not on semantic criteria but on the means of literary rhetoric. Kreps presented types of humor means such as contradiction, antonymic substitution, macaronic speech, and the correlation of humoremes in various types of humor. Apart from Kreps, Manakov (Манаков 1986, 61-79) also studied these problems and set out a system of basic types of humor. Manakov introduced the linguistic means of humor used by Russian writers such as Gogol and Chekhov. The means that Manakov illustrated with detailed examples include trope, epithet, comic comparison, comic metaphor, comic periphrasis, euphemism, pun, zeugma, comic toponym, comic onomatopoeia, mania for foreign vocabulary, folk etymology, and dialect. But these studies do not explain why these means make the works humorous. An, B.P. tried to answer this question (안병팔 1997a; b), explaining the contexts of humor through the Release theory, the Superiority theory, and the Incongruity theory, and explaining the process of deviation from grammatical norms through morpho-syntactic and lexical means. But in these studies humor was not analyzed by semantic criteria.
In order to linguistically evaluate the various means of humor formation, it is necessary to elicit the deep structure of humor, which makes it possible to study its formation and interpretation. For this purpose this article, based on the Incongruity theory, defines the structure of humor as the negation of presupposition. Earlier traditional studies also shared the concepts of 'contradiction' and 'contrast' in humor structure, but they did not explain the structure through semantic differential features. Analyzing the works of M. Zadornov, this study shows that the structure of contradiction is formed through the negation of semantic presupposition, with semantic differential features on the semantic, syntactic, or lexical dimensions.


A Multi-Resolution Approach to Restaurant Named Entity Recognition in Korean Web

  • Kang, Bo-Yeong;Kim, Dae-Won
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.4
    • /
    • pp.277-284
    • /
    • 2012
  • Named entity recognition (NER) techniques can play a crucial role in extracting information from the web. While NER systems with relatively high performance have been developed based on careful manipulation of terms with a statistical model, term mismatches often degrade the performance of such systems because the strings of all candidate entities are not known a priori. Despite the importance of lexical-level term mismatches for NER systems, most NER approaches developed to date use only the term string itself and simple term-level features, and do not exploit the semantic features of terms, which can handle term variations effectively. As a solution to this problem, we propose matching the semantic concepts of term units in restaurant named entities (NEs), where these units are automatically generated from multiple resolutions of a semantic tree. As a test, we applied our restaurant NER scheme to 49,153 nouns in Korean restaurant web pages. Our scheme achieved an average accuracy of 87.89% on test data, considerably better than the 78.70% accuracy of the baseline system.

ON THE INCANTATORY FEATURES OF KOREAN SHAMANIC LANGUAGE (한국 무속어의 주술적 특성과 그 해석)

  • Choong-yon Park
    • Lingua Humanitatis
    • /
    • v.1 no.1
    • /
    • pp.295-321
    • /
    • 2001
  • This paper attempts to demonstrate how the linguistic and mythological features of the shamanic language make it incantatory, or 'enchanting'. Passages used in shamanic rites manifest linguistic characteristics that point to their own norms and conventions, as well as some mythological features that contribute to the undecipherability of the shamanic language. Focusing on the estranged linguistic and mythological features, I propose that shamanic languages can best be interpreted in terms of linguistic hierarchization, a notion that has been developed since Roman Jakobson's poetics. The present study adopts Eisele's framework, which reinterprets Jakobsonian hierarchization as a slightly revised notion based on the "degree of combinatorial freedom" and the "degree of semantic immediacy", and looks into a set of paradigm examples in search of parallel structures characterizing the shamanic language. The enchanting effect of this peculiar form of language, it is argued, is due mostly to the frequent use of lexical parallelism, which works in the reverse direction of the normal process of interpretation.


Using Small Corpora of Critiques to Set Pedagogical Goals in First Year ESP Business English

  • Wang, Yu-Chi;Davis, Richard Hill
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.2
    • /
    • pp.17-29
    • /
    • 2021
  • The current study explores small corpora of critiques written by Chinese and non-Chinese university students and compares the strategies these writers use with those of highly rated L1 students. The data comprise three small corpora of student writing: 20 student critiques from 2017, 23 student critiques from 2018, and 23 critiques from the online MICUSP collection at the University of Michigan. The researchers employ Text Inspector and Lexical Complexity to identify university students' vocabulary knowledge and awareness of syntactic complexity. In addition, WMatrix4® is used to identify and support the comparison of lexical and semantic differences among the three corpora. The findings indicate that gaps exist between Chinese and non-Chinese writers in the same university classes in their knowledge of grammatical features and interactional metadiscourse. Chinese writers are also more likely to produce shorter clauses and sentences in their critiques, and the mean value of complex nominals and coordinate phrases is smaller for Chinese students than for non-Chinese and MICUSP writers. Finally, in terms of lexical bundles, Chinese student writers prefer clausal bundles to phrasal bundles, which, according to previous studies, are more often found in texts by skilled writers. The findings suggest incorporating implicit and explicit instruction through the use of corpora in language classrooms to advance the skills and strategies of all writers of English, particularly Chinese writers.

A Comparative Study on Korean Connective Morpheme '-myenseo' to the Chinese expression - based on Korean-Chinese parallel corpus (한국어 연결어미 '-면서'와 중국어 대응표현의 대조연구 -한·중 병렬 말뭉치를 기반으로)

  • YI, CHAO
    • Cross-Cultural Studies
    • /
    • v.37
    • /
    • pp.309-334
    • /
    • 2014
  • This study uses a Korean-Chinese parallel corpus to contrast the Korean connective morpheme '-myenseo' with its corresponding Chinese expressions. Learners of Korean often struggle with Korean connective morphemes, especially when there is a lexical gap between Korean and their mother tongue. '-myenseo' is one of the most frequently used Korean connective morphemes, and it is usually matched with Chinese coordinating conjunctions. According to the corpus, however, the Chinese expressions corresponding to '-myenseo' are not limited to coordinating conjunctions. Because the parallel corpus reveals a variety of Chinese expressions related to '-myenseo', this study can help Chinese learners of Korean learn '-myenseo' more easily. The study first discusses the semantic features and syntactic characteristics of '-myenseo'. Its significant semantic features are 'simultaneous' and 'conflict', and the study uses usage examples to analyze its specific uses. It then analyzes the syntactic characteristics of '-myenseo' in terms of subject constraints, predicate constraints, temporal constraints, mood constraints, and negation constraints, and summarizes them in a table. The most important part of the study is Chapter 4, which contrasts '-myenseo' with Chinese expressions by analyzing the Korean-Chinese parallel corpus. As a result of the analysis, the frequencies of the Chinese expressions corresponding to '-myenseo' are summarized in a table. The table shows that the most common Chinese counterpart of '-myenseo' is the non-marker pattern. That is, Korean can connect clauses with a connective morpheme, which is an explicit linguistic marker, whereas Chinese often connects them through their intrinsic logical relationships. The chapter therefore concludes that '-myenseo' can correspond to Chinese conjunctions, expressions, non-marker patterns, and liberal translation patterns, a wider range than the Chinese conjunctions identified previously. The last chapter concludes the study by summarizing it and suggesting its limitations and directions for future research.

Component-Based VHDL Analyzer for Reuse and Embedment (재사용 및 내장 가능한 구성요소 기반 VHDL 분석기)

  • 박상헌;손영석
    • Proceedings of the IEEK Conference
    • /
    • 2003.07b
    • /
    • pp.1015-1018
    • /
    • 2003
  • As the size and complexity of hardware and software systems increase, more efficient design methodologies have been developed. In particular, design-reuse techniques enable fast system development by integrating existing hardware and software. For this, available hardware and software should be prepared as component-based parts that are adaptable to various systems. This paper introduces a component-based VHDL analyzer that can be embedded in other applications, such as a simulator, synthesis tool, or smart editor. The VHDL analyzer parses a VHDL description, performs lexical, syntactic, and semantic checking, and finally generates intermediate-form data as the result. VHDL has the full features of an object-oriented language, such as data abstraction, inheritance, and polymorphism. Supporting these features requires special analysis algorithms and an intermediate form. This paper summarizes practical issues in implementing a high-performance, high-quality VHDL analyzer and provides solutions based on extensive experience in VHDL analyzer development.
