• Title/Summary/Keyword: lexical information

Interface between Morphology and Syntax: A Constraint-Based and Lexicalist Approach

  • Kim, Jong-Bok
    • Language and Information / v.2 no.1 / pp.177-213 / 1998
  • Conflicting criteria used in identifying words have called the lexical integrity principle into question. That is, cases where the morphological word does not coincide with the syntactic word have motivated the syntactic view of word derivation, as pointed out by Bresnan and Mchombo (1995). Further, the implicit desire to make the clausal structure of Korean parallel to those posited for English (Chomsky 1991) and French (Pollock 1989) has also led most of the current literature on Korean morphology to claim that Korean verbal inflections head their own functional projections, such as AgrP, TP, and MP, in syntax. In this paper, I will first argue against such a syntactic view. After reviewing some basic properties of Korean verbal inflections, I will show that the evidence from mismatch phenomena supports the lexical integrity principle over the head-movement theories of word derivation. Then, I will propose a theory of lexical grammar which maintains the lexical integrity principle while retaining the effects of functional projections and syntactic movement.

Automatic Construction Method of Unknown Word Lexical Dictionary (Unknown Word Lexical Dictionary의 자동 생성 방법)

  • Hwang, Myung-Gwon;Youn, Byung-Su;Jeong, Il-Yong;Kim, Pan-Koo
    • Annual Conference of KIPS / 2008.05a / pp.3-6 / 2008
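  • This study is part of our research on semantic information retrieval. Its goal is to automatically construct a lexical dictionary for unknown words (UWs), that is, words not defined in existing dictionaries, which have so far been a major obstacle to semantic document retrieval. To this end, we treat a UW as a word that is not defined in WordNet, an existing English lexical dictionary, extract words related to the UW from input web documents, and measure their degree of semantic relatedness using probabilistic and semantic methods. This paper describes only the method for automatically constructing the UW Lexical Dictionary and does not include a quantitative, objective evaluation. However, results extracted from several documents to check the usefulness of the approach suggest that it is semantically meaningful and of high value.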
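The abstract does not say which probabilistic measure is used, so the following is only a minimal sketch of one plausible way to score words related to an unknown word: the share of UW-containing documents in which each other word also appears. The function name `related_words`, the whitespace tokenization, and the toy documents are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter

def related_words(unknown_word, documents, top_k=3):
    """Rank words by the share of UW-containing documents they also appear in.

    This document-level co-occurrence ratio is an assumed stand-in for the
    paper's (unspecified) probabilistic relatedness measure.
    """
    uw = unknown_word.lower()
    containing = 0          # number of documents that mention the unknown word
    cooccur = Counter()     # per-word count of those documents
    for doc in documents:
        tokens = set(doc.lower().split())
        if uw not in tokens:
            continue
        containing += 1
        cooccur.update(tokens - {uw})
    if containing == 0:
        return []
    scored = [(word, count / containing) for word, count in cooccur.items()]
    # Sort by score (descending), then alphabetically for a stable order.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_k]

# Hypothetical toy corpus; "ipod" plays the role of a word missing from WordNet.
docs = [
    "ipod music player",
    "ipod music store",
    "piano music lesson",
]
ranking = related_words("ipod", docs, top_k=2)
print(ranking)
```

In a fuller pipeline, the highest-ranked words would be the candidates whose WordNet senses are then used to describe the unknown word.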

An Efficient Index Term Extraction Method in IR using Lexical Chains (정보검색에서 어휘체인을 이용한 효과적인 색인어 추출 방안)

  • Kang, Bo-Yeong;Lee, Sang-Jo
    • Journal of KIISE:Software and Applications / v.29 no.8 / pp.584-594 / 2002
  • In information retrieval and digital libraries, one of the most important factors is finding the exact information that users need. In this paper, we present an efficient index term extraction method that makes it possible to infer the content of documents and retrieve information more precisely. To find index terms in a document, we use lexical chains. Before generating lexical chains, we roughly disambiguate the senses of nouns in the document using a specific concept called the semantic window: we look ahead at the semantic relations of neighboring nouns and use them to disambiguate noun senses. After generating lexical chains from the sense-disambiguated nouns, we identify strong chains using several metrics and extract index terms from a few strong chains. We evaluated our system using the results of a key phrase extraction system, KEA. The system works on general-domain documents, including those on information retrieval and digital libraries.
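The chain-building and term-selection steps can be sketched roughly as follows. This is a simplified illustration, not the authors' system: the `RELATED` table is a hypothetical stand-in for a thesaurus such as WordNet, the semantic-window disambiguation step is omitted, and chain length is assumed as the only strength metric.

```python
def build_chains(nouns, related):
    """Greedily attach each noun to the first chain holding an identical or related noun."""
    chains = []
    for noun in nouns:
        for chain in chains:
            if any(noun == m or frozenset((noun, m)) in related for m in chain):
                chain.append(noun)
                break
        else:
            # No existing chain is related to this noun, so start a new one.
            chains.append([noun])
    return chains

def index_terms(chains, top_n=1):
    """Return the distinct nouns of the top_n longest chains, in order of appearance."""
    strongest = sorted(chains, key=len, reverse=True)[:top_n]
    terms = []
    for chain in strongest:
        for noun in chain:
            if noun not in terms:
                terms.append(noun)
    return terms

# Hypothetical semantic relations between noun pairs (assumption for this sketch).
RELATED = {
    frozenset(("index", "term")),
    frozenset(("term", "keyword")),
    frozenset(("library", "book")),
}

nouns = ["index", "library", "term", "keyword", "index", "book"]
chains = build_chains(nouns, RELATED)
terms = index_terms(chains)
print(chains)
print(terms)
```

Counting repeated nouns toward chain length means that frequently mentioned concepts strengthen their chain, which is one common way to favor the document's main topic.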

Multi-level Mapping of Ontologies Based on Lexical and Structural Information (어휘와 구조 정보에 기반한 온톨로지의 다단계 매핑)

  • Hwang, Se-Chan;Kang, Sin-Jae
    • Journal of the Korean Institute of Intelligent Systems / v.22 no.1 / pp.42-48 / 2012
  • Since the Semantic Web emerged, ontologies have been widely used in web environments. Even when ontologies belong to the same domain, they may contain different words with the same meaning, or the same words with different meanings, depending on their development background and type of use. To share and reuse ontologies, ontology mapping is required. This paper presents an ontology mapping method that consists of an initial multi-level mapping process based on lexical information and a second mapping process that uses the lexical results and structural similarity. Mapping performance was improved by additionally expanding the structural information of blank nodes, which have no lexical information. In experiments, our method achieved an F1-measure of 86.38%.
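A two-stage mapping of this kind might look like the sketch below: labels are matched lexically first, and nodes left unmapped (such as blank nodes with no label) are then matched by how well their neighbors line up with nodes that are already mapped. The data layout, the thresholds, the string-similarity measure (`difflib.SequenceMatcher`), and the Jaccard overlap are all assumptions for illustration; the abstract does not specify the paper's actual measures.

```python
from difflib import SequenceMatcher

def lexical_sim(a, b):
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def map_ontologies(src, tgt, lex_threshold=0.8, struct_threshold=0.5):
    """Map source nodes to target nodes.

    src and tgt map a node id to (label, set of neighbor ids); a blank node
    has the empty string as its label.
    """
    mapping = {}
    # Stage 1: lexical mapping on node labels.
    for s, (s_label, _) in src.items():
        if not s_label:
            continue  # blank nodes carry no lexical information
        best = max(tgt, key=lambda t: lexical_sim(s_label, tgt[t][0]))
        if lexical_sim(s_label, tgt[best][0]) >= lex_threshold:
            mapping[s] = best
    # Stage 2: structural mapping for nodes that are still unmapped.
    for s, (_, s_neighbors) in src.items():
        if s in mapping:
            continue
        # Project the source neighbors into the target through the mapping so far.
        projected = {mapping[n] for n in s_neighbors if n in mapping}
        used = set(mapping.values())
        best, best_score = None, 0.0
        for t, (_, t_neighbors) in tgt.items():
            if t in used:
                continue
            union = projected | t_neighbors
            score = len(projected & t_neighbors) / len(union) if union else 0.0
            if score > best_score:
                best, best_score = t, score
        if best is not None and best_score >= struct_threshold:
            mapping[s] = best
    return mapping

# Hypothetical toy ontologies; a3 and b3 are blank nodes.
src = {"a1": ("Person", {"a2", "a3"}), "a2": ("Address", {"a1"}), "a3": ("", {"a1"})}
tgt = {"b1": ("person", {"b2", "b3"}), "b2": ("address", {"b1"}), "b3": ("", {"b1"})}
mapping = map_ontologies(src, tgt)
print(mapping)
```

Here the blank node `a3` can only be mapped in the second stage, because its sole evidence is that its neighbor `a1` was already mapped to `b1`.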

The Effect of Syllable Frequency, Syllable Type and Final Consonant on Hangeul Word and Pseudo-word Lexical Decision: An Analysis of the Korean Lexicon Project Database (한글 두 글자 단어와 비단어의 어휘판단에 글자 빈도, 글자 유형, 받침이 미치는 영향: KLP 자료의 분석)

  • Myong Seok Shin;ChangHo Park
    • Korean Journal of Cognitive Science / v.34 no.4 / pp.277-297 / 2023
  • This study attempted to find out how lexical decision of two-syllable words or pseudo-words is affected by syllabic information, such as syllable frequency, syllable (i.e. vowel) type, and presence of final consonant (i.e. batchim), through the analysis of the Korean Lexicon Project Database (KLP-DB). Hierarchical regression of RT data showed that lexical decision of words was influenced by the frequency of the first syllable, the syllable type of the first and second syllables, batchim for the first and second syllables, and also by the interaction of the two syllable types and the interaction of syllable frequency and batchim of the second syllable. For pseudo-words, lexical decision was influenced by the frequency of the first and second syllables, syllable type of the first syllable, and batchim for the first and second syllables, and also by the interaction of the two syllable frequencies, the interaction of the two syllable types, and the interaction of syllable frequency and batchim of the first syllable. Word frequency had a strong effect on lexical decision of words, while syllabic information had a stable effect on the lexical decision of pseudo-words. These results indicate that syllabic information should be seriously considered in constructing word and pseudo-word lists and interpreting lexical decision time. Understanding the effect of syllabic information will also contribute to the understanding of the word recognition process.

The Voice Dialing System Using Dynamic Hidden Markov Models and Lexical Analysis (DHMM과 어휘해석을 이용한 Voice dialing 시스템)

  • 최성호;이강성;김순협
    • Journal of the Korean Institute of Telematics and Electronics B / v.28B no.7 / pp.548-556 / 1991
  • In this paper, Korean spoken continuous digits are recognized using a DHMM (Dynamic Hidden Markov Model) and lexical analysis, as a basis for developing a voice dialing system. Speech is segmented into phoneme units and then recognized. The system consists of a segmentation section, a standard-speech design section, a recognition section, and a lexical analysis section. In the segmentation section, speech is segmented using the ZCR, the 0th-order LPC cepstrum, and Ai, a time-varying speech detection parameter. In the standard-speech design section, 19 phonemes or syllables are trained with the DHMM and stored as standard speech. In the recognition section, the phoneme stream is recognized by the Viterbi algorithm. In the lexical decoder section, the finally recognized continuous digits are output. The experiment showed a recognition rate of 85.1% on data from 10 male speakers, each of whom uttered 7 times the 21 classes of 7 continuous digits that combine all of the occurrences.
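For readers unfamiliar with the recognition step, the sketch below shows textbook Viterbi decoding for an ordinary discrete HMM. It is not the paper's DHMM, and the state names (`p1`, `p2`), observation symbols (`x`, `y`, `z`), and all probabilities are hypothetical values chosen only to make the example runnable.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most probable state sequence for a discrete HMM."""
    # best[t][s]: probability of the best path that ends in state s at time t.
    best = [{s: start_p[s] * emit_p[s][observations[0]] for s in states}]
    # back[t][s]: predecessor of s on that best path.
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev = max(states, key=lambda r: best[t - 1][r] * trans_p[r][s])
            best[t][s] = best[t - 1][prev] * trans_p[prev][s] * emit_p[s][observations[t]]
            back[t][s] = prev
    # Backtrack from the most probable final state.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1]

# Hypothetical two-state model standing in for trained phoneme models.
states = ["p1", "p2"]
start_p = {"p1": 0.6, "p2": 0.4}
trans_p = {"p1": {"p1": 0.7, "p2": 0.3}, "p2": {"p1": 0.4, "p2": 0.6}}
emit_p = {
    "p1": {"x": 0.5, "y": 0.4, "z": 0.1},
    "p2": {"x": 0.1, "y": 0.3, "z": 0.6},
}

path = viterbi(["x", "y", "z"], states, start_p, trans_p, emit_p)
print(path)
```

In a recognizer of the kind described, the decoded state sequence would then be passed to the lexical analysis stage, which turns the phoneme stream into digits.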

Extraction of Thematic Roles from Dictionary Definitions

  • Mc-Hale, Michael-L.;Myaeng, Sung-H.
    • Proceedings of the Korean Society for Language and Information Conference / 1996.02a / pp.137-146 / 1996
  • Our research goal has been the development of a domain independent natural language processing (NLP) system suitable for information retrieval. As part of that research, we have investigated ways to automatically extend the semantics of a lexicon derived from machine-readable lexical sources. This paper details the extraction of thematic roles derived from lexical patterns in a machine-readable dictionary.

Deep Lexical Semantics: The Ontological Ascent

  • Hobbs, Jerry R.
    • Proceedings of the Korean Society for Language and Information Conference / 2007.11a / pp.29-41 / 2007
  • Concepts of greater and greater complexity can be constructed by building systems of entities, by relating other entities to that system with a figure-ground relation, by embedding concepts of figure-ground in the concept of change, by embedding that in causality, and by coarsening the granularity and beginning the process over again. This process can be called the Ontological Ascent. It pervades natural language discourse, and suggests that to do lexical semantics properly, we must carefully axiomatize abstract theories of systems of entities, the figure-ground relation, change, causality, and granularity. In this paper, I outline what these theories should look like.

Mapping the Terms of Medicinal Material and Formula Classification to International Standard Terminology

  • Kim, Jin-Hyun;Kim, Chul;Yea, Sang-Jun;Jang, Hyun-Chul;Kim, Sang-Kyun;Kim, Young-Eun;Kim, Chang-Seok;Song, Mi-Young
    • International Journal of Contents / v.7 no.4 / pp.108-115 / 2011
  • The current study aims to analyze the acceptance of International Standard Terminology (IST) related to herbs and formulas used in Korea. It also intends to examine limitations of each term source by linking texts for herbal medicine research and formula research used in schools of oriental medicine with medicinal substance-formula classification names within the IST framework. This study examined 64 medicinal classification names of IST, including synonyms, 41 formula classification names, 65 classification names of "Herbal Medicine Study," 89 medicinal classification names of "Shin's Clinical Herbal Medicine Study," and lastly 83 formula classification names of "Formula Study." Data on their chief virtue, efficacy and characteristics as medicinal substances were extracted from their definitions, and such data were used to perform Chinese character-English mapping using the IST. The outcomes of the mapping were then analyzed in terms of both lexical matching and semantic matching. In terms of classification names for medicinal substances, "Herbal Medicine Study" had 60.0% lexical matching, whereas "Shin's Clinical Herbal Medicine Study" had 48.3% lexical matching. When semantic matching was also applied, "Herbal Medicine Study" showed a value of 87.7% and "Shin's Clinical Herbal Medicine Study" 74.2%. In terms of formula classification names, lexical matching was 28.9% of 83 subjects, and when semantic matching was also considered, the value was 30.1%. When the conceptual elements of this study were applied, some IST terms that are classified with other codes were found to be conceptually consistent, and some terms were not accepted due to different depths in the classification systems of each source.
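As a quick consistency check, the reported percentages can be converted back into counts. The counts below are inferred here, not stated in the abstract, and the denominators for the two medicinal sources (65 and 89 classification names) are assumed from the totals listed earlier in the abstract; only the 83 formula names are given explicitly as the base.

```python
def implied_count(percent, total):
    """Nearest whole number of matches implied by a reported percentage."""
    return round(percent / 100 * total)

def percent_of(count, total):
    """Percentage rounded to one decimal place, as reported in the abstract."""
    return round(100 * count / total, 1)

# (label, reported percentage, assumed denominator)
reported = [
    ("Herbal Medicine Study, lexical", 60.0, 65),
    ("Shin's Clinical Herbal Medicine Study, lexical", 48.3, 89),
    ("Herbal Medicine Study, lexical + semantic", 87.7, 65),
    ("Shin's Clinical Herbal Medicine Study, lexical + semantic", 74.2, 89),
    ("Formula Study, lexical", 28.9, 83),
    ("Formula Study, lexical + semantic", 30.1, 83),
]

counts = []
for label, pct, total in reported:
    n = implied_count(pct, total)
    counts.append(n)
    # The inferred count should reproduce the reported percentage.
    print(f"{label}: {n}/{total} = {percent_of(n, total)}%")
```

Under these assumptions every percentage corresponds to a whole number of terms, and semantic matching adds only one formula classification name beyond lexical matching (25 versus 24 of 83).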

The Syllable Frequency Effect in Semantic Categorization Tasks in Korean

  • Kim, Ji-Hye;Kwon, You-An;Nam, Ki-Chun
    • KSII Transactions on Internet and Information Systems (TIIS) / v.5 no.10 / pp.1879-1890 / 2011
  • Previous studies of syllable frequency effects have proposed that inhibitory effects due to high first-syllable frequency are the product of competition between activated lexical candidates at the lexical level. However, these studies have primarily used lexical decision tasks to examine the nature of syllable frequency effects. This study investigates whether a syllable frequency effect can arise in semantic categorization tasks and whether phonologically or orthographically defined syllables interact with semantically related variables such as morphological family size. If the syllable frequency effect is created by activation and competition at the lexical level, it is highly likely that the effect also appears in semantic categorization tasks. To test this hypothesis, we conducted two experiments. In Experiment 1, morphological family size and phonological syllable frequency were factorially manipulated. In Experiment 2, morphological family size and orthographic syllable frequency were factorially manipulated. The results demonstrate that morphemes have no relationship with phonological syllables but do with orthographic syllables. This suggests that phonological syllables and orthographic syllables play different roles in the syllable frequency effect in the visual word recognition process.