• 제목/요약/키워드: Semantic Phrase

검색결과 62건 처리시간 0.019초

명사 의미 부류를 이용한 연속된 명사열의 구묶음 (Chunking of Contiguous Nouns using Noun Semantic Classes)

  • 안광모;서영훈
    • 한국콘텐츠학회논문지
    • /
    • 제10권3호
    • /
    • pp.10-20
    • /
    • 2010
  • 본 논문에서는 조사가 없는 연속된 명사열 중 하나의 명사처럼 쓰일 수 있는 명사열을 복합명사구라 정의하고, 의미 정보를 이용한 복합명사구의 구묶음 방법을 제시한다. 복합명사구의 구묶음에는 구문분석 말뭉치에서 추출한 명사쌍과 이들의 의미부류정보를 이용한다. 이러한 명사쌍과 의미부류정보는 신뢰도를 위해 세종말뭉치의 구문분석 말뭉치와 상세사전을 기반으로 구축하였다. 이들 정보를 이용한 복합명사구 구묶음 모듈은 길이(명사의 수)가 2 이상인 복합명사구에 대해서도 구묶음을 수행할 수 있다. 복합명사구 구묶음을 위해 '왼쪽명사-오른쪽명사' 쌍 38,940개, '왼쪽명사-오른쪽명사의미부류' 쌍 65,629개, '왼쪽명사의미부류-오른쪽명사' 쌍 46,094개, '왼쪽명사의미부류-오른쪽명사의미부류' 쌍 45,243개의 정보를 구축하여 이용하였다. 실험을 위하여 신문기사의 내용으로 이루어진 세종형태소분석 말뭉치로부터 길이가 3 이상인 조사와 결합하지 않은 연속된 명사열을 포함하는 1,000 문장을 임의로 선별하였으며, 실험결과는 86.89%의 정밀도와 80.48%의 재현율, 그리고 83.56%의 f-measure를 보였다.

영어 어휘 의미 정보와 피치 액센트 (Lexical Semantic Information and Pitch Accent in English)

  • 전윤실;김기호;이용재
    • 음성과학
    • /
    • 제10권3호
    • /
    • pp.187-209
    • /
    • 2003
  • In this paper, we examine if the lexical information of the verb and its noun object affects the pitch accent patterns of the verb phrase focus. Three types of verb-object combinations with different semantic weights are discussed: when the verbs have optional direct objects, when the objects have the greater semantic weight relative to verbs, and when the verbs and the objects have equal semantic weight. Argument-structure-based works note that the pitch accent location in a focused phrase is closely related to the argument structure and contextual information. For example, it has been argued that contextually new noun objects receive accent while given noun objects don't. Contrary to nouns, verbs can be accented or not in verb phrase focus regardless of whether they are given information or new information (Selkirk 1984, 1992). However, the production experiment in this paper shows that the accenting of verbs is not fully optional, but influenced by the lexical semantic information of the verbs. The accenting of noun objects with given information is possible and the deaccenting of new noun objects also occurs depending on the lexical information of the noun objects. The results demonstrate that in addition to argument structure and information by means of context sentences, the lexical semantic information of words influences the pitch accent location in focused phrase.

  • PDF

슬루싱의 두 가지 의미 유형 (Two Semantic Types of Korean Sluicing Constructions)

  • 위혜경
    • 한국언어정보학회지:언어와정보
    • /
    • 제19권2호
    • /
    • pp.109-125
    • /
    • 2015
  • In this study, I attempt to show two points about Korean sluicing. First, the semantic source of the null subject of the copula phrase in Korean sluicing is a null pronoun. This null subject pronoun may refer to either the antecedent indefinite individual or the antecedent event of the preceding clause. Second, depending on the presence/absence of postpositions in the remant wh-phrase, sluicing constructions are classified into two different semantic types: i) an equative clause and ii) a predicational clause.

  • PDF

Phrase-Chunk Level Hierarchical Attention Networks for Arabic Sentiment Analysis

  • Abdelmawgoud M. Meabed;Sherif Mahdy Abdou;Mervat Hassan Gheith
    • International Journal of Computer Science & Network Security
    • /
    • 제23권9호
    • /
    • pp.120-128
    • /
    • 2023
  • In this work, we have presented ATSA, a hierarchical attention deep learning model for Arabic sentiment analysis. ATSA was proposed by addressing several challenges and limitations that arise when applying the classical models to perform opinion mining in Arabic. Arabic-specific challenges including the morphological complexity and language sparsity were addressed by modeling semantic composition at the Arabic morphological analysis after performing tokenization. ATSA proposed to perform phrase-chunks sentiment embedding to provide a broader set of features that cover syntactic, semantic, and sentiment information. We used phrase structure parser to generate syntactic parse trees that are used as a reference for ATSA. This allowed modeling semantic and sentiment composition following the natural order in which words and phrase-chunks are combined in a sentence. The proposed model was evaluated on three Arabic corpora that correspond to different genres (newswire, online comments, and tweets) and different writing styles (MSA and dialectal Arabic). Experiments showed that each of the proposed contributions in ATSA was able to achieve significant improvement. The combination of all contributions, which makes up for the complete ATSA model, was able to improve the classification accuracy by 3% and 2% on Tweets and Hotel reviews datasets, respectively, compared to the existing models.

Automatic Detection of Korean Accentual Phrase Boundaries

  • Lee, Ki-Yeong;Song, Min-Suck
    • The Journal of the Acoustical Society of Korea
    • /
    • 제18권1E호
    • /
    • pp.27-31
    • /
    • 1999
  • Recent linguistic researches have brought into focus the relations between prosodic structures and syntactic, semantic or phonological structures. Most of them prove that prosodic information is available for understanding syntactic, semantic and discourse structures. But this result has not been integrated yet into recent Korean speech recognition or understanding systems. This study, as a part of integrating prosodic information into the speech recognition system, proposes an automatic detection technique of Korean accentual phrase boundaries by using one-stage DP, and the normalized pitch pattern. For making the normalized pitch pattern, this study proposes a method of modified normalization for Korean spoken language. For the experiment, this study employs 192 sentential speech data of 12 men's voice spoken in standard Korean, in which 720 accentual phrases are included, and 74.4% of the accentual phrase boundaries are correctly detected while 14.7% are the false detection rate.

  • PDF

‘-은/는’의 분포에 대하여 (On the Distribution of‘-(N)un’in Korean)

  • 염재일
    • 한국언어정보학회지:언어와정보
    • /
    • 제5권2호
    • /
    • pp.57-74
    • /
    • 2001
  • In this paper, I propose syntactic, semantic and pragmatic restrictions on the distribution of the contrastive topic marker‘-(n)un’in Korean. A contrastive topic is associated with another focus. The association with focus is subject to syntactic islands. On the other hand, there is no syntactic restriction between a phrase attached with‘-(n)un’and a focused expression within the ‘-(n)un’phrase itself. In this area there is a semantic requirement that the alternatives generated by a focused expression be maintained up to the phrase attached with‘-(n)un’. Finally, when‘-(n)un’is used in an embedded clause, the whole sentence becomes natural when the contrastive topic introduced by‘-(n)un’and its alternative contrastive topic, which is presupposed by the contrastive topic marker, jointly constitute a more complex topic which is related to the whole context. And exclusiveness facilitates the formation of the whole complex context.

  • PDF

자유 발화 자료에 나타난 {그래가지고}의 접속 부사화 (A Study on the Lexicalization of {Geuraegajigo} Based on the Spontaneous Speech Corpus)

  • 하영우;신지영
    • 한국어학
    • /
    • 제64권
    • /
    • pp.195-223
    • /
    • 2014
  • The aim of this paper is to study the morphemization of {Geuraegajigo} based on a spontaneous speech corpus. For this purpose, the distributions, the semantic functions, and the intonational phrase pattterns of the connective {Geuraegajigo} have been analyzed based on the corpus. The results are as follow; at first, coalescence that comes with a morphemization process was found, resulting in many variations. Secondly, there are three functions of it: [Direct/Indirect interrelationship], [Enumerate conjunction], and [Discourse marker]. And this semantic/functional diversity has many similarities with conjunctive adverbs. Lastly, intonational phrase patterns of {Geuraegajigo} accord with those of conjunctive adverbs. Especially, the discourse strategic IP pattern is connected with the short variation type. In conclusion, {Geuraegajigo} has finished turning into a conjunctive adverb through morphemization.

한국농촌계획 온톨로지 구축을 위한 상호정보 기반 단어연결망 분석 (Word Network Analysis based on Mutual Information for Ontology of Korean Rural Planning)

  • 이제명
    • 농촌계획
    • /
    • 제23권3호
    • /
    • pp.37-51
    • /
    • 2017
  • There has been a growing concern on ontology especially in recent knowledge-based industry and defining a field-customized semantic word network is essential for building it. In this paper, a word network for ontology is established with 785 publications of Korean Society of Rural Planning(KSRP), from 1995 to 2017. Semantic relationships between words in the publications were quantitatively measured with the 'normalized pointwise mutual information' based on the information theory. Appearance and co-appearance frequencies of nouns and adjectives in phrases are analyzed based on the assumption that a 'noun phrase' represents a single 'concept'. The word network of KSRP was compared with that of $WordNet^{TM}$, a world-wide thesaurus network, for the verification. It is proved that the KSRP's word network, established in this paper, provides words' semantic relationships based on the common concepts of Korean rural planning research field. With the results, it is expecting that the established word network can present more opportunity for preparation of the fourth industrial revolution to the field of the Korean rural planning.

THE SEMANTIC STRUCTURE OF JAPANESE ADJECTIVES WITH -TAI DERIVATIONAL SUFFIX

  • Ikeya, Akira
    • 한국언어정보학회:학술대회논문집
    • /
    • 한국언어정보학회 1996년도 Language, Information and Computation = Selected Papers from the 11th Pacific Asia Conference on Language, Information and Computation, Seoul
    • /
    • pp.157-166
    • /
    • 1996
  • This paper treats the japanese adjective phrase forming derivational suffix -tai from a new point of view: firstly it tries to approach from a semantic standpoint by applying the proposal made in Ikeya (1991). It will be shown that adjective phrases formed by -tai fits nicely with the semantic structure proposed by Ikeya. Secondly, we attempt to 'derive' -tai sentences by adopting a basic framework of HPSG so that we can 'derive' them without having recourse to transformational operations, that is, in a monostratal way. In tackling the problem we have tried to incorporate many ideas proposed so far on this issue.

  • PDF

라틴아메리카 일부 지역에서 사용되는 양보 접속사구 todavía que의 특성 (The Concessive Conjunctive Phrase todavía que in Some Areas of Latin America)

  • 곽재용
    • 비교문화연구
    • /
    • 제46권
    • /
    • pp.143-171
    • /
    • 2017
  • 본 연구는 라틴아메리카의 일부 지역인 안데스와 리버플레이트 지역에서 양보접속사구로 기능하는 [todavía que]에 대한 형태-통사적 그리고 의미적 특성을 연구하고자 한다. Real Academia $Espa{\tilde{n}}ola$(2009)에 따르면 부사 todavía는 초점과 관련된 다양한 맥락에서 사용되며 특히 안데스와 리버플레이트 지역에서는 [todavía que]가 양보구문을 형성하며 첨가적 해석을 갖는다고 한다. 케추아어의 접미사 '-raq'과 아이마라어의 접미사 '-raki '는 스페인어의 부사 todavía와 동일한 의미를 갖는다. 그러나 이 두 원주민 언어에서는 상기의 접미사가 첨가 초점사의 기능을 수행하지만 스페인어에서는 첨가 초점사로 쓰이는 접미사는 없으므로 이러한 범주적 불일치로 인해 [todavía que]구조에서는 todavía가 부사의 형태로 접속사 que와 함께 첨가 초점사로 사용된다. 본 연구는 이러한 형태-통사적 그리고 의미적 특성은 두 원주민어들과 스페인어의 언어접촉 현상에서 비롯된 것으로 주장한다. 또한 이러한 주장에 기초하여 스페인어의 부사 todavía가 어원적으로 전칭양화사와 관련되는 사실과 척도와 관련된 자질을 내포하는 특성에 기인하여 [todavía que]는 척도 첨가의 의미를 갖는 양보의 접속사로 기능한다고 주장하고자 한다.