• Title/Summary/Keyword: Semantic Phrase

Search Result 62, Processing Time 0.022 seconds

Lexical Semantic Information and Pitch Accent in English (영어 어휘 의미 정보와 피치 액센트)

  • Jeon, Yoon-Shil;Kim, Kee-Ho;Lee, Yong-Jae
    • Speech Sciences
    • /
    • v.10 no.3
    • /
    • pp.187-209
    • /
    • 2003
  • In this paper, we examine if the lexical information of the verb and its noun object affects the pitch accent patterns of the verb phrase focus. Three types of verb-object combinations with different semantic weights are discussed: when the verbs have optional direct objects, when the objects have the greater semantic weight relative to verbs, and when the verbs and the objects have equal semantic weight. Argument-structure-based works note that the pitch accent location in a focused phrase is closely related to the argument structure and contextual information. For example, it has been argued that contextually new noun objects receive accent while given noun objects don't. Contrary to nouns, verbs can be accented or not in verb phrase focus regardless of whether they are given information or new information (Selkirk 1984, 1992). However, the production experiment in this paper shows that the accenting of verbs is not fully optional, but influenced by the lexical semantic information of the verbs. The accenting of noun objects with given information is possible and the deaccenting of new noun objects also occurs depending on the lexical information of the noun objects. The results demonstrate that in addition to argument structure and information by means of context sentences, the lexical semantic information of words influences the pitch accent location in focused phrase.

  • PDF

Chunking of Contiguous Nouns using Noun Semantic Classes (명사 의미 부류를 이용한 연속된 명사열의 구묶음)

  • Ahn, Kwang-Mo;Seo, Young-Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.3
    • /
    • pp.10-20
    • /
    • 2010
  • This paper presents chunking strategy of a contiguous nouns sequence using semantic class. We call contiguous nouns which can be treated like a noun the compound noun phrase. We use noun pairs extracted from a syntactic tagged corpus and their semantic class pairs for chunking of the compound noun phrase. For reliability, these noun pairs and semantic classes are built from a syntactic tagged corpus and detailed dictionary in the Sejong corpus. The compound noun phrase of arbitrary length can also be chunked by these information. The 38,940 pairs of 'left noun - right noun', 65,629 pairs of 'left noun - semantic class of right noun', 46,094 pairs of 'semantic class of left noun - right noun', and 45,243 pairs of 'semantic class of left noun - semantic class of right noun' are used for compound noun phrase chunking. The test data are untrained 1,000 sentences with contiguous nouns of length more than 2randomly selected from Sejong morphological tagged corpus. Our experimental result is 86.89% precision, 80.48% recall, and 83.56% f-measure.

Two Semantic Types of Korean Sluicing Constructions (슬루싱의 두 가지 의미 유형)

  • Wee, Hae-Kyung
    • Language and Information
    • /
    • v.19 no.2
    • /
    • pp.109-125
    • /
    • 2015
  • In this study, I attempt to show two points about Korean sluicing. First, the semantic source of the null subject of the copula phrase in Korean sluicing is a null pronoun. This null subject pronoun may refer to either the antecedent indefinite individual or the antecedent event of the preceding clause. Second, depending on the presence/absence of postpositions in the remant wh-phrase, sluicing constructions are classified into two different semantic types: i) an equative clause and ii) a predicational clause.

  • PDF

Phrase-Chunk Level Hierarchical Attention Networks for Arabic Sentiment Analysis

  • Abdelmawgoud M. Meabed;Sherif Mahdy Abdou;Mervat Hassan Gheith
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.9
    • /
    • pp.120-128
    • /
    • 2023
  • In this work, we have presented ATSA, a hierarchical attention deep learning model for Arabic sentiment analysis. ATSA was proposed by addressing several challenges and limitations that arise when applying the classical models to perform opinion mining in Arabic. Arabic-specific challenges including the morphological complexity and language sparsity were addressed by modeling semantic composition at the Arabic morphological analysis after performing tokenization. ATSA proposed to perform phrase-chunks sentiment embedding to provide a broader set of features that cover syntactic, semantic, and sentiment information. We used phrase structure parser to generate syntactic parse trees that are used as a reference for ATSA. This allowed modeling semantic and sentiment composition following the natural order in which words and phrase-chunks are combined in a sentence. The proposed model was evaluated on three Arabic corpora that correspond to different genres (newswire, online comments, and tweets) and different writing styles (MSA and dialectal Arabic). Experiments showed that each of the proposed contributions in ATSA was able to achieve significant improvement. The combination of all contributions, which makes up for the complete ATSA model, was able to improve the classification accuracy by 3% and 2% on Tweets and Hotel reviews datasets, respectively, compared to the existing models.

Automatic Detection of Korean Accentual Phrase Boundaries

  • Lee, Ki-Yeong;Song, Min-Suck
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.1E
    • /
    • pp.27-31
    • /
    • 1999
  • Recent linguistic researches have brought into focus the relations between prosodic structures and syntactic, semantic or phonological structures. Most of them prove that prosodic information is available for understanding syntactic, semantic and discourse structures. But this result has not been integrated yet into recent Korean speech recognition or understanding systems. This study, as a part of integrating prosodic information into the speech recognition system, proposes an automatic detection technique of Korean accentual phrase boundaries by using one-stage DP, and the normalized pitch pattern. For making the normalized pitch pattern, this study proposes a method of modified normalization for Korean spoken language. For the experiment, this study employs 192 sentential speech data of 12 men's voice spoken in standard Korean, in which 720 accentual phrases are included, and 74.4% of the accentual phrase boundaries are correctly detected while 14.7% are the false detection rate.

  • PDF

On the Distribution of‘-(N)un’in Korean (‘-은/는’의 분포에 대하여)

  • 염재일
    • Language and Information
    • /
    • v.5 no.2
    • /
    • pp.57-74
    • /
    • 2001
  • In this paper, I propose syntactic, semantic and pragmatic restrictions on the distribution of the contrastive topic marker‘-(n)un’in Korean. A contrastive topic is associated with another focus. The association with focus is subject to syntactic islands. On the other hand, there is no syntactic restriction between a phrase attached with‘-(n)un’and a focused expression within the ‘-(n)un’phrase itself. In this area there is a semantic requirement that the alternatives generated by a focused expression be maintained up to the phrase attached with‘-(n)un’. Finally, when‘-(n)un’is used in an embedded clause, the whole sentence becomes natural when the contrastive topic introduced by‘-(n)un’and its alternative contrastive topic, which is presupposed by the contrastive topic marker, jointly constitute a more complex topic which is related to the whole context. And exclusiveness facilitates the formation of the whole complex context.

  • PDF

A Study on the Lexicalization of {Geuraegajigo} Based on the Spontaneous Speech Corpus (자유 발화 자료에 나타난 {그래가지고}의 접속 부사화)

  • Ha, Youngwoo;Shin, Jiyoung
    • Korean Linguistics
    • /
    • v.64
    • /
    • pp.195-223
    • /
    • 2014
  • The aim of this paper is to study the morphemization of {Geuraegajigo} based on a spontaneous speech corpus. For this purpose, the distributions, the semantic functions, and the intonational phrase pattterns of the connective {Geuraegajigo} have been analyzed based on the corpus. The results are as follow; at first, coalescence that comes with a morphemization process was found, resulting in many variations. Secondly, there are three functions of it: [Direct/Indirect interrelationship], [Enumerate conjunction], and [Discourse marker]. And this semantic/functional diversity has many similarities with conjunctive adverbs. Lastly, intonational phrase patterns of {Geuraegajigo} accord with those of conjunctive adverbs. Especially, the discourse strategic IP pattern is connected with the short variation type. In conclusion, {Geuraegajigo} has finished turning into a conjunctive adverb through morphemization.

Word Network Analysis based on Mutual Information for Ontology of Korean Rural Planning (한국농촌계획 온톨로지 구축을 위한 상호정보 기반 단어연결망 분석)

  • Lee, Jemyung
    • Journal of Korean Society of Rural Planning
    • /
    • v.23 no.3
    • /
    • pp.37-51
    • /
    • 2017
  • There has been a growing concern on ontology especially in recent knowledge-based industry and defining a field-customized semantic word network is essential for building it. In this paper, a word network for ontology is established with 785 publications of Korean Society of Rural Planning(KSRP), from 1995 to 2017. Semantic relationships between words in the publications were quantitatively measured with the 'normalized pointwise mutual information' based on the information theory. Appearance and co-appearance frequencies of nouns and adjectives in phrases are analyzed based on the assumption that a 'noun phrase' represents a single 'concept'. The word network of KSRP was compared with that of $WordNet^{TM}$, a world-wide thesaurus network, for the verification. It is proved that the KSRP's word network, established in this paper, provides words' semantic relationships based on the common concepts of Korean rural planning research field. With the results, it is expecting that the established word network can present more opportunity for preparation of the fourth industrial revolution to the field of the Korean rural planning.

THE SEMANTIC STRUCTURE OF JAPANESE ADJECTIVES WITH -TAI DERIVATIONAL SUFFIX

  • Ikeya, Akira
    • Proceedings of the Korean Society for Language and Information Conference
    • /
    • 1996.02a
    • /
    • pp.157-166
    • /
    • 1996
  • This paper treats the japanese adjective phrase forming derivational suffix -tai from a new point of view: firstly it tries to approach from a semantic standpoint by applying the proposal made in Ikeya (1991). It will be shown that adjective phrases formed by -tai fits nicely with the semantic structure proposed by Ikeya. Secondly, we attempt to 'derive' -tai sentences by adopting a basic framework of HPSG so that we can 'derive' them without having recourse to transformational operations, that is, in a monostratal way. In tackling the problem we have tried to incorporate many ideas proposed so far on this issue.

  • PDF

The Concessive Conjunctive Phrase todavía que in Some Areas of Latin America (라틴아메리카 일부 지역에서 사용되는 양보 접속사구 todavía que의 특성)

  • Kwak, Jae-Yong
    • Cross-Cultural Studies
    • /
    • v.46
    • /
    • pp.143-171
    • /
    • 2017
  • In this paper, we present a morphosyntactic and semantic study of the concessive conjunctive phrase [todavía que] in the Andean and River Plate areas of Latin America. Real Academia $Espa{\tilde{n}}ola$(2009) describes that the adverb todavía occurs in a variety of focus-sensitive environments and specially, in the Andean and River Plate areas, the phrase [todavía que] participates in the concessive constructions and triggers an additive interpretation. It is well known that the Quechuan suffix '-raq' and the Aymaran suffix '-raki' have the meaning of todavía which is characterized as a additive particle. Due to the absence of the paratactic relationship between the mentioned Spanish adverb and those indigenous suffixes, the adverb todavía in [todavía que] is used as a additive focalizer. We assume that these morphosyntactic and semantic properties are derived from the language contact of Quechua-Spanish and Aymara Spanish. Based on this assumption, we will further assume that the etymological peculiarity of todavía as a universal quantifier is closely related to the reading of [todavía que] as a Scalar Additive Concessive Conjunctive Phrase.