• Title/Summary/Keyword: Word order

Search Result 1,011, Processing Time 0.025 seconds

The Strength of the Relationship between Semantic Similarity and the Subcategorization Frames of the English Verbs: a Stochastic Test based on the ICE-GB and WordNet (영어 동사의 의미적 유사도와 논항 선택 사이의 연관성 : ICE-GB와 WordNet을 이용한 통계적 검증)

  • Song, Sang-Houn;Choe, Jae-Woong
    • Language and Information
    • /
    • v.14 no.1
    • /
    • pp.113-144
    • /
    • 2010
  • The primary goal of this paper is to find a feasible way to answer the question: Does the similarity in meaning between verbs relate to the similarity in their subcategorization? In order to answer this question in a rather concrete way on the basis of a large set of English verbs, this study made use of various language resources, tools, and statistical methodologies. We first compiled a list of 678 verbs that were selected from the most and second most frequent word lists from the Colins Cobuild English Dictionary, which also appeared in WordNet 3.0. We calculated similarity measures between all the pairs of the words based on the 'jcn' algorithm (Jiang and Conrath, 1997) implemented in the WordNet::Similarity module (Pedersen, Patwardhan, and Michelizzi, 2004). The clustering process followed, first building similarity matrices out of the similarity measure values, next drawing dendrograms on the basis of the matricies, then finally getting 177 meaningful clusters (covering 437 verbs) that passed a certain level set by z-score. The subcategorization frames and their frequency values were taken from the ICE-GB. In order to calculate the Selectional Preference Strength (SPS) of the relationship between a verb and its subcategorizations, we relied on the Kullback-Leibler Divergence model (Resnik, 1996). The SPS values of the verbs in the same cluster were compared with each other, which served to give the statistical values that indicate how much the SPS values overlap between the subcategorization frames of the verbs. Our final analysis shows that the degree of overlap, or the relationship between semantic similarity and the subcategorization frames of the verbs in English, is equally spread out from the 'very strongly related' to the 'very weakly related'. Some semantically similar verbs share a lot in terms of their subcategorization frames, and some others indicate an average degree of strength in the relationship, while the others, though still semantically similar, tend to share little in their subcategorization frames.

  • PDF

Focus and Particle Constructions

  • Lee, Wonbin
    • Korean Journal of English Language and Linguistics
    • /
    • v.4 no.2
    • /
    • pp.195-227
    • /
    • 2004
  • This paper concerns the issue related to the focus phenomena with a particular reference to the two alternating orders (continuous vs. discontinuous orders) in particle constructions in English. To explain the alternation of word order in particle constructions, I will argue that the choice of word order is closely related to the focus property of the object DPs. Following Drubig (2003), I will assume that focus-feature is taken to be a syntactic feature assigned freely to a lexical head in the process of the mapping into Lexical Array (LA) from the lexicon (LEX). I argue that the focus-marked object DP cannot move out of its focus domain and thus the continuous order is derived. In the case of non-focus-marked object DP, however, the object DP moves out of VP in order to receive an appropriate interpretation. As a result, the discontinuous order is derived.

  • PDF

A Study on Revisiting Intentions of Medical Institution Customers and Service Quality Factors Influencing Word-of-Mouth Effect (의료기관 이용고객의 재방문의도와 구전효과에 영향을 미치는 서비스 품질요인에 관한 연구)

  • Kim, Kyoung-Hwan;Chang, Young-Il;Jung, You-Soo
    • Journal of Korean Society for Quality Management
    • /
    • v.39 no.1
    • /
    • pp.15-23
    • /
    • 2011
  • The biggest change in the current medical service market is the shift from producer-based approach to customer-based approach. Thus, there is a high necessity for an introduction of market-oriented and customer-oriented marketing activities in medical institutions. Especially, revisitation and positive word-of-mouth of customers are the most effective marketing methods of consistently obtaining customers and drawing new loyal customers. Hence, the factors influencing the reuse of medical services and word-of-mouth are demonstrated using SERVQUAL, and the results show that the service factors influencing reuse intentions on medical services are assurance, responsiveness, and tangibles and the factors strengthening word-of-mouth are empathy and assurance on the services provided by medical institutions. This study has determined that revistation and word-of-mouth of customers are not only influenced by satisfaction on medical services but by the quality itself as well. Therefore, medical service providers need to pursue standardized strategies in order to enhance the quality of medical services simultaneously in addition to simply increasing the customer satisfaction level.

The Effects of Misalignment between Syllable and Word Onsets on Word Recognition in English (음절의 시작과 단어 시작의 불일치가 영어 단어 인지에 미치는 영향)

  • Kim, Sun-Mi;Nam, Ki-Chun
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.61-71
    • /
    • 2009
  • This study aims to investigate whether the misalignment between syllable and word onsets due to the process of resyllabification affects Korean-English late bilinguals perceiving English continuous speech. Two word-spotting experiments were conducted. In Experiment 1, misalignment conditions (resyllabified conditions) were created by adding CVC contexts at the beginning of vowel-initial words and alignment conditions (non-resyllabified conditions) were made by putting the same CVC contexts at the beginning of consonant-initial words. The results of Experiment 1 showed that detections of targets in alignment conditions were faster and more correct than in misalignment conditions. Experiment 2 was conducted in order to avoid any possibilities that the results of Experiment 1 were due to consonant-initial words being easier to recognize than vowel-initial words. For this reason, all the experimental stimuli of Experiment 2 were vowel-initial words preceded by CVC contexts or CV contexts. Experiment 2 also showed misalignment cost when recognizing words in resyllabified conditions. These results indicate that Korean listeners are influenced by misalignment between syllable and word onsets triggered by a resyllabification process when recognizing words in English connected speech.

  • PDF

Acoustic characteristics and perceptual cues for Korean Stops (한국어 파열음의 음향적 특성과 지각 단서)

  • Lee, Kyung-Hee;Jung, Myung-Sook
    • Speech Sciences
    • /
    • v.7 no.2
    • /
    • pp.139-155
    • /
    • 2000
  • The aim of this research is to investigate acoustic characteristics of three different types of Korean Stops-plain, tensed, and aspirated-, and employ these as a base to determine which one(s) can be used as perceptual cues. In this paper, we have examined acoustic characteristics of Korean Stops, especially voice onset time(VOT), closure duration(CD), degree of pitch of following vowels and differences in the intensity of the Stops build-up after the onset of voicing. From the above characteristics, differences can be made between word-initial and word-medial positions. That is to say, in word-initial position, the three Korean Stops are distinguished by VOT and pitch, whereas in word-medial by CD, VOT and pitch. However, the acoustic characteristics do not have the same value as perceptual cues. In both word-initial, and medial positions, the immediately following vowels play the most important role in perceiving Korean Stops. And in case of word'-medial positions,. CD and VOT also play important perceptual roles. In order to have a more fine-grained distinction among Korean Stops, we think future research should be done to investigate which factor(s) of the following vowels is/are the most determinative perceptual cue(s). However, based on our investigation, we may conclude that it is highly plausible that pitch can be one of the most important perceptual cues when distinguishing the three Korean Stops.

  • PDF

Multilingual Word Translation Service based on Word Semantic Analysis (어휘의미분석 기반 다국어 어휘대역 서비스)

  • Ryu, Pum-Mo
    • Journal of Digital Contents Society
    • /
    • v.19 no.1
    • /
    • pp.75-83
    • /
    • 2018
  • Multicultural family members have difficulty in educating their children due to language differences. In order to solve these difficulties, it is necessary to provide smart translation services that enable them easily and quickly access real-life vocabularies. However, the current automatic translation technology is being developed in dominant languages such as English, Chinese, and Japanese. There are also limitations to translating special-purpose terms such as documents of schools and instructions of public institutions. In this study, we propose a real-time automatic word translation service for multicultural family members who understand beginner level Korean. The service automatically analyzes the semantics of each word in the Korean sentences and provides a word-by-word translation. This study includes semantic analysis research for Korean language, building multilingual translation knowledge, and fusion study of language education. We evaluated the word translation service for migrant women from Vietnam and Japan and obtained meaningful evaluation results.

Variable Vocabulary Word Recognizer using Phonetic Knowledge-based Allophone Model (음성학적 지식 기반 변이음 모델을 이용한 가변 어휘 단어 인식기)

  • Kim, Hoi-Rin;Lee, Hang-Seop
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2
    • /
    • pp.31-35
    • /
    • 1997
  • In this paper, we propose a variable vocabulary word recognizer that is able to recognize new words not exist in training data. For the variable vocabulary word recognizer, we must have an on-line lexicon generator to transform new candidate words to the corresponding pronunciation sequences of phones without any large lexicon table. And, we also must make outputs. In order to model the phones and allophones reliably, we define Korean allophones by triphone clustering based on phonetic knowledge of preceding and succeeding phones of each phone. Using the clustering method, we generated 1,548 allophones with POW (Phonetically Optimized Words) 3,848 word DB. We evaluated the proposed word recognizer with POW 3,848 DB, PBW (Phonetically Balanced Words) 445 DB, and 244 word DB in hotel reservation task. Experimental results showed word recognition accuracy of 79.6% for the POW DB corresponding to vocabulary-dependent case, 79.4% in case of 445 word lexicon and 88.9% in case of 100 word lexicon for the PBW DB, and 71.4% for the hotel reservation DB corresponding to vocabulary-independent case.

  • PDF

Analyzing the Sentence Structure for Automatic Identification of Metadata Elements based on the Logical Semantic Structure of Research Articles (연구 논문의 의미 구조 기반 메타데이터 항목의 자동 식별 처리를 위한 문장 구조 분석)

  • Song, Min-Sun
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.3
    • /
    • pp.101-121
    • /
    • 2018
  • This study proposes the analysis method in sentence semantics that can be automatically identified and processed as appropriate items in the system according to the composition of the sentences contained in the data corresponding to the logical semantic structure metadata of the research papers. In order to achieve the purpose, the structure of sentences corresponding to 'Research Objectives' and 'Research Outcomes' among the semantic structure metadata was analyzed based on the number of words, the link word types, the role of many-appeared words in sentences, and the end types of a word. As a result of this study, the number of words in the sentences was 38 in 'Research Objectives' and 212 in 'Research Outcomes'. The link word types in 'Research Objectives' were occurred in the order such as Causality, Sequence, Equivalence, In-other-word/Summary relation, and the link word types in 'Research Outcomes' were appeared in the order such as Causality, Equivalence, Sequence, In-other-word/Summary relation. Analysis target words like '역할(Role)', '요인(Factor)' and '관계(Relation)' played a similar role in both purpose and result part, but the role of '연구(Study)' was little different. Finally, the verb endings in sentences were appeared many times such as '~고자', '~였다' in 'Research Objectives', and '~었다', '~있다', '~였다' in 'Research Outcomes'. This study is significant as a fundamental research that can be utilized to automatically identify and input the metadata element reflecting the common logical semantics of research papers in order to support researchers' scholarly sensemaking.

A postprocessing method for korean optical character recognition using eojeol information (어절 정보를 이용한 한국어 문자 인식 후처리 기법)

  • 이영화;김규성;김영훈;이상조
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.35C no.2
    • /
    • pp.65-70
    • /
    • 1998
  • In this paper, we will to check and to correct mis-recognized word using Eojeol information. First, we divided into 16 classes that constituents in a Eojeol after we analyzed Korean statement into Eojeol units. Eojeol-Constituent state diagram constructed these constitutents, find the Left-Right Connectivity Information. As analogized the speech of connectivity information, reduced the number of cadidate words and restricted case of morphological analysis for mis-recognition Eojeol. Then, we improved correction speed uisng heuristic information as the adjacency information for Eojeol each other. In the correction phase, construct Reverse-Order Word Dictionary. Using this, we can trace word dictionary regardless of mis-recongnition word position. Its results show that improvement of recognition rate from 97.03% to 98.02% and check rate, reduction of chadidata words and morpholgical analysis cases.

  • PDF

A study on character segmentation and determination of linguistic type for recognition of on-line cursive characters (온라인 연속 필기 문자의 인식을 위한 문자간 구분 및 종류의 결정에 관한 연구)

  • 박강령;전병환;김창수;김우성;김재희
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.34C no.7
    • /
    • pp.61-69
    • /
    • 1997
  • With the vigorous researches in the character recognition, the need to recognize run-on multilingual handwritten characters is increasing to provide uses with more comfortable PUI(pen user interface) environments. In general, many intermediate word candidates word candidates are generated in run-on multilingual recognition because there is no information of ending position and linguistic kind of character. To remove unnecessary word candidates which are generated in run-on multilingual recognition, we classify them into two groups and select the best candidate among the word candidates in the group where the final characater is completed using 5 attributes. In this research, we propose a method in order to select the best one candidate. It is called WRM (Weighted ranking method). The weights are adaptively trained by LMS(Least mean square) learning rule. Results show that the abilities of decision makin gusing weights are much better than those not using weights.

  • PDF