• 제목/요약/키워드: verb

검색결과 342건 처리시간 0.032초

지식베이스를 이용한 임베디드용 연속음성인식의 어휘 적용률 개선 (Vocabulary Coverage Improvement for Embedded Continuous Speech Recognition Using Knowledgebase)

  • 김광호;임민규;김지환
    • 대한음성학회지:말소리
    • /
    • 제68권
    • /
    • pp.115-126
    • /
    • 2008
  • In this paper, we propose a vocabulary coverage improvement method for embedded continuous speech recognition (CSR) using knowledgebase. A vocabulary in CSR is normally derived from a word frequency list. Therefore, the vocabulary coverage is dependent on a corpus. In the previous research, we presented an improved way of vocabulary generation using part-of-speech (POS) tagged corpus. We analyzed all words paired with 101 among 152 POS tags and decided on a set of words which have to be included in vocabularies of any size. However, for the other 51 POS tags (e.g. nouns, verbs), the vocabulary inclusion of words paired with such POS tags are still based on word frequency counted on a corpus. In this paper, we propose a corpus independent word inclusion method for noun-, verb-, and named entity(NE)-related POS tags using knowledgebase. For noun-related POS tags, we generate synonym groups and analyze their relative importance using Google search. Then, we categorize verbs by lemma and analyze relative importance of each lemma from a pre-analyzed statistic for verbs. We determine the inclusion order of NEs through Google search. The proposed method shows better coverage for the test short message service (SMS) text corpus.

  • PDF

Processing Nominal Suffixes in Korean: Evidence from Priming Experiments

  • Ahn, Hee-Don;An, Duk-Ho;Choi, Jung-Yun;Hwang, Jong-Bai;Jeon, Moon-Gee;Kim, Ji-Hyon
    • 한국언어정보학회지:언어와정보
    • /
    • 제15권1호
    • /
    • pp.1-12
    • /
    • 2011
  • This study investigates morphologically complex nouns in Korean through a series of priming studies. Two experiments examined whether morphological affixes on Korean nouns were decomposed or processed as a whole. Two types of morphological affixes were examined: morpho-syntactic case markers and the plural marker '-tul'. Results showed that priming occurred for the plural marker with SOAs of 80 ms and 160 ms, but no priming occurred for the morpho-syntactic case markers. These results suggest that the morphological processing for these two types of affixes differ. We argue that Korean nouns with the plural suffix are decomposed into the stem and affix, supporting the Decomposition Model (Pinker & Ullman, 2002). We suggest that while plural markers are truly morphological affixes, case markers in Korean are morpho-syntactic, and thus presuppose the existence of other syntactic elements, such as the matrix verb, hence the lack of priming effects.

  • PDF

Narrative and Grammatical Analyses of Story-retelling in Chinese Speakers of Korean as a Second Language

  • 백은아;손은남;강수균;박선희;이현혜;최경희
    • 대한음성학회지:말소리
    • /
    • 제56호
    • /
    • pp.127-134
    • /
    • 2005
  • Although the narrative development and the acquisition of the Korean grammatical morphemes by monolingual Korean-speaking children have been studied extensively, little is known about the narrative characteristics and the processes through which native speakers of other languages (L2 speakers) use the Korean grammatical morphemes. To understand the similarities and differences between L1 and L2 narrative skills and Korean grammatical morpheme use, 13 native Chinese-speaking college students who are learning Korean as a second language were studied. L2 participants used significantly fewer words, subordinate clauses, connective morphological endings, and pronouns per T-unit. Their speech also illustrated significantly more omission and confusion (substitution) errors in the use of auxiliary words and verb endings. Some of the syntactic and morphological factors need to be considered for the intervention of speakers with limited Korean proficiency.

  • PDF

A Model-Based Method for Information Alignment: A Case Study on Educational Standards

  • Choi, Namyoun;Song, Il-Yeol;Zhu, Yongjun
    • Journal of Computing Science and Engineering
    • /
    • 제10권3호
    • /
    • pp.85-94
    • /
    • 2016
  • We propose a model-based method for information alignment using educational standards as a case study. Discrepancies and inconsistencies in educational standards across different states/cities hinder the retrieval and sharing of educational resources. Unlike existing educational standards alignment systems that only give binary judgments (either "aligned" or "not-aligned"), our proposed system classifies each pair of educational standard statements in one of seven levels of alignments: Strongly Fully-aligned, Weakly Fully-aligned, Partially-$aligned^{***}$, Partially-$aligned^{**}$, Partially-$aligned^*$, Poorly-aligned, and Not-aligned. Such a 7-level categorization extends the notion of binary alignment and provides a finer-grained system for comparing educational standards that can broaden categories of resource discovery and retrieval. This study continues our previous use of mathematics education as a domain, because of its generally unambiguous concepts. We adopt a materialization pattern (MP) model developed in our earlier work to represent each standard statement as a verb-phrase graph and a noun-phrase graph; we align a pair of statements using graph matching based on Bloom's Taxonomy, WordNet, and taxonomy of mathematics concepts. Our experiments on data sets of mathematics educational standards show that our proposed system can provide alignment results with a high degree of agreement with domain expert's judgments.

논항변이와 의미차이 (Argument Alternations with Meaning Differences)

  • 김현효
    • 한국산학기술학회논문지
    • /
    • 제3권4호
    • /
    • pp.240-244
    • /
    • 2002
  • 영어문장의 논항변이는 오랫동안 언어학자들의 주목을 받아온 재미있는 현상이다. 대표적으로 인용되는 예문은 “load hay onto the truck" 과 “load the truck with hay" 간의 의미차이 그리고 “The garden swarms with bees" 또는 “Bees are swarming in the garden" 과 “The garden swarms with bees" 의 의미차이이다. 이 문장들은 언어학이론상 설명하기 곤란한 의미차이를 가져오게 된다. 통사적인 논항의 재배치는 미묘하지만 엄연한 동사의 의미차이를 보인다. 이를 학자에 따라 “Double-faced" 또는 “Verbal diathesis"라고도 하는데, 본고에서는 Dowty(200l)의 견해를 따르며, 관련있는 학자들의 견해를 반영한다. 나아가 “Argument alternation" 의 행위자 주어 (A-subject)와 위치주어 (L-subject) 형 문장을 중심으로 문장을 분석하고 상이한 특질을 기술하여 Dowty(2001)식 분석의 이론적 타당성을 확인하게 된다.

  • PDF

한국어 어순재배치(scrambling) 문장의 신경언어학적 연구 (A Neurolinguistic Study of Korean Scrambling: An Event-related Potentials(EPR) based Study)

  • 황유미;이갑희;윤영도
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
    • /
    • 한국정보과학회언어공학연구회 2012년도 제24회 한글 및 한국어 정보처리 학술대회
    • /
    • pp.29-34
    • /
    • 2012
  • 본 연구는 한국어 어순재배치(scrambling) 문장의 이해 과정에서 발생되는 대뇌 활동을 사건관련전위(event-related Potentials; ERPs) 이용하여 살펴보기 위하여 실시되었다. 네 개의 어절로 구성된 표준 어순 문장(일년만에 마님이 영감을 만났어요.)과 어순재배치 문장(일년만에 영감을 마님이 만났어요.)을 어절별로 제시하고 첫 번째 명사구(NP1), 두 번째 명사구(NP2), 동사(Verb)의 시작점(onset)에서 측정한 뇌파를 비교하였다. 뇌파의 분석은 대뇌 영역을 중심선(midline), 중앙(medial), 편측(lateral)로 나누어 전후 분포(anterior-posterior distribution)와 정중선(midline)의 열에 의해 좌우 반구(hemisphere)로 분리하여 분석하였다. 분석 결과 중심선 영역에서 표준 어순에 비해 뒤섞기 어순에서 300-500ms 시간 창(time window)에서 큰 부적 전위(negative potential)가 관찰되었으며 이는 어순재배치로 인한 N400효과로 해석되며 P600효과는 관찰되지 않았다. 특히 첫 번째 명사구에서 문장유형(표준 어순 vs. 어순재배치)의 차이가 가장 크게 관찰되었으며 두 번째 명사구에서는 중앙에서 문장유형과 반구(좌우반구)의 상호작용이 관찰되었고, 동사에서는 문장유형과 반구, 문장유형과 전극 위치의 전후 분포와의 상호작용이 관찰되었다. 본 연구 결과에서 관찰된 N400효과는 독일어와 일본어를 대상으로 한 어순재배치 연구 결과와 유사하며 한국어 어순재배치 문장에 관한 사건관련 전위를 고찰하였다는 점에서 의의가 있다.

  • PDF

Bridging the Gap between Grammar and Conversation in Korean College English Conversation Classes

  • Lee, Eun-Ah
    • 영어어문교육
    • /
    • 제5호
    • /
    • pp.27-48
    • /
    • 1999
  • College students frequently feel their grammar knowledge from primary and middle school is not useful when they are asked to speak in college conversation classes. Because of their frustration at their lack of communicational ability as well as inappropriate teaching methods and class textbooks that have little to do with the student's major course of study, the student often has a low motivation to study. It is not uncommon for students to seek English education outside of their college classrooms by going to language institutes or studying abroad. College teachers need to find a way to use the student's background in grammar from primary and secondary schools. Despite the student's sentiment about his/her grammar education, grammar is an essential key to successful English conversation. Some ways that teachers can close the gap between primary and secondary school grammar education and college conversation classes are: to use a theme-based methodology, cue cards, and modeling. Activities such as Grammar Clinic, Grammar Police, and Show and Tell can be effective ways to bridge this gap. Teachers can use these activities and methods to correct such student errors as: incorrect word order, missing or unnecessary be verbs, confusion between be and do verbs, subject-verb agreement. and incorrect tense.

  • PDF

Variation of Cannonical Sentence Structure in Korean & Japanese Dialects & its Implication

  • Khym, Han-gyoo
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제7권2호
    • /
    • pp.142-148
    • /
    • 2015
  • The main purpose of this squib is to provide a new principled account for variation of canonical sentence structure in Korean and Japanese based on the linguistic data commonly observed in some dialects of Korean and Japanese. Unlike the English case in which Comp(lementizer) such as 'that' in an embedded clause freely drops as far as the ECP (Lasnik & Saito 1992) is obeyed, some dialects of both Korean and Japanese show interesting linguistic data very different from those of English, thereby leading us to reasonably doubt the traditionally-accepted paradigm of the canonical sentence structure of CP for all languages. In this squib I propose, based on Korean & Japanese dialects and by developing the Minimal Structure Principle (MSP) ($Bo{\check{s}}kovi{\acute{c}}$ 1997, p. 25), that the cannonical structure of a sentence is not fixed, from the beginning at all, to be one single maximal category, CP. Instead, it should be decided to be either CP or IP, based on the feature of [${\pm}$markedness] and MSP, and the marked (or non-cannonical) embedded sentence needs to satisfy ECP for adjacency (or feature-licensing by the matrix verb in the MP terminology).

초점과 정보 구조에 따른 한국어 화자의 영어 억양 실현 양상 (Korean Speakers' Realization of Focus and Information Structure on English Intonation in Comparison with English Native Speakers)

  • 엄혜영;이혜숙;김기호
    • 음성과학
    • /
    • 제8권2호
    • /
    • pp.133-148
    • /
    • 2001
  • Focus and information structure are closely related with the distribution of pitch accents. A focused word conveys new information and bears a pitch accent. A content word can usually get a pitch accent, but it can be deaccented if it is mentioned earlier in the discourse. In this paper, we test how English native speakers and Korean learners of English realize pitch accents according to focus and information structure of a sentence. The production experiment shows that English native speakers give a pitch accent to narrow-focused items, deaccenting all the other items of the sentence. For VP broad focus, native speakers give a pitch accent either to both the verb and its complement or to the complement only. On the other hand, it is found that Koreans give pitch accents to most content words regardless of focus and information structure. Moreover, the perception experiment confirms that Koreans' intonation patterns, which are not appropriate in terms of focus and information structure, may jeopardize listeners' comprehension. This paper shows that Korean speakers have little knowledge about focus and information structure for intonational realization, and that such notions should be applied to teaching of English intonation.

  • PDF

MATLAB을 이용한 송전선로의 아크사고 검출 및 고장거리 추정 소프트웨어 개발에 관한 연구 (A Study on the Defection of Arcing Faults in Transmission Lines and Development of Fault Distance Estimation Software using MATLAB)

  • 김병천;박남옥;김동수;김길환
    • 대한전기학회논문지:전력기술부문A
    • /
    • 제51권4호
    • /
    • pp.163-168
    • /
    • 2002
  • This paper present a new verb efficient numerical algorithm for arcing faults detection and fault distance estimation in transmission line. It is based on the fundamental differential equations describing the transients on a transmission line before, during and alter the fault occurrence, and on the application of the "Least Error Squares Technique"for the unknown model parameter estimation. If the arc voltage estimated is a near zero, the fault is without arc, in other words the fault is permanent fault. If the arc voltage estimated has any high value, the faust is identified as an fault, or the transient fault. In permanent faults case, fault distance estimation is necessary. This paper uses the model of the arcing fault in transmission line using ZnO arrestor and resistance to be implemented within EMTP. One purpose of this study is to build a structure for modeling of arcing fault detection and fault distance estimation algorithm using Matlab programming. In this paper, This algorithm has been designed in Graphic user interface(GUI).