• Title/Summary/Keyword: phonetic level

검색결과 112건 처리시간 0.019초

Gradient Reduction of $C_1$ in /pk/ Sequences

  • Son, Min-Jung
    • 음성과학
    • /
    • 제15권4호
    • /
    • pp.43-60
    • /
    • 2008
  • Instrumental studies (e.g., aerodynamic, EPG, and EMMA) have shown that the first of two stops in sequence can be articulatorily reduced in time and space sometimes; either gradient or categorical. The current EMMA study aims to examine possible factors_linguistic (e.g., speech rate, word boundary, and prosodic boundary) and paralinguistic (e.g., natural context and repetition)_to induce gradient reduction of $C_1$ in /pk/ cluster sequences. EMMA data are collected from five Seoul-Korean speakers. The results show that gradient reduction of lip aperture seldom occurs, being quite restricted both in speaker frequency and in token frequency. The results also suggest that the place assimilation is not a lexical process, implying that speakers have not fully developed this process to be phonologized in the abstract level.

  • PDF

Blind speech segmentation과 에너지 가중치를 이용한 문장 종속형 화자인식기의 성능 향상 (Performance improvement of text-dependent speaker verification system using blind speech segmentation and energy weight)

  • 김정곤;김형순
    • 대한음성학회지:말소리
    • /
    • 제47호
    • /
    • pp.131-140
    • /
    • 2003
  • We propose a new method of generating client models for HMM based text-dependent speaker verification system with only a small amount of training data. To make a client model, statistical methods such as segmental K-means algorithm are widely used, but they do not guarantee the quality or reliability of a model when only limited data are avaliable. In this paper, we propose a blind speech segmentation based on level building DTW algorithm as an alternative method to make a client model with limited data. In addition, considering the fact that voiced sounds have much more speaker-specific information than unvoiced sounds and energy of the former is higher than that of the latter, we also propose a new score evaluation method using the observation probability raised to the power of weighting factor estimated from the normalized log energy. Our experiment shows that the proposed methods are superior to conventional HMM based speaker verification system.

  • PDF

원어민 화자와 한국인 학습자 영어 발화의 초점구조에 대한 실험음성학적 연구;협의초점과 광의초점을 중심으로 (An Experimental Study on Focus Structures of English Utterances by Native Speakers and Korean Learners)

  • 최경민;장태엽
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2006년도 추계학술대회 발표논문집
    • /
    • pp.75-79
    • /
    • 2006
  • In this study, we investigate ways that focus is realized in English utterances produced by native speakers of English and Korean learners. As compared to the previous studies which deal mainly with functional aspects of focus as a part of intonational structure, we attempt to provide more quantitative information on F0 and discover the extent to which Korean learners distinguish focus types in their English utterance production. On the test sentences designed to be disambiguated by correct focus realization, it is found that, even advanced-level Korean learners, unlike native speakers, hardly employ F0 to clarify the specific meaning of English utterances.

  • PDF

발음 변이의 발음사전 포함 결정 조건을 통한 발음사전 최적화 (Pronunciation Lexicon Optimization with Applying Variant Selection Criteria)

  • 전재훈;정민화
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2006년도 추계학술대회 발표논문집
    • /
    • pp.24-27
    • /
    • 2006
  • This paper describes how a domain dependent pronunciation lexicon is generated and optimized for Korean large vocabulary continuous speech recognition(LVCSR). At the level of lexicon, pronunciation variations are usually modeled by adding pronunciation variants to the lexicon. We propose the criteria for selecting appropriate pronunciation variants in lexicon: (i) likelihood and (ii) frequency factors to select variants. Our experiment is conducted in three steps. First, the variants are generated with knowledge-based rules. Second, we generate a domain dependent lexicon which includes various numbers of pronunciation variants based on the proposed criteria. Finally, the WERs and RTFs are examined with each lexicon. In the experiment, 0.72% WER reduction is obtained by introducing the variants pruning criteria. Furthermore, RTF is not deteriorated although the average number of variants is higher than that of compared lexica.

  • PDF

장모음 인식장치 설계 제작 (Design and Manufacture of a Device for the Recognition of Long Vowels)

  • 구용회
    • 전자공학회논문지T
    • /
    • 제35T권3호
    • /
    • pp.9-14
    • /
    • 1998
  • 장모음 음성인식을 전자회로로 수행하였다. 레벨 압축은 음성 파형을 직렬 펄스로 변화시킬 수 있었다 이 펄스들로 모음을 구별하는 정보가 된다. 펄스의 샘풀링은 한 단위로 모음의 피치 직렬신호를 얻어지는 레지스터 에 의해서 이루어진다. 샘풀링 펄스에 의한 시간제어 펄스는 음성파형의 첩두치 펄스에 의해 발진되어 진다. 이 레지스터에 있는 병렬 데이터는 만약 OO이면 OO이다는 규칙으로 이루처지는 의지결정 회로의 뜻에 따라 음성 심볼이 인식되어진다.

  • PDF

소음문장 제거를 위한 음소지속시간 사용 (The Usage of Phoneme Duration Information for Rejecting Garbage Sentences)

  • 구명완;김호경;박성준;김재인
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.219-222
    • /
    • 2003
  • In this paper, we study the usage of phoneme duration information for rejection garbage sentence. First, we build a phoneme duration modeling in a speech recognition system based on dicicion tree state tying, We assume that phone duration has a Gamma distribution. Next, we build a verification module in which word-level confidence measure is used. Finally, we make a comparative study on phoneme duration with speech DB obtained from the live system. This DB consistes of OOT(out-of-task) and ING(in-grammar) utterences. the usage of phone duration information yields that OOT recognition rate is improved by 46% and that another 8.4% error rate is reduced when combined with utterence verification module.

  • PDF

대화 예제를 이용한 상황 기반 대화 관리 시스템 (A Situation-Based Dialogue Management with Dialogue Examples)

  • 이청재;정상근;이근배
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.113-115
    • /
    • 2005
  • In this paper, we present POSSDM (POSTECH Situation-Based Dialogue Manager) for a spoken dialogue system using a new example and situation-based dialogue management techniques for effective generation of appropriate system responses. Spoken dialogue system should generate cooperative responses to smoothly control dialogue flow with the users. We introduce a new dialogue management technique incorporating dialogue examples and situation-based rules for EPG (Electronic Program Guide) domain. For the system response inference, we automatically construct and index a dialogue example database from dialogue corpus, and the best dialogue example is retrieved for a proper system response with the query from a dialogue situation including a current user utterance, dialogue act, and discourse history. When dialogue corpus is not enough to cover the domain, we also apply manually constructed situation-based rules mainly for meta-level dialogue management.

  • PDF

주변성 난독증의 특성과 대뇌활성화 양상 - 단일사례연구 - (Cognitive neuropsychological assesment in pure alexic patient with letter-by-letter reading using fMRl - Single case study -)

  • 손효정;편성범;김충명;남기춘
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.137-140
    • /
    • 2005
  • In this study we investigated the cognitive neuropsychological characteristics and the underlying mechanism in a letter-by-letter reading dyslexic patient after cerebral infarct of left posterior cerebral artery using fMRl, The results of cognitive neuropsychological assesment are visual perception was appropriate, and semantic categorization, picture naming and picture-word matching tasks were above83% correct, respectively. However, she was very poor in lexical decision task. The selective reading impairment is thought to result from the disruption of the left occipitotemporal region included fusiform gyrus. In fMRl results, the activation level increase din the right occipitotemporal region included fusiform gyrus compared with normal group in compensation for left impairment and more increased in pseudo word reading task than word reading on account of familiarity.

  • PDF

품사셋에 의한 운율경계강도의 예측 (Prediction of Prosodic Boundary Strength by means of Three POS(Part of Speech) sets)

  • 엄기완;김진영;김선미;이현복
    • 대한음성학회지:말소리
    • /
    • 제35_36호
    • /
    • pp.145-155
    • /
    • 1998
  • This study intended to determine the most appropriate POS(Part of Speech) sets for predicting prosodic boundary strength efficiently. We used 3-level POB bets which Kim(1997), one of the authors, has devised. Three POS sets differ from each other according to how much grammatical information they have: the first set has maximal syntactic and morphological information which possibly affects prosodic phrasing, and the third set has minimal one. We hand-labelled 150 sentences using each of three POS sets and conducted perception test. Based on the results of the test, stochastic language modeling method was used to predict prosodic boundary strength. The results showed that the use of each POS set led to not too much different efficiency in the prediction, but the second set was a little more efficient than the other two. As far as the complexity in stochastic language modeling is concerned, however, the third set may be also preferable.

  • PDF

영어-한국어 단어번역과제에서 이름-일치도와 단어빈도의 효과 (Effects of Name Agreement and Word Frequency on the English-Korean Word Translation Task)

  • 구민모;남기춘
    • 대한음성학회지:말소리
    • /
    • 제61호
    • /
    • pp.31-48
    • /
    • 2007
  • This study investigated the roles of name agreement and word frequency in the English-Korean word translation task. Using the low-frequency homonyms with low name agreement as stimuli, Experiment 1 revealed that the name agreement of materials is a determinant which could modulate times to translate English words into Korean equivalents. On the contrary, Experiment 2 showed that the name agreement of materials does not play a decisive role in the translation task, using the low-frequency homonyms having high name agreement as stimuli. In Experiment 3, we identified that the frequency effects observed from previous two experiments are indeed brought about during the lexical access. Our findings suggest that the word frequencies of materials have a strong influence on English-Korean word translation times, and homonyms are represented independently each other in the lexeme level.

  • PDF