• 제목/요약/키워드: speech-timing

검색결과 48건 처리시간 0.033초

천이구간 정보를 이용한 음성의 가변적인 시간축 변환 (Variable Time-Scale Modification of Speech Using Trasient Information)

  • 이성주;김희동;김형순
    • 전자공학회논문지S
    • /
    • 제35S권6호
    • /
    • pp.147-155
    • /
    • 1998
  • 기존의 시간축 변환 방법은 음성 특징에 따른 발음 속도의 영향을 고려하지 않기 때문에 변환비율이 커짐에 따라 합성음의 명료도가 떨어지는 문제점이 있다. 본 논문에서는 이러한 문제점을 해결하기 위하여 음성 인지과정에서 천이 구간의 시간축 정보가 중요한 역할을 한다는 사실에 기반을 둔 가변적인 시간축 변환 방법을 제안한다. 이를 위하여 제안된 방법에서는 먼저 음성신호를 천이 구간과 정적인 구간으로 구분하고, 천이 구간의 시간축 정보는 그대로 유지하면서 정적인 구간만을 시간축 변환함으로써 목표하는 변환 비율을 얻는다. 청취자 선호도 시험 결과, 제안된 방법이 기존의 대표적인 시간축 변환 방법인 SOLA 방법에 비해 그 성능이 우수함을 확인하였다.

  • PDF

Durational aspects of Korean nasal geminates

  • Oh, Eunhae
    • 말소리와 음성과학
    • /
    • 제9권4호
    • /
    • pp.19-25
    • /
    • 2017
  • The current study focused on the production of geminate nasal consonants across different word boundary types in Korean as a function of speech style to investigate whether temporal properties are preserved across varying speaking rates. Assimilated geminates in Korean, known as true geminates, are produced with distinctively longer consonant duration compared to singletons. Despite a large body of literature for geminates across different languages, geminates in Korean have been relatively less investigated with respect to the durational patterns in relative terms and temporal variabilities. In this study, singletons, word-internal geminates and word-boundary (fake) geminates produced by ten native Seoul Korean speakers were compared in terms of absolute consonant closure duration, preceding vowel duration, the relative ratios (consonant-to-preceding vowel duration) as well as the temporal variabilities in speech production. The results showed that word-internal geminates were produced with longer consonant duration and greater temporal variabilities than singletons and word-boundary geminates in absolute duration, indicating relatively greater flexibility in timing. However, only word-internal geminates were produced with distinctively longer consonant duration with significantly lower variability in relative duration regardless of speech styles. The results provide some insight into the representation of temporal information in the production of Korean geminate consonants.

Modelling Duration In Text-to-Speech Systems

  • 정현성
    • 대한음성학회지:말소리
    • /
    • 제49호
    • /
    • pp.159-174
    • /
    • 2004
  • The development of the durational component of prosody modelling was overviewed and discussed in text-to-speech conversion of spoken English and Korean, showing the strengths and weaknesses of each approach. The possibility of integrating linguistic feature effects into the duration modelling of TTS systems was also investigated. This paper claims that current approaches to language timing synthesis still require an understanding of how segmental duration is affected by context. Three modelling approaches were discussed: sequential rule systems, Classification and Regression Tree (CART) models and Sums-of-Products (SoP) models. The CART and SoP models show good performance results in predicting segment duration in English, while it is not the case in the SoP modelling of spoken Korean.

  • PDF

Gender difference in the sound change of lexical pitch accents of South Kyungsang Korean

  • Lee, Hyunjung
    • 말소리와 음성과학
    • /
    • 제7권4호
    • /
    • pp.123-130
    • /
    • 2015
  • Given a recent finding showing that female speakers of South Kyungsang Korean is undergoing a sound change of the lexical pitch accent, this study tested whether the change is also reflected for male speech. This study compared F0 scaling and timing properties of accent words produced by younger female and male speakers of South Kyungsang Korean. The results indicated clear gender-related differences, showing more distinct acoustic properties across the accent words for male production compared to females. Despite the better distinction, however, younger male speakers showed peak delay where the F0 peaks are located further to the right compared to conservative speakers' production. Therefore, it might be suggested that younger male speakers' accent productions are in between conservative and innovative phonetic forms.

인두피판성형술 전후의 언어 평가 (SPEECH-LANGUAGE EVALUATION BEFORE AND AFTER PHARYNGOPLASTY)

  • 유양근;한진순;김정록;황순정
    • 대한구순구개열학회지
    • /
    • 제3권2호
    • /
    • pp.61-66
    • /
    • 2000
  • General characteristics of speech in deft palate patients are hypemasality and articulation disorder, which are affected by velopharyngeal inadequacy(VPI). 17 subjects with a chief complaint of 'nasal sounds and inaccurate pronunciation' underwent a speech-language evaluation before and after pharyngoplasty. Hypemasality and obligatory articulation errors were improved but compensatory articulation errors remained after pharyngoplasty. Above mentioned results indicate that resonance may be normal or improved following successful surgical management of VPI but, compensatory articulation errors will still persist. The separate recognition of hypemasality, compensatory and obligatory articulation errors in deft palate patients is important in determining the timing of therapy and selection of appropriate targets in therapy.

  • PDF

잡음제거 기능을 갖춘 시-청각 단서 제공 읽기 훈련 프로그램 (A Reading Trainning Program offering Visual-Auditory Cue with Noise Cancellation Function)

  • 방동혁;강현덕;길세기;이상민
    • 재활복지공학회논문지
    • /
    • 제2권1호
    • /
    • pp.35-43
    • /
    • 2009
  • 본 논문에서는 개발된 잡음제거 기능을 갖춘 시-청각 단서 제공 읽기 훈련 프로그램(이하 프로그램)을 소개한다. 프로그램은 시-청각 단서들을 지닌 훈련용 문장들을 제공한다. 말운동장애인들은 읽기훈련을 위해서 시각단서와 청각단서들을 각각 또는 동시에 사용 가능하다. 훈련 결과의 평가 편의성 제공을 위해서 잡음제거 알고리즘을 개발하였다. 알고리즘은 피험자가 컴퓨터화면에 제공된 문장을 읽을 때 읽는 말소리와 함께 녹음된 잡음과 청각단서 소리를 제거한다. 또한 피험자가 읽기 연습을 시작할 때 최초의 말소리 개시시간을 검출하는 기능을 구현하였다. 말소리의 녹음은 4가지 잡음환경(실내 잡음, 백색 잡음, 자동차 내부잡음, 배블 잡음)에서 성인 6명(남성 3 명, 여성 3명)으로부터 하였다. 잡음제거 전과 후에 대한 조건에서 녹음된 말소리의 실제 시작 시간과 프로그램상에서 찾은 시간과의 오차를 실험하였다. 잡음제거 전과 후에서의 시간오차가 $4.847{\pm}2.4235[ms]$ 향상되었다. 개발된 프로그램은 말운동장애인의 훈련 및 증상 평가에 도움이 될 수 있으리라 사료된다.

  • PDF

Stress Effects on Korean Vowels with Reference to Rhythm

  • 윤일승
    • 대한음성학회지:말소리
    • /
    • 제67호
    • /
    • pp.1-16
    • /
    • 2008
  • Stress effects upon Korean vowels were investigated with reference to rhythm. We measured three acoustic correlates (Duration: VOT, Vowel Duration; F0; Intensity) of stress from the seven pairs of stressed vs. unstressed Korean vowels /i, ${\varepsilon}(e)$, a, o, u, i, e/. The results of the experiment revealed that stress gave only inconsistent and weak effects on duration, which supports that Korean is not a stress-timed language as far as strong stress effects on duration are still considered crucial in stress-timing. On the other hand, Korean stressed vowels were most characterized with higher F0 and next with stronger intensity. But speakers generally showed tactics to reversely use F0 and intensity in stressing an utterance rather than proportionately strengthening both of the two acoustic correlates of stress. There was found great inter-speaker variability especially in the variations of duration.

  • PDF

Long-Term Follow-Up Study of Young Adults Treated for Unilateral Complete Cleft Lip, Alveolus, and Palate by a Treatment Protocol Including Two-Stage Palatoplasty: Speech Outcomes

  • Kappen, Isabelle Francisca Petronella Maria;Bittermann, Dirk;Janssen, Laura;Bittermann, Gerhard Koendert Pieter;Boonacker, Chantal;Haverkamp, Sarah;de Wilde, Hester;Van Der Heul, Marise;Specken, Tom FJMC;Koole, Ron;Kon, Moshe;Breugem, Corstiaan Cornelis;van der Molen, Aebele Barber Mink
    • Archives of Plastic Surgery
    • /
    • 제44권3호
    • /
    • pp.202-209
    • /
    • 2017
  • Background No consensus exists on the optimal treatment protocol for orofacial clefts or the optimal timing of cleft palate closure. This study investigated factors influencing speech outcomes after two-stage palate repair in adults with a non-syndromal complete unilateral cleft lip and palate (UCLP). Methods This was a retrospective analysis of adult patients with a UCLP who underwent two-stage palate closure and were treated at our tertiary cleft centre. Patients ${\geq}17$ years of age were invited for a final speech assessment. Their medical history was obtained from their medical files, and speech outcomes were assessed by a speech pathologist during the follow-up consultation. Results Forty-eight patients were included in the analysis, with a mean age of 21 years (standard deviation, 3.4 years). Their mean age at the time of hard and soft palate closure was 3 years and 8.0 months, respectively. In 40% of the patients, a pharyngoplasty was performed. On a 5-point intelligibility scale, 84.4% received a score of 1 or 2; meaning that their speech was intelligible. We observed a significant correlation between intelligibility scores and the incidence of articulation errors (P<0.001). In total, 36% showed mild to moderate hypernasality during the speech assessment, and 11%-17% of the patients exhibited increased nasalance scores, assessed through nasometry. Conclusions The present study describes long-term speech outcomes after two-stage palatoplasty with hard palate closure at a mean age of 3 years old. We observed moderate long-term intelligibility scores, a relatively high incidence of persistent hypernasality, and a high pharyngoplasty incidence.

The Vowel Length as a Function of the Articulatory Force of the Following Consonants in Korean

  • Kim, Dae-Won
    • 음성과학
    • /
    • 제9권3호
    • /
    • pp.143-153
    • /
    • 2002
  • This study was designed to determine (1) the effects of the following stop consonant on the vowel length in isolated bi-syllabic words, (2) the mechanism which renders vowels longer in duration before lax stops than tense stops, (3) where the aspiratory interval is included, in the vowel portion or the preceding consonantal portion and (4) the influence of the preceding consonants upon the duration of the following vowel. Measurements were made of five timing variables on acoustic signals as three native Korean speakers uttered isolated bi-syllabic /VCV/ words in which the vowel was identical, /$\alpha$/, and the C slot was filled with bilabial stops. Findings: (1) the vowel length before the lax stops was significantly longer than before the tense stops, while the difference in the vowel duration between the tense stops was insignificant or negligible, (2) the vowel length varied as a function of the articulatory force of the following consonants, regardless of the phonological unit of syllable, (3) The aspiratory interval is interpreted as a portion of the preceding consonant and (4) The effects of the preceding consonants on the final vowel length were not rule-governed.

  • PDF

An Optimality Theoretic Analysis of Tonal Realization in Korean

  • Oh, Mi-Ra
    • 음성과학
    • /
    • 제10권3호
    • /
    • pp.89-101
    • /
    • 2003
  • This paper investigates edge effects on the relationship between the underlying tonal sequence and its surface realization in the IP-final Accentual Phrase within the Optimality Theoretic framework. I will examine the way in which AP tones are aligned with their associated syllables in IP-final position. In Korean. Jun's (1996) 'see-saw effect' does not allow any two identical tones if they are marking a boundary of a prosodic group. A phonetic experiment conducted in this paper suggests that the 'see-saw effect' only apply to H boundary tones. Furthermore, it will be shown that the timing of tonal peaks is determined through the ranking of a set of violable constraints. The AP tonal realization is achieved through the access to the global intonation in a complicated way. In the course of discussion, pitch patterns in IP-medial Accentual Phrase will also be discussed.

  • PDF