• 제목/요약/키워드: Vocal Tract

검색결과 172건 처리시간 0.028초

청각 장애자용 발음 훈련 기기의 개발 (Speech training aids for deafs)

  • 김동준;윤태성;박상희
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1991년도 한국자동제어학술회의논문집(국내학술편); KOEX, Seoul; 22-24 Oct. 1991
    • /
    • pp.746-751
    • /
    • 1991
  • Deafs train articulation by observing mouth of a tutor. sensing tactually the notions of the vocal organs, or using speech training aids. Present speech training aids for deafs can measure only single speech ter, or display only frequency spectra in histogrm or pseudo-color. In this study, a speech training aids that can display subject's articulation in the form of a cross section of the vocal organs and other speech parameters together in a single system Is aimed to develop and this system makes a subject to know where to correct. For our objective, first, speech production mechanism is assumed to be AR model in order to estimate articulatory notions of the vocal tract from speech signal. Next, a vocal tract profile mode using LPC analysis is made up. And using this model, articulatory notions for Korean vowels are estimated and displayed in the vocal tract profile graphics.

  • PDF

원격으로 실시한 반폐쇄성도훈련이 영유아 교사의 주관적 음성평가에 미치는 효과 (Effect of semi-occluded vocal tract exercise via telepractice on subjective voice evaluation of early childhood teachers)

  • 류형선;김재옥
    • 말소리와 음성과학
    • /
    • 제13권4호
    • /
    • pp.67-74
    • /
    • 2021
  • 본 연구는 영유아 교육시설에서 근무하는 음성의 불편감을 호소하는 10명의 여성 교사들을 대상으로 반폐쇄성도훈련(semi-occluded vocal tract exercise, SOVTE)을 원격으로 실시하였을 때 주관적으로 평가하는 음성평가에 미치는 효과를 살펴보았다. 원격 SOVTE의 효과는 한국어판 음성장애지수(Korean voice handicap index, KVHI), 음성 활동 및 참여 프로파일-한국판(Korean version of the voice activity and participation profile, K-VAPP), 음성노력도 및 GRBAS를 이용한 청지각적 평가로 평가하였다. 연구 결과, KVHI의 총 점수, 기능적 점수, 신체적 점수는 원격 SOVTE를 실시한 후에 통계적으로 유의하게 낮아졌다. 원격 SOVTE 실시 후 K-VAPP의 총 점수도 유의하게 감소하였으며, 음성노력도 또한 유의하게 감소하였다. 그러나 GRB 척도는 원격 SOVTE 실시 전과 후 간에 통계적으로 유의한 차이를 보이지 않았다. 본 연구를 통해 영유아 여성 교사에게 원격으로 실시한 SOVTE는 음성의 불편감을 감소시키는데 효과적임을 입증하였으며, 원격으로 실시한 음성치료가 효과가 있음을 보여준다.

조음 음성 합성기에서 버퍼 재정렬을 이용한 연속음 구현 (Implementation of Continuous Utterance Using Buffer Rearrangement for Articula Synthesizer)

  • 이희승;정명진
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2002년도 하계학술대회 논문집 D
    • /
    • pp.2454-2456
    • /
    • 2002
  • Since articuratory synthesis models the human vocal organs as precise as possible, it is potentially the most desirable method to produce various words and languages. This paper proposes a new type of an articulatory synthesizer using Mermelstein vocal tract model and Kelly-Lochbaum digital filter. Previous researches have assumed that the length of the vocal tract or the number of its cross sections dose not vary while uttering. However, the continuous utterance can not be easily implemented under this assumption. The limitation is overcomed by "Buffer Rearrangement" for dynamic vocal tract in this paper.

  • PDF

초고속 성대촬영기(High-Speed Digital Imaging)를 이용한 말더듬인과 근 긴장성 발성장애인의 /이/모음 발성 시 성대 진동 양상에 관한 비교 연구 (A Comparative Study of Vocal Fold Vibratory Behaviors Shown in the Phonation of the /i/ Vowel between Persons who Stutter and Persons with Muscle Tension Dysphonia Using High-Speed Digital Imaging)

  • 정훈;안종복;박진향;최병흔;권도하
    • 말소리와 음성과학
    • /
    • 제1권4호
    • /
    • pp.195-201
    • /
    • 2009
  • The purpose of this study was to use high-speed digital imaging (HSDI) to compare vocal vibratory behaviors of persons who stutter (PWS) and persons with muscle tension dysphonia (PMTD) for uttering the /i/ vowel in a bid to identify the characteristics of vocal fold vibratory behaviors of PWS. This study surveyed seven developmental PWSs and seven PMTDs. The findings of the study indicated the following: first, regarding the two groups' vocal fold vibratory behaviors, of seven PWSs, three were found to be close vocal tract (VC) and four were found to be combination vocal tract (VCB). Of the seven PMTDs, one was found to be VC, and the other six were found to be VCB. These results indicate that a voiceprint which is different from the open vocal tract (VO) found in normal groups in research conducted by Jung, et al. (2008b) appeared in both groups of this study. Even between the two groups, there is a difference in the voiceprint before vocalization. Second, a VKG analysis was conducted to identify the two groups' vocal cord contact quotient. As a result, the PWS group's vocal cord contact quotient changed gradually from an irregular one at the initial vocalization stage to a regular one. The PMTD group continued the tension at the initial vocalization. Putting together all of these results, there is a difference in vocal fold vibratory behaviors between PWSs and PMTDs when they speak. Thus, there was a difference in muscular tension between the two groups.

  • PDF

성악가의 성종 구분에 관한 문헌적 고찰 (Voice Classification of Trained Classic Singers)

  • 남도현;백재연;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제18권1호
    • /
    • pp.56-61
    • /
    • 2007
  • Introduction: Actually classification of classic singers' voice depends on habitual judgment by voice teachers or voice trainer referring to vocal timbre, vocal range and vocal quality. Such judgments, however, may turn out to be incorrect because they are based on subjective opinions. Therefore, more objective methodology is required. Method: Foreign dissertations searched through Pub Med, along with foreign and domestic journals, were reviewed regard ing how singers' voice has been categorized. Results: Vocal range, vocal timbre, voice quality, fundamental frequency of habitual speaking, length of vocal tract, the length from cricoid cartilage to thyroid cartilage's thyroid notch and length of vocal fold, tone of passaggio as well as traditional approaches such as perceptual judgment used by professional singers have been used for categorize the voice classification. Conclusion: To optimize categorizing singers' voice, vocal range, vocal timbre, voice quality, fundamental frequency of habitual speaking, length of vocal tract, the length from cricoid cartilage to thyroid cartilage's thyroid notch and length of vocal fold, tone of passaggio may be totally recommended.

  • PDF

동적 성대 모델을 이용한 후두 내 유동 및 음향장에 대한 수치 연구 (Computation of Laryngeal Flow and Sound through a Dynamic Model of the Vocal Folds)

  • 배영민;문영준
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 2008년도 춘계학술대회논문집
    • /
    • pp.21-24
    • /
    • 2008
  • The present study numerically investigates the glottal airflow characteristics as well as acoustic features of phonation fully coupled with dynamic behavior of vocal folds. The vocal folds are described by a low-dimensional body-covered model characterized by bio-mechanical parameters such as glottal width, vocal folds stiffness, and subglottal pressure. The flow in the vocal tract is modeled as an incompressible, axisymmetric form of the Navier-Stokes equations (INS), while the acoustic field is predicted by the linearized perturbed compressible equations (LPCE). The computed result shows that a two-mass model of vocal folds is sufficient to reproduce temporal variations in oral airflow and glottis motion produced by female speakers. It is also found that i) the glottal width has a significant effect on the amplitude of glottal flow, and thus on the amplitude of acoustic wave in the vocal tract, ii) the vocal fold tension is the main control parameter for the fundamental frequency of phonation, iii) the subglottal pressure plays an appreciable role on reproduction of the self-sustained oscillation of vocal folds, and iv) the strength of pulsating airflow and vortical structures are primarily affected by glottal width and subglottal pressure, and are closely related to pitch, loudness, and voice quality. Finally, more comprehensive explanation about the difference between one- and two-mass models is presented with discussion of effectiveness of vocal folds oscillation and voice quality.

  • PDF

음성인식에서 화자 내 정규화를 위한 진폭 변경 방법 (An Amplitude Warping Approach to Intra-Speaker Normalization for Speech Recognition)

  • 김동현;홍광석
    • 인터넷정보학회논문지
    • /
    • 제4권3호
    • /
    • pp.9-14
    • /
    • 2003
  • 기존의 성도 정규화 방법은 화자 간 정규화의 정확성을 개선하기 위한 매우 좋은 방법이다. 본 논문에서는 피치 변경 발성에 기반을 둔 새로운 화자 내 warping 인수 추정 방법을 제안한다. 화자 내 피치 변경 발성은 성문과 성도에 의해 발생되는 음성의 음향학적 차이 때문에 음성의 특징 공간 분포는 다르게 나타날 것이다. 발성의 변동은 frequency 성분과 amplitude 성분의 두가지 유형이 있다. 성도 정규화는 화자 간 정규화 방법들 중에서 주파수 정규화 방법이다. 여기에서는 화자 내 정규화를 위하여 진폭 변동을 정규화하는 방법을 제안한다. 참조 피치와 입력 피치의 역비례 계산에 의해서 진폭 warping 인수를 결정하는 것이 가능하다. 성능 평가를 위한 인식 실험 결과 숫자와 단어 인식에서 0.4%∼2.3% 정도의 인식 오류가 감소되었다.

  • PDF

기본주파수와 성도길이의 상관관계를 이용한 HTS 음성합성기에서의 목소리 변환 (Voice transformation for HTS using correlation between fundamental frequency and vocal tract length)

  • 유효근;김영관;서영주;김회린
    • 말소리와 음성과학
    • /
    • 제9권1호
    • /
    • pp.41-47
    • /
    • 2017
  • The main advantage of the statistical parametric speech synthesis is its flexibility in changing voice characteristics. A personalized text-to-speech(TTS) system can be implemented by combining a speech synthesis system and a voice transformation system, and it is widely used in many application areas. It is known that the fundamental frequency and the spectral envelope of speech signal can be independently modified to convert the voice characteristics. Also it is important to maintain naturalness of the transformed speech. In this paper, a speech synthesis system based on Hidden Markov Model(HMM-based speech synthesis, HTS) using the STRAIGHT vocoder is constructed and voice transformation is conducted by modifying the fundamental frequency and spectral envelope. The fundamental frequency is transformed in a scaling method, and the spectral envelope is transformed through frequency warping method to control the speaker's vocal tract length. In particular, this study proposes a voice transformation method using the correlation between fundamental frequency and vocal tract length. Subjective evaluations were conducted to assess preference and mean opinion scores(MOS) for naturalness of synthetic speech. Experimental results showed that the proposed voice transformation method achieved higher preference than baseline systems while maintaining the naturalness of the speech quality.

후두위치의 변화에 따른 Singer's Formant와 성대접촉률의 변화 연구 (Analysis of Singer's Formant & Close Quotient During Change of the Larynx Position)

  • 남도현;최성희;최재남;전석필;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제15권2호
    • /
    • pp.98-111
    • /
    • 2004
  • Background and Objectives : The purpose of this study is to analyze the difference of Fundamental Frequency(Hz), Closed Quotient(Qx ; %), Intensity(dB), Vocal tract length and width(cm), formant frequency(Hz), level of formant frequency(dB) depending on the larynx position. Materials and Methods : One professional male singer(career : 28 years) produced sustained vowel /a/,/e/,/i/,/o/,/u/ in two larynx position (higher, lower) with Dr. Speech and video fluoroscopy was used to quantify the vocal tract morphology. Results : In lower larynx position, CQ is increased 9.8% and Intensity is increased about 10% and level of Formant Frequency is increased. And also Vocal tract length is longer 2.4cm, Vocal tract width(Anterior width : 0.4cm, lateral width : 0.2cm) is wider than in higher larynx position. Conclusions : Singer's formant has a prominent spectrum envelope peak near 2400-2600Hz by clustering of F3, F4 and F5 near 3400Hz in lower larynx position.

  • PDF

음성기관의 공기역학적 고찰 (The Aerodynamic Study of the Vocal Tract)

  • 김기령;박인용;김희남;심상열;최홍식
    • 대한기관식도과학회:학술대회논문집
    • /
    • 대한기관식도과학회 1979년도 제13차 학술대회 연제순서 및 초록
    • /
    • pp.8.3-8
    • /
    • 1979
  • 음의 생성은 성문하의 기류가 성대에서 조절되고 성대상부의 Vocal tract에서 modulation되어 생성되므로 후두에 이상이 생기면 발성시 후두를 통과하는 기류에 변화가 오게된다. 타국에서는 Dohne(1944)과 Arnold(1955, 1958)등 여러학자들이 후두질환에 따른 공기역학적 변화를 측정하여 후두질환의 진단에 기여한 바 크다. 본 저자들은 후두질환에 따른 공기역학적 측정에 앞서 이에 대한 정상역치를 측정하여 그 기준치로 하고자 21∼30세의 정상인 남녀 각각 20명을 대상으로 Collins회사제 Respirometer를 이용하여 평균 기류유출률, 최대 밭성량, 최대발성시간 및 발성속력치 등을 측정하였기에 제 1보로서 보고하는 바이다.

  • PDF