• Title/Abstract/Keywords: Vocal Tract

172 search results; processing time 0.021 s

Performance of Vocal Tract Area Estimation from Deaf and Normal Children's Speech

  • 김세환;김남;권오욱
    • Malsori (대한음성학회지:말소리) / No. 56 / pp. 159-172 / 2005
  • This paper analyzes the vocal tract area estimation algorithm used in a speech analysis program that helps deaf children correct their pronunciation by comparing their vocal tract shapes with those of normal-hearing children. Assuming that the vocal tract is a concatenation of cylindrical tubes with different cross-sectional areas, we compute the relative area of each tube from the reflection coefficients obtained by linear predictive coding. We then obtain the absolute vocal tract area by computing the height of the lip opening with a formula modified for children's speech. Using speech data for five Korean vowels (/a/, /e/, /i/, /o/, and /u/), we investigate the effects of sampling frequency, frame size, and model order on the estimated vocal tract shape. We compare the vocal tract shapes obtained from deaf and normal-hearing children's speech.
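The tube-model step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' program: it assumes the autocorrelation method with the Levinson-Durbin recursion, and the function names, the sign convention for the reflection coefficients, and the direction in which areas are accumulated are all assumptions (conventions differ between textbooks). The lip-opening formula for absolute areas is not given in the abstract and is not attempted here.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one speech frame (list of floats)."""
    n = len(frame)
    return [sum(frame[t] * frame[t + lag] for t in range(n - lag))
            for lag in range(max_lag + 1)]


def reflection_coefficients(r, order):
    """Levinson-Durbin recursion on autocorrelation r.

    Uses the predictor polynomial A(z) = 1 + a1 z^-1 + ... + ap z^-p and
    returns the reflection (PARCOR) coefficients k_1..k_p.
    """
    a = [1.0] + [0.0] * order
    err = r[0]  # prediction error energy; assumed non-zero (non-silent frame)
    ks = []
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        ks.append(k)
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= (1.0 - k * k)
    return ks


def relative_areas(ks, first_area=1.0):
    """Relative cross-sectional areas of adjacent tube sections.

    Assumed convention: A[i+1] = A[i] * (1 - k_i) / (1 + k_i), starting from
    an arbitrary reference area. Requires |k_i| < 1.
    """
    areas = [first_area]
    for k in ks:
        areas.append(areas[-1] * (1.0 - k) / (1.0 + k))
    return areas


# Usage on a toy autocorrelation sequence (not real speech data).
ks = reflection_coefficients([1.0, 0.5, 0.25], order=2)
areas = relative_areas(ks)
```

The areas are only relative because the reflection coefficients fix the ratio between neighbouring sections, not their absolute size, which is why the abstract needs a separate lip-opening measurement.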

Vocal Tract Modeling with Unfixed Sectionlength Acoustic Tubes (USLAT)

  • 김동준
    • The Transactions of the Korean Institute of Electrical Engineers (전기학회논문지) / Vol. 59, No. 6 / pp. 1126-1130 / 2010
  • Speech production can be viewed as a filtering operation in which a sound source excites a vocal tract filter. In linear prediction acoustic tube modeling, the vocal tract is modeled as a chain of cylinders of varying cross-sectional area. The most common implementation of this model assumes tube sections of equal length, so a large number of sections is needed to model complex vocal tract shapes. This paper proposes a new vocal tract model with unfixed section lengths, which uses a reduced lattice filter to model the vocal tract. The model transforms the lattice filter into a reduced structure and uses a correspondingly modified Burg algorithm. When the conventional and the proposed models are implemented with the same order of linear prediction analysis, the proposed model produces more accurate results than the conventional one. For a system with a similar level of accuracy, it may be possible to reduce the number of lattice filter stages. The proposed model also produces a more similar vocal tract shape than the conventional one.
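For orientation, the conventional baseline that this abstract contrasts against can be sketched as below: the standard Burg recursion, which estimates one lattice reflection coefficient per equal-length tube section. This is only the textbook algorithm under assumed names; the paper's reduced lattice structure and modified Burg algorithm are not described in the abstract and are not reproduced here.

```python
def burg_reflection_coefficients(x, order):
    """Standard Burg estimate of lattice reflection coefficients k_1..k_p.

    x is one frame of samples (list of floats); order must be < len(x).
    """
    # Forward errors f[n] and delayed backward errors b[n-1], aligned pairwise.
    f = list(x[1:])
    b = list(x[:-1])
    ks = []
    for _ in range(order):
        num = -2.0 * sum(fi * bi for fi, bi in zip(f, b))
        den = sum(fi * fi for fi in f) + sum(bi * bi for bi in b)
        k = num / den if den > 0.0 else 0.0
        ks.append(k)
        # Lattice update for this stage.
        new_f = [fi + k * bi for fi, bi in zip(f, b)]
        new_b = [bi + k * fi for fi, bi in zip(f, b)]
        # Drop one sample at each end so the pairs stay aligned at the next stage.
        f = new_f[1:]
        b = new_b[:-1]
    return ks


# Usage on a toy decaying sequence (not real speech data).
ks = burg_reflection_coefficients([1.0, 0.5, 0.25, 0.125], order=2)
```

Each stage minimizes the sum of forward and backward prediction error energies, which keeps every coefficient within [-1, 1] and therefore keeps the corresponding tube areas positive.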

A Study on Vowel Formant Variation by Vocal Tract Modification

  • 양병곤
    • Speech Sciences (음성과학) / Vol. 3 / pp. 83-92 / 1998
  • Vowels are classified by vocal tract shape. These shapes form constriction points along the tract, which influence vocal tract resonances such as $F_1$, $F_2$, $F_3$, and so on. This study reviews the perturbation theory of the tract and determines the corresponding formant frequencies from modified vocal tracts using the vocal tract area function. Formant variation is then observed in light of the theory. Finally, each set of $F_1$, $F_2$, and $F_3$ frequencies is input to speech synthesis software to produce a vowel sound. The auditory impression of each sound without any modification of its vocal tract shape is almost the same as the corresponding phonetic symbol. The formant frequencies $F_1$, $F_2$, and $F_3$ vary according to the perturbation theory. In general, constriction along a node causes formant values to decrease, while constriction along an anti-node causes them to increase. Vocal tracts modified by more than $3\,\mathrm{cm}^2$ change the vowel qualities of /a/ and /i/ into those of /v/ and /$\varepsilon$/, respectively. This study will be helpful in simulating sounds from modified vocal tracts before any operation. Further studies are desirable to compare the vocal tract shapes of various languages together with their sounds.

Vocal Tract Area Estimation from Deaf and Normal Children's Speech

  • 김세환;권오욱
    • Conference proceedings of 대한음성학회 / Proceedings of the 2005 Fall Conference / pp. 51-54 / 2005
  • This paper analyzes the vocal tract area estimation algorithm used in a speech analysis program that helps deaf children correct their pronunciation by comparing their vocal tract shapes with those of normal-hearing children. Assuming that the vocal tract is a concatenation of cylindrical tubes with different cross-sectional areas, we compute the relative area of each tube from the reflection coefficients obtained by linear predictive coding. We then obtain the absolute vocal tract area by computing the height of the lip opening with a formula modified for children's speech. Using speech data for five Korean vowels (/a/, /e/, /i/, /o/, and /u/), we investigate the effects of sampling frequency, frame size, and model order. We compare the vocal tract shapes obtained from deaf and normal-hearing children's speech.

Determining the Relative Differences of Emotional Speech Using Vocal Tract Ratio

  • Wang, Jianglin;Jo, Cheol-Woo
    • Speech Sciences (음성과학) / Vol. 13, No. 1 / pp. 109-116 / 2006
  • This paper examines how emotional speech differs across three vocal tract sections. The vocal tract area was computed from the area function of the emotional speech. The whole vocal tract was divided into three sections (vocal fold section, middle section, and lip section) to capture the differences in each section. The experimental data consist of speech in six emotions from three male and three female speakers. The six emotions are neutral, happiness, anger, sadness, fear, and boredom. The difference is measured as the ratio of each emotional speech to the normal speech. The experimental results show no remarkable difference at the lip section, whereas fear and sadness show a large change at the vocal fold section.
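One way to picture the section-wise ratio described here is the sketch below. The abstract does not say how the section boundaries were chosen or how each section was summarized, so the equal-thirds split, the use of the mean area per section, and all function names are assumptions made purely for illustration.

```python
def section_means(areas, n_sections=3):
    """Mean area of each of n_sections contiguous parts of an area function.

    Assumes the area function is ordered from the vocal folds to the lips and
    is split into roughly equal parts (an illustrative choice).
    """
    n = len(areas)
    if n < n_sections:
        raise ValueError("area function is shorter than the number of sections")
    bounds = [round(i * n / n_sections) for i in range(n_sections + 1)]
    return [sum(areas[lo:hi]) / (hi - lo) for lo, hi in zip(bounds, bounds[1:])]


def section_ratios(emotional_areas, neutral_areas):
    """Per-section ratio of emotional speech to neutral (normal) speech."""
    emotional = section_means(emotional_areas)
    neutral = section_means(neutral_areas)
    return [e / n for e, n in zip(emotional, neutral)]


# Usage with made-up area functions (vocal fold end first, lip end last).
ratios = section_ratios([2.0, 2.0, 2.0, 2.0, 3.0, 3.0],
                        [1.0, 1.0, 2.0, 2.0, 3.0, 3.0])
```

A ratio near 1 in a section would correspond to "no remarkable difference" from neutral speech in that section.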

On the Implementation of Articulatory Speech Simulator Using MRI

  • 조철우
    • Speech Sciences (음성과학) / Vol. 2 / pp. 45-55 / 1997
  • This paper describes the procedure for implementing an articulatory speech simulator that models the human articulatory organs and then synthesizes speech from the model. The images required to construct the vocal tract model were obtained from MRI and were then used to construct 2D and 3D vocal tract shapes. The 3D vocal tract shapes were constructed by spatially concatenating and interpolating sectional MRI images. The 2D vocal tract shapes were constructed and automatically converted into a digital filter model. Speech sounds corresponding to the model were then synthesized from the filter. All procedures in this study were implemented in MATLAB.

Vocal Tract Length Normalization for Speech Recognition

  • 지상문
    • Journal of the Korea Institute of Information and Communication Engineering (한국정보통신학회논문지) / Vol. 7, No. 7 / pp. 1380-1386 / 2003
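  • Variation in vocal tract length across speakers degrades the performance of speech recognizers. This study minimizes the effect of inter-speaker vocal tract length differences on the recognizer by stretching or compressing the frequency axis of the short-time spectrum extracted from the input speech. As frequency warping functions for normalizing vocal tract length, we consider a linear warping function and a piecewise linear warping function. We also propose a variable-interval piecewise linear function that can more effectively model the scaling of the frequency axis caused by large variations in vocal tract length. Applying the proposed method to the TIDIGITS connected-digit speech corpus reduced the word error rate substantially, from 2.15% to 0.53%, showing that vocal tract length normalization is essential for improving the performance of speaker-independent speech recognizers.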
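The frequency warping idea in this abstract can be illustrated with a conventional two-segment piecewise linear warp, sketched below. This is not the variable-interval function the paper proposes, whose details the abstract does not give. The function name, the `break_ratio` parameter, and the rule for placing the breakpoint are assumptions chosen so that the warped axis still ends exactly at the Nyquist frequency.

```python
def piecewise_linear_warp(freq_hz, alpha, nyquist_hz, break_ratio=0.85):
    """Warp one frequency with a two-segment piecewise linear function.

    Below the breakpoint the axis is scaled by alpha (alpha > 0); above it a
    second linear segment maps the remaining band onto the range up to
    nyquist_hz, so the warped frequency never leaves the analysis band.
    """
    # Keep alpha * breakpoint below Nyquist for both alpha < 1 and alpha > 1.
    breakpoint_hz = break_ratio * nyquist_hz * min(1.0, 1.0 / alpha)
    if freq_hz <= breakpoint_hz:
        return alpha * freq_hz
    warped_break = alpha * breakpoint_hz
    slope = (nyquist_hz - warped_break) / (nyquist_hz - breakpoint_hz)
    return warped_break + slope * (freq_hz - breakpoint_hz)


# Usage: warp the centre frequencies of a few spectral bins for one speaker.
# The warping factor 1.1 is an arbitrary illustrative value.
warped = [piecewise_linear_warp(f, alpha=1.1, nyquist_hz=8000.0)
          for f in (0.0, 1000.0, 4000.0, 8000.0)]
```

A purely linear warp with alpha greater than 1 would push the upper part of the spectrum beyond the Nyquist frequency, which is the usual motivation for adding the second segment.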

The Change of the Length of Vocal Tract in Singers according to the Phonation at Different Levels of Pitch

  • 반재호;김창규;이상혁;이경철;진성민
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics (대한후두음성언어의학회지) / Vol. 17, No. 1 / pp. 14-16 / 2006
  • Background and Objectives: The purpose of this study is to investigate how singers' vocal tract length changes with pitch. Materials and Methods: Fifteen tenors were asked to produce successive /a/ sounds at G4 (382 Hz) in the head register, at C3 (131 Hz) in the chest register, and at their usual speaking pitch. The control group consisted of 15 males of a similar age who were not professional singers. Vocal tract length was calculated with the formula $F_n = (2n-1)c/(4L)$, where $F_n$ is the $n$-th formant frequency, $c$ is the speed of sound in the vocal tract (350 m/s), $L$ is the length of the vocal tract, and $n = 1, 2, 3, \ldots$. Results: In the singers group, vocal tract length did not differ significantly among the head register, the chest register, and usual speaking. In the control group, however, the difference in length was statistically significant. The absolute change in vocal tract length with changing pitch was also compared between the two groups: when changing from G4 to C3 phonation and from C3 phonation to usual speaking, the change in vocal tract length was significantly smaller in the singers group than in the control group. Conclusion: The change in vocal tract length, in either speaking or singing, was smaller in singers than in the control group. We could assume that singers keep their larynx position constant throughout the pitch range during phonation.
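Solving the quarter-wavelength formula in this abstract for the length gives $L = (2n-1)c/(4F_n)$. The sketch below evaluates it with the stated $c$ = 350 m/s. The function names are hypothetical, and averaging the estimates over several formants is an illustrative assumption, since the abstract does not say which formants were used.

```python
SPEED_OF_SOUND_M_PER_S = 350.0  # value stated in the abstract


def vocal_tract_length_m(formant_hz, n, c=SPEED_OF_SOUND_M_PER_S):
    """Length L of a uniform tube, closed at one end, from its n-th resonance.

    Rearranges F_n = (2n - 1) * c / (4 * L) into L = (2n - 1) * c / (4 * F_n).
    """
    if n < 1 or formant_hz <= 0.0:
        raise ValueError("n must be >= 1 and the formant frequency positive")
    return (2 * n - 1) * c / (4.0 * formant_hz)


def mean_vocal_tract_length_m(formants_hz, c=SPEED_OF_SOUND_M_PER_S):
    """Average of the per-formant estimates for F1, F2, ... (assumed choice)."""
    lengths = [vocal_tract_length_m(f, n, c)
               for n, f in enumerate(formants_hz, start=1)]
    return sum(lengths) / len(lengths)


# Usage: an idealized neutral vowel with F1 = 500 Hz gives L = 0.175 m.
length_m = vocal_tract_length_m(500.0, n=1)
```

For a perfectly uniform tube every formant yields the same length, so the spread of the per-formant estimates gives a rough sense of how far a real vowel departs from that idealization.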

A Vowel Discrimination of Korean Monophthongs [i, e, a, o, u, ɯ] Using Vocal Tract Magnetic Resonance Image and F1/F2

  • 성철재;박종원;김귀룡
    • Malsori (대한음성학회지:말소리) / No. 56 / pp. 103-125 / 2005
  • We present a new method of measuring the volume and cross-sectional area of the vocal tract from magnetic resonance images. The vocal tract was divided at two constriction points on the horizontal and vertical planes. The ratios of the volumes of the vocal tract segments to that of the entire vocal tract play a crucial role in discriminating Korean monophthongs: the vowels were successfully discriminated by these ratios. The discriminant analysis also demonstrated that the acoustic parameters F1 and F2, in addition to the segment volumes, serve as significant parameters in discriminating Korean monophthongs.

A Study on Speech Recognition using Vocal Tract Area function and Vector Quantization

  • 송제혁;김동준;박상희
    • Conference proceedings of the Korean Society of Medical and Biological Engineering (대한의용생체공학회) / 1993 Fall Conference / pp. 171-174 / 1993
  • We propose the vocal tract area function as a feature vector for speech recognition. The vocal tract area function is directly related to speech production. This study shows that it not only reflects the mechanism of speech production but can also be used as an effective feature vector in speech recognition.
