• Title/Abstract/Keywords: Vocal tract characteristics

Spectral Characteristics and Nasalance Scores of Hypernasality in Patient with Cleft Palate

  • Soh, Byung-Soo;Shin, Hyo-Keun;Kim, Hyun-Gi
    • Speech Sciences
    • /
    • Vol. 12, No. 1
    • /
    • pp.27-35
    • /
    • 2005
  • Differential instrumentation has been used in diagnosing individuals with cleft palate to measure speech problems objectively. The cepstrum method was used to study the vocal tract transfer function. Both the vocal tract transfer function and the source spectrum should be considered in the evaluation of nasal resonance. The aim of this study was to collect quantitative data on the acoustic instrumentation used for evaluating hypernasality. Normal subjects (9 male, 21 female; 37 male children, 20 female children) and individuals with VPI (13 male, 8 female; 16 male children, 9 female children) participated in this study. The vowel /i/ was selected to gauge the severity of hypernasality. Spectral and cepstral analyses using CSL were used to identify the acoustic characteristics. Cepstrum analysis showed significant differences in quefrency and amplitude. The quefrency of the normal groups was shorter than that of the VPI groups, while the amplitude of the normal groups was lower than that of the VPI groups. This may have significance in the evaluation of nasal resonance.

A Study on the Aerodynamic and Acoustic Characteristics in Dysarthria Speakers' Diadochokinesis by Articulation Valves in Vocal Tract

  • 박희준;권순복;왕수건;정옥란
    • Speech Sciences
    • /
    • Vol. 15, No. 2
    • /
    • pp.177-189
    • /
    • 2008
  • This study investigated the diadochokinetic (DDK) rate, regularity, and mean flow rate of the articulation valves in dysarthria. DDK rate, mean airflow rate (MFR), and regularity of DDK syllable repetitions for vocal function /ihi/, tongue function /ta/, velopharyngeal function /bm/, and labial function /pa/ were measured in 24 normal and dysarthric speakers. Aerophone II and Motor Speech Profile were used for data recording and analysis. The findings were as follows. First, there was a significant difference between the dysarthria and normal groups in DDK rate; DDK rates were lowest in ataxic dysarthria, followed in sequence by spastic, flaccid, and hypokinetic dysarthria. Second, there was a significant difference between the dysarthria and normal groups in DDK regularity. Third, there was a significant difference between the dysarthria groups and the normal group in DDK MFR. Finally, there was a significant difference between the four dysarthria groups and the normal group in DDK airflow tracking. The results of this study can serve as guidelines for normal DDK rate, regularity, and flow rate in dysarthria groups. In addition, differential diagnosis and description according to DDK characteristics are important for decisions on the medical and behavioral management of individuals with these disorders.

A Study on the Estimation of Glottal Spectrum Slope Using the LSP (Line Spectrum Pairs)

  • 민소연;장경아
    • Speech Sciences
    • /
    • Vol. 12, No. 4
    • /
    • pp.43-52
    • /
    • 2005
  • The common form of the pre-emphasis filter is $H(z) = 1 - az^{-1}$, where $a$ typically lies between 0.9 and 1.0 for voiced signals. This value reflects the degree of filtering and equals $R(1)/R(0)$ in the autocorrelation method. This paper proposes a new flattening algorithm to compensate for the weakened high-frequency components caused by vocal cord characteristics. We used the interval information of the LSPs to estimate formant frequencies. After the slope and inverse slope are obtained by linear interpolation among the formant frequencies, the flattening process follows. Experimental results show that the proposed algorithm effectively flattened the weakened high-frequency components. That is, using LSP interval information as the flattening factor improved the flattening when compensating for weakened high-frequency components.
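
As a minimal illustration of the conventional pre-emphasis step mentioned in this abstract, the sketch below estimates $a = R(1)/R(0)$ from a frame and applies $y[n] = x[n] - a\,x[n-1]$. It is not the paper's LSP-based flattening algorithm; the function names and the zero initial condition ($x[-1] = 0$) are assumptions made for the example.

```python
def autocorr(x, lag):
    """Autocorrelation sum R(lag) = sum over n of x[n] * x[n - lag]."""
    return sum(x[n] * x[n - lag] for n in range(lag, len(x)))


def preemphasis_coefficient(x):
    """Estimate a = R(1) / R(0); returns 0.0 for an all-zero frame."""
    r0 = autocorr(x, 0)
    return autocorr(x, 1) / r0 if r0 else 0.0


def preemphasize(x, a):
    """Apply H(z) = 1 - a z^-1, i.e. y[n] = x[n] - a * x[n-1].

    Assumes x[-1] = 0, so the first output sample equals the first input.
    """
    return [x[n] - (a * x[n - 1] if n > 0 else 0.0) for n in range(len(x))]
```

For a frame `x`, `preemphasize(x, preemphasis_coefficient(x))` adapts the filter to that frame; a fixed coefficient in the 0.9 to 1.0 range can be passed instead.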

The Role of the Electroglottography on the Laryngeal Articulation of Speech

  • 홍기환;박병암;양윤수;서수영;김현기
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • Vol. 8, No. 1
    • /
    • pp.18-26
    • /
    • 1997
  • There are two types of phonetic study, acoustic and physiologic, for differentiating the three manner categories of Korean stop consonants. The physiologic studies include endoscopic, electromyographic (EMG), electroglottographic (EGG), and aerodynamic studies. In this study, I investigated general features of Korean stops using EGG, measuring the open quotient of the vocal folds and the baseline shift during speech, together with aerodynamic characteristics: subglottal air pressure, airflow, and glottal resistance at consonants. In the aerodynamic study, the glottalized and aspirated stops were characterized by increased subglottal pressure compared with the lenis stop at consonants. The airflow was largest in the aspirated stops, followed by the lenis and then the glottalized stops. The glottal airway resistance (GAR) was highest in the glottalized stops, followed by the lenis, and lowest in the aspirated stops during the production of consonants; during the production of vowels it was highest in the aspirated stops and low in the glottalized and lenis stops. The glottal resistance at consonants showed a significant difference among consonants and a significant interaction between subject and type of consonant. The glottal resistance at vowels showed a significant difference among consonants, and an interaction occurred between subject and type of consonant. Electroglottography (EGG) has been used to investigate the functioning of the vocal folds during vibration. The EGG should be related to the patterns of vocal fold vibration during phonation, characterizing the temporal pattern of each vibratory cycle. The purpose of this study is to investigate the dynamic change of EGG waveforms during continuous speech.
The dynamic changes of EGG waveforms for the three-way distinction of Korean stops were as follows: the aspirated stop was characterized by the largest open quotient and smallest glottal contact area of the vocal folds in the initial portion of vocal fold vibration; the lenis stop by a moderate open quotient and glottal contact area; and the glottalized stop by the smallest open quotient and largest glottal contact area. There may be a close relationship between the open quotient (OQ) at the initial voice onset and the glottal width at the time of consonant production: the larger glottal width just before vocal fold vibration results in the smaller OQ of the vocal fold vibration at the initial voice onset. The baseline shift of the EGG during continuous speech production showed different patterns for the three types of Korean consonants. A small and less steep change of baseline shift was found for the lenis and the glottalized stops, and the largest and steepest change for the aspirated stops. The baseline shift at the initial voice onset showed patterns similar to those for consonant production, with a larger change in the aspirated stops. For the lenis and the glottalized stops during the initial voice onset, the three subjects showed individual differences. I suggest that these characteristics are strongly related to the articulatory activity of the vocal tract for the production of consonants, especially for the aspirated stop. The factors suspected to affect EGG waveforms are glottal width, vertical laryngeal movement, and the intrapharyngeal pressure on neighboring tissue during connected speech.
Thus EGG may be a useful method for describing laryngeal activity and classifying pulsing conditions of the larynx during speech production, and EGG research can serve as a control for monitoring vocal tract articulation, although the above factors affecting EGG may have played a potential role in the vocal fold vibratory behavior obtained using consonant production.
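
The two derived measures named in this abstract have simple textbook definitions, sketched below: the open quotient is the fraction of a vibratory cycle during which the glottis is open, and glottal airway resistance is subglottal pressure divided by airflow. The function names, argument units, and validation rules are assumptions for illustration; the abstract does not state how the authors computed either quantity.

```python
def open_quotient(open_duration_s, period_s):
    """Fraction of one vibratory cycle during which the glottis is open."""
    if period_s <= 0:
        raise ValueError("period must be positive")
    if not 0 <= open_duration_s <= period_s:
        raise ValueError("open duration must lie within the cycle")
    return open_duration_s / period_s


def glottal_airway_resistance(subglottal_pressure_cmh2o, airflow_l_per_s):
    """Pressure divided by flow, in cmH2O per (L/s); units are assumed."""
    if airflow_l_per_s <= 0:
        raise ValueError("airflow must be positive")
    return subglottal_pressure_cmh2o / airflow_l_per_s
```

Under these definitions, a higher airflow at the same subglottal pressure gives a lower resistance, which is consistent with the abstract's ordering of airflow and GAR across the aspirated and glottalized stops at consonants.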

Extraction of Speech Features for Emotion Recognition

  • 권철홍;송승규;김종열;김근호;장준수
    • Phonetics and Speech Sciences
    • /
    • Vol. 4, No. 2
    • /
    • pp.73-78
    • /
    • 2012
  • Emotion recognition is an important technology in the field of human-machine interfaces. To apply speech technology to emotion recognition, this study aims to establish a relationship between emotional groups and their corresponding voice characteristics by investigating various speech features. Features related to both the speech source and the vocal tract filter are included. Experimental results show that the statistically significant speech parameters for classifying the emotional groups are mainly related to the speech source, such as jitter, shimmer, F0 (F0_min, F0_max, F0_mean, F0_std), harmonic parameters (H1, H2, HNR05, HNR15, HNR25, HNR35), and SPI.
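
Jitter and shimmer, two of the source features listed in this abstract, are commonly defined as the mean absolute difference between consecutive cycle periods (jitter) or consecutive peak amplitudes (shimmer), divided by the mean value. The sketch below follows that common "local" definition. It is an assumption that this matches the variant the authors used, and the helper name and sample values are hypothetical.

```python
def local_perturbation(values):
    """Mean absolute difference of consecutive values, relative to the mean.

    Pass cycle periods to get local jitter, or cycle peak amplitudes to get
    local shimmer. Returns a ratio; multiply by 100 for a percentage.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    mean_value = sum(values) / len(values)
    if mean_value == 0:
        raise ValueError("mean of the values must be non-zero")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / mean_value


# Hypothetical cycle periods in seconds, alternating between 10 ms and 11 ms.
periods = [0.010, 0.011, 0.010, 0.011]
jitter = local_perturbation(periods)
```

A perfectly periodic signal gives a value of 0 for both measures; larger values indicate more cycle-to-cycle irregularity.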

Radiological and acoustic characteristics of "Arae-a" (/ㆍ/) articulation in Jeju language speakers

  • 이승진;최홍식
    • Phonetics and Speech Sciences
    • /
    • Vol. 10, No. 1
    • /
    • pp.57-64
    • /
    • 2018
  • The purpose of the present study was to explore the radiological and acoustic characteristics of "Arae-a" (/ㆍ/) articulation in two male Jeju language speakers, focusing on selected measures in radiological images derived from computed tomography scans, as well as the first and the second formant measures in selected vowels. An elderly male speaker (a 78-year-old) and a young male speaker (a 34-year-old) participated in the study. During the production of four selected vowels, the shape of the vocal tract was identified, and selected measures were obtained from the elderly participant's computed tomography (CT) scans. For acoustic analysis, the participants were given a list of near-minimal pairs consisting of 112 words and asked to read them aloud. The results indicated that the "Arae-a" (/ㆍ/) articulation of the elderly speaker showed unique acoustic and radiological characteristics compared to other similar vowels, thus presenting substantial consistency with the descriptions of the "Hunminjeongeum Haeryebon." In contrast, the F1 and F2 measures of the young male's /ㆍ/ articulation were not distinguished from those of /ㅗ/. Current results, in part, support the scientific principles underlying the invention of "Arae-a," which reflects the shape of the vocal tract during production, and the necessity for further research.

Voice Conversion Using Linear Multivariate Regression Model and LP-PSOLA Synthesis Method

  • 권홍석;배건성
    • The Journal of the Acoustical Society of Korea
    • /
    • Vol. 20, No. 3
    • /
    • pp.15-23
    • /
    • 2001
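  • This paper describes voice conversion, a technique that makes speech uttered by one speaker sound as if it were uttered by another, and tests a conversion method that transforms the vocal tract parameters and the excitation signal parameters between speakers independently. The vocal tract parameters are converted by extracting the LPC (Linear Predictive Coefficient) cepstrum from the input speech signal and applying it to a linear multivariate regression model. The excitation parameters are converted by extracting the residual signal and converting the average pitch period between speakers with the LP-PSOLA (Linear Predictive-Pitch Synchronous Overlap and Add) synthesis method. Experimental results show that speech converted with the linear multivariate regression model and LP-PSOLA synthesis is similar to the target speaker's voice.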

Comparison of Speaker's Source Characteristics in Different Recording Environments by Using Phonation Type Index k

  • 이후동;강선미;박한상;장문수
    • Speech Sciences
    • /
    • Vol. 10, No. 3
    • /
    • pp.213-224
    • /
    • 2003
  • Speech includes not only the speaker's source but also the characteristics of the vocal tract and speech radiation. This paper builds on the theory of Park[1], who proposes the Phonation Type Index (PTI) k, a variable that captures the characteristics of the speaker's source while excluding those of the vocal tract and speech radiation. Following Park's theory, we collect data under different recording environments, expand the experimental data, and analyze the collected data to see whether PTI k shows good discriminating power as a variable for speaker recognition. In the experiment, we record 8 sentences ten times each for 5 male speakers in a recording room and in an office, extract PTI k for each speaker, and measure the discriminating power for each speaker using the value of PTI k. The results show that PTI k has excellent power to discriminate between speakers. We also confirm that PTI k shows similar results even when the recording environment changes.

Analysis and Comparisons of Acoustical Characteristics of Pathologic Voice before and after Surgery

  • 김대현;조철우;백무진;왕수건
    • Speech Sciences
    • /
    • Vol. 7, No. 3
    • /
    • pp.285-294
    • /
    • 2000
  • In this paper, the acoustic characteristics of pathological voice measured before and after surgery are compared. The experiment is conducted for the purpose of predicting patients' speech after the operation. The voices are recorded from the same patients. Jitter, shimmer, and other parameters are computed and their statistical characteristics are compared. Spectral changes, such as formant frequency shift and spectral slope change, are also compared. The experimental results verify that not only the source characteristics but also the vocal tract components vary. This indicates that modifying the source parameters alone is not enough for the prediction. The results also indicate that the operation changes both the physical shape of the vocal folds and the manner of articulation.

A Preliminary Study on Correlation between Voice Characteristics and Speech Features

  • 한성만;김상범;김종열;권철홍
    • Phonetics and Speech Sciences
    • /
    • Vol. 3, No. 4
    • /
    • pp.85-91
    • /
    • 2011
  • Sasang constitutional medicine uses voice characteristics to diagnose a person's constitution. To classify Sasang constitutional groups using speech information technology, this study aims to establish the relationship between Sasang constitutional groups and their corresponding voice characteristics by investigating various speech feature variables. The speech variables include features related to the speech source and the vocal tract filter. Experimental results show a statistically significant correlation between voice characteristics and some speech feature variables.
