• Title/Summary/Keyword: sustained vowel phonation

Search Result 32, Processing Time 0.017 seconds

Comparison of Acoustic Parameters According to the Section of Analysis in Sustained Vowel Phonation (모음연장 음성 샘플의 분석 구간에 따른 음향학적 파라미터 비교)

  • Shin, Yu-Jeong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.18 no.7
    • /
    • pp.269-274
    • /
    • 2017
  • This study aimed to investigate the acoustic differences that occur in diverse sections of sustained vowel phonation, which is often used in an objective speech analysis of voice disorder patients. The subjects included 17 voice disorder patients (vocal nodules) and 12 normal individuals without any voice disorder. The participants' sustained vowel phonation of /a/ was divided into onset, middle, and offset, and the jitter, shimmer, and NHR in each section were analyzed using the MDVP(Multi-Dimensional Voice Program). The Friedman test and post hoc analysis were used. In the vocal nodules group, the jitter, shimmer and NHR were significantly higher in the off section of sustained vowel phonation than in the middle section, and there were no significant differences between the beginning and middle sections. In contrast, in the group of normal individuals, there were no significant differences between any of the sections. The values of the acoustic parameters according to the section of analysis in the sustained vowel phonation are different and the vocal in the end section is significantly more unstable than that in the middle section. The results of this study will be useful for selecting the sections to be analyzed in sustained vowel phonation and interpreting the results of the analysis.

Comparison of Vowel and Text-Based Cepstral Analysis in Dysphonia Evaluation (발성장애 평가 시 /a/ 모음연장발성 및 문장검사의 켑스트럼 분석 비교)

  • Kim, Tae Hwan;Choi, Jeong Im;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.26 no.2
    • /
    • pp.117-121
    • /
    • 2015
  • Background : Cepstral analysis which is obtained from Fourier transformation of spectrum has been known to be effective indicator to analyze the voice disorder. To evaluate the voice disorder, phonation of sustained vowel /a/ sound or continuous speech have been used but the former was limited to capture hoarseness properly. This study is aimed to compare the effectiveness in analysis of cepstrum between the sustained vowel /a/ sound and continuous speech. Methods : From March 2012 to December 2014, total 72 patients was enrolled in this study, including 24 unilateral vocal cord palsy, vocal nodule and vocal polyp patients, respectively. The entire patient evaluated their voice quality by VHI (Voice Handicap Index) before and after treatment. Phonation of sustained vowel /a/ sample and continuous speech using the first sentence of autumn paragraph was subjected by cepstral analysis and compare the pre-treatment group and post-treatment group. Results : The measured values of pre and post treatment in CPP-a (cepstral peak prominence in /a/ vowel sound) was 13.80, 13.91 in vocal cord palsy, 16.62, 17.99 in vocal cord nodule, 14.19, 18.50 in vocal cord polyp respectively. Values of CPP-s (cepstral peak prominence in text-based speech) in pre and post treatment was 11.11, 12.09 in vocal cord palsy, 12.11, 14.09 in vocal cord nodule, 12.63, 14.17 in vocal cord polyp. All 72 patients showed subjective improvement in VHI after treatment. CPP-a showed statistical improvement only in vocal polyp group, but CPP-s showed statistical improvement in all three groups (p<0.05). Conclusion : In analysis of cepstrum, text-based analysis is more representative in voice disorder than vowel sound speech. So when the acoustic analysis of voice by cepstrum, both phonation of sustained vowel /a/ sound and text based speech should be performed to obtain more accurate result.

  • PDF

A Comparison of Voice Analysis of Children with Cochlear Implant and with Normal Hearing (인공와우이식 아동과 건청 아동의 음성 분석 비교)

  • Yoon, Misun;Choi, Eunah;Sung, Youngju
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.71-78
    • /
    • 2013
  • The purpose of this study was to compare the acoustic voice outcomes of children with cochlear implant to those of children with normal hearing. Participants were 41 children using unilateral cochlear implant (18 males and 23 females), and children with normal hearing from the same age and sex. Mean age of implantation was approximately 3 years old, mean duration of implant use was 4 years in CI group. Acoustic analyses were performed using MDVP of CSL. Speech samples were 3 sustained vowels, /a, i, u/. 9 parameters (F0, Fhi, Flo, Jitter, Shimmer, vF0, vAm, NHR, and SPI) were analyzed. Children with CI did not show the significant differences in those parameters after the vowel /a/ phonation. Meanwhile, there were significantly different results in F0, Fhi, vF0, and SPI after /i, u/ phonation. These results revealed that differences of voice characteristics in children with CI compare to children with NH persist regarding vowel context. It suggests that high vowels would recommend as speech samples for acoustic evaluation. Futhermore perceptual analysis and speech therapy for phonation control would be necessary for children with CI.

Automatic Speaker Identification by Sustained Vowel Phonation (지속적으로 발성한 모음에 의한 화자인식)

  • Bae, Geon-Seong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.11 no.1
    • /
    • pp.35-41
    • /
    • 1992
  • A speaker identification scheme using the speaker-based VQ codecook of a sustained vowel is proposed and tested. With the pitch synchronous LPC vector of the sustained vowel /i/ as a feature vector, a VQ codebook size of 4 was found to be suitable to characterize each speaker's feature space. For 40 normal speakers (20 males, 20 females), we achieved the correct identification rate of 99.4% with a training data set, and 89.4% with a test data set with speech samples of only 50 pitch periods.

  • PDF

Sustained Vowel Modeling using Nonlinear Autoregressive Method based on Least Squares-Support Vector Regression (최소 제곱 서포트 벡터 회귀 기반 비선형 자귀회귀 방법을 이용한 지속 모음 모델링)

  • Jang, Seung-Jin;Kim, Hyo-Min;Park, Young-Choel;Choi, Hong-Shik;Yoon, Young-Ro
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.7
    • /
    • pp.957-963
    • /
    • 2007
  • In this paper, Nonlinear Autoregressive (NAR) method based on Least Square-Support Vector Regression (LS-SVR) is introduced and tested for nonlinear sustained vowel modeling. In the database of total 43 sustained vowel of Benign Vocal Fold Lesions having aperiodic waveform, this nonlinear synthesizer near perfectly reproduced chaotic sustained vowels, and also conserved the naturalness of sound such as jitter, compared to Linear Predictive Coding does not keep these naturalness. However, the results of some phonation are quite different from the original sounds. These results are assumed that single-band model can not afford to control and decompose the high frequency components. Therefore multi-band model with wavelet filterbank is adopted for substituting single band model. As a results, multi-band model results in improved stability. Finally, nonlinear sustained vowel modeling using NAR based on LS-SVR can successfully reconstruct synthesized sounds nearly similar to original voiced sounds.

The characteristics of soprano students' voice related to the vocal methods (발성방법에 따른 소프라노 성악도의 음성 특성)

  • Kim, Jungtaek;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.75-83
    • /
    • 2017
  • The purpose of this study is to find clues to the risk of voice disorders in soprano students. The subjects of the study were 17 soprano students and 18 general students (women). The phonation of vowels /a/, /i/, and /u/ with C4 and F4 notes in each group were recorded. Then, only soprano students were made to record their classical vocalization containing vibrato. Formant, formant energy, bandwidth, VAI (vowel area index), VSA (vowel space area) and L/H ratio were analyzed. There was significant difference in F3 such that the singers' note was measured around 3 kHz which seems to be 400 Hz higher than one from general students. But, There was no significant difference in L/H ratio between soprano student and the general student. There was a significant difference in F3 in the comparison of the soprano students' two vocalization methods. Classical vocalization was measured at 200Hz higher than sustained phonation in F3. Vocal tract adjustment was made and vowel space changed, but there was no significant difference in F3 energy, which is the index of singers' formant according to the phonation method. The L/H ratio, which can be a direct indicator of vocal effort, has no difference in phonation method and is lowered in all phonation methods as the pitch increases. C4 and F4 pitches are lower than the singing range of the soprano. When the pitch changes, vocal effort increases like a general student which will be an indicator of the risk of vocalization. This will be a clue to the vocalization of the immature soprano student.

Acoustic characteristics of the sustained vowel phonation according to age groups (모음 연장 발성이 보이는 연령대별 음향음성학적 특성 연구)

  • Seo, Yoon-Jeong;Shin, Jiyoung
    • Phonetics and Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.67-76
    • /
    • 2018
  • This study was performed to investigate acoustic characteristics of sustained vowels produced by Seoul Korean speakers. For this study, three hundred nine healthy adults were chosen as participants from Korean Standard Speech Database. These subjects were divided into five chronological age groups (20s, 30s, 40s, 50s, 60-70s) and two gender groups (male and female). Fundamental frequency (f0), jitter, shimmer, and NHR (noise-to-harmonics ratio) was measured with 8 Korean vowels (/ɑ/, /æ/, /ʌ/, /e/, /o/, /u/, /ɯ/, /i/) by using Praat. The results showed that the vowel type significantly affected all acoustic parameters. Gender affected f0, jitter, and NHR significantly. The mean female speakers' f0 was greater than the males', and the mean jitter and NHR of male speakers was greater than the females'. Moreover, age affected shimmer and NHR significantly; in particular, the shimmer and NHR of elderly speakers was greater than the young speakers.

Layngeal Function Assessment by Electroglottographic Signal Analysis during Sustained Vowel Phonation (연속모음에서의 Electroglottograph 신호해석에 의한 후두기능 평가)

  • Song, Chul-Gyu;Lee, Myoung-Ho
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1994 no.05
    • /
    • pp.79-81
    • /
    • 1994
  • Petubation in the fundamental frequency and in the peak amplitude of the EGG signal derived with a four-electrode EGG system were investigated for the purpose of developing useful measures for the detection of layngeal pathology. The data were compared to the degree of amplitude perturbation and frequency perturbation. There was a close relation between amplitude perturbation and frequency perturbation analysis of EGG signal and degree of laryngeal pathology.

  • PDF

Alterations of Mucosal Vibration of True Vocal Folds on Tongue-Tip Trill : Preliminary Study Using the Electroglottography (Trill 발성시 전기성문파 측정검사로 분석한 성대점막 진동의 변화 : 예비연구)

  • 진성민;반재호;김남훈;이경철;권기환;이용배
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.11 no.1
    • /
    • pp.76-80
    • /
    • 2000
  • Tongue-tip trill is a sound made by the tongue tip making contract with the alveolar ridge and oscillating rapidly as sound is produced. It is an exercise used by many singers to warm up the voice and used as one of the methods of voice rehabilitation for patients who have the vocal folds scarred postoperatively and also who present with a variety of disorders, particularly hypofunction and presbyphonia. We intended to investigate the mucosal vibration of the true vocal folds on tongue-tip trill by electroglottography and to find e effective methods of tongue-tip trill. One adult male volunteer participated. Spectrography and electroglottography were checked repeatedly 15 times, more than 5 second in each times, at same pitch, in three conditions of phonation : sustained /a/ vowel, anterior trill in which tongue-tip vibrated at anterior portion of alveolar ridge just behind the anterior tooth, and posterior trill in which at palatal crest behind the transverse palatine fold We measured the first and second formant to determine indirectly the position of tongue and calculated speed quotient and the ratio of closing phase to closed phase. Speed quotients of posterior trill were higher than sustained /a/ vowel and anterior trill in 14 times. The ratio of closing phase to dosed phase of posterior trill were lower than the others in 14 times. Mucosa of true vocal folds is vibrated more effectively on posterior trill rather than sustained /a/ vowel and anterior trill. So, when tongue-tip trill is used as a method of voice rehabilitation, we suggest that posterior trill is better in producing effective mucosal vibration

  • PDF