• Title/Summary/Keyword: sustained vowel

A Study of Acoustic Characteristics of Two Syllables Words and Sustained Vowel (병적음성에 대한 지속 모음 및 이음절어 발화시 나타나는 음향학적 차이에 대한 연구)

  • 채윤정;김범규;홍기환
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.11 no.1 / pp.104-112 / 2000
  • Voice disorders are evaluated by two methods: perceptual analysis and acoustic analysis. Both methods have focused mainly on sustained vowels, and conversational speech in voice disorders has not been analyzed sufficiently. The purpose of the present study is to compare two-syllable words and sustained vowels in patients with vocal polyps and in normal male speakers, and to provide basic data for vocal assessment and voice therapy. Fifteen male patients with vocal polyps formed the subject group, and fifteen healthy males formed the control group. The voices of both groups were recorded in MDVP of CSL and analyzed with its built-in analysis program. In the subject group, voice quality did not differ between the vowel following a lenis stop and the sustained vowel, but differed significantly between the vowel following a heavily aspirated stop and the sustained vowel. In the control group, the vowels following stops and the sustained vowel also differed in voice quality in many respects, and the difference was especially significant between the vowel following a glottal stop and the sustained vowel.

Sustained Vowel Modeling using Nonlinear Autoregressive Method based on Least Squares-Support Vector Regression (최소 제곱 서포트 벡터 회귀 기반 비선형 자귀회귀 방법을 이용한 지속 모음 모델링)

  • Jang, Seung-Jin;Kim, Hyo-Min;Park, Young-Choel;Choi, Hong-Shik;Yoon, Young-Ro
    • Journal of the Korean Institute of Intelligent Systems / v.17 no.7 / pp.957-963 / 2007
  • In this paper, a Nonlinear Autoregressive (NAR) method based on Least Squares-Support Vector Regression (LS-SVR) is introduced and tested for nonlinear sustained vowel modeling. On a database of 43 sustained vowels from speakers with benign vocal fold lesions, which have aperiodic waveforms, this nonlinear synthesizer reproduced chaotic sustained vowels almost perfectly and preserved the naturalness of the sound, such as jitter, whereas Linear Predictive Coding does not preserve this naturalness. However, the results for some phonations differed considerably from the original sounds. This is attributed to the single-band model being unable to control and decompose the high-frequency components. A multi-band model with a wavelet filterbank was therefore adopted in place of the single-band model, and it resulted in improved stability. Overall, nonlinear sustained vowel modeling using NAR based on LS-SVR can reconstruct synthesized sounds that are very similar to the original voiced sounds.

Comparison of Acoustic Parameters According to the Section of Analysis in Sustained Vowel Phonation (모음연장 음성 샘플의 분석 구간에 따른 음향학적 파라미터 비교)

  • Shin, Yu-Jeong
    • Journal of the Korea Academia-Industrial cooperation Society / v.18 no.7 / pp.269-274 / 2017
  • This study aimed to investigate the acoustic differences that occur in different sections of sustained vowel phonation, which is often used in objective speech analysis of patients with voice disorders. The subjects included 17 voice disorder patients (vocal nodules) and 12 normal individuals without any voice disorder. Each participant's sustained phonation of /a/ was divided into onset, middle, and offset sections, and the jitter, shimmer, and NHR in each section were analyzed using the MDVP (Multi-Dimensional Voice Program). The Friedman test and post hoc analysis were used. In the vocal nodules group, the jitter, shimmer, and NHR were significantly higher in the offset section than in the middle section, and there were no significant differences between the onset and middle sections. In contrast, in the group of normal individuals, there were no significant differences between any of the sections. The values of the acoustic parameters therefore differ according to the section analyzed, and phonation in the offset section is significantly more unstable than in the middle section. The results of this study will be useful for selecting the sections to be analyzed in sustained vowel phonation and for interpreting the results of the analysis.
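
As an illustration of the per-section comparison described above, the following minimal Python sketch computes a commonly used local perturbation measure (mean absolute cycle-to-cycle difference divided by the mean value) for three sections of a phonation. It is not MDVP's implementation. The function names, the split into equal thirds, and the period values are hypothetical, since the abstract does not say how the sections were delimited. Applied to cycle periods the measure is a jitter-like percentage; applied to cycle peak amplitudes it is a shimmer-like percentage.

```python
from statistics import mean


def local_perturbation(values):
    """Mean absolute difference between consecutive cycles, as a percentage of the mean."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * mean(diffs) / mean(values)


def split_into_sections(values):
    """Split a cycle sequence into equal thirds (an assumption, not the study's rule)."""
    n = len(values) // 3
    return {
        "onset": values[:n],
        "middle": values[n:2 * n],
        "offset": values[2 * n:],
    }


# Hypothetical glottal cycle periods in milliseconds, for illustration only.
periods_ms = [5.0, 5.1, 5.0, 5.1,   # onset
              5.0, 5.0, 5.0, 5.0,   # middle
              5.0, 5.2, 4.8, 5.2]   # offset

sections = split_into_sections(periods_ms)
jitter_by_section = {name: local_perturbation(p) for name, p in sections.items()}

for name, value in jitter_by_section.items():
    print(f"{name}: {value:.2f} %")
```

With these made-up values the offset section shows the largest perturbation, which mirrors the direction of the finding reported for the vocal nodules group but says nothing about its size.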

Automatic Speaker Identification by Sustained Vowel Phonation (지속적으로 발성한 모음에 의한 화자인식)

  • Bae, Geon-Seong
    • The Journal of the Acoustical Society of Korea / v.11 no.1 / pp.35-41 / 1992
  • A speaker identification scheme using a speaker-based VQ codebook of a sustained vowel is proposed and tested. With the pitch-synchronous LPC vector of the sustained vowel /i/ as the feature vector, a VQ codebook size of 4 was found to be suitable for characterizing each speaker's feature space. For 40 normal speakers (20 males, 20 females), a correct identification rate of 99.4% was achieved with the training data set, and 89.4% with a test data set containing speech samples of only 50 pitch periods.
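
The decision rule of a codebook-based scheme like the one above can be sketched as follows: each speaker is represented by a small codebook, and a test utterance is assigned to the speaker whose codebook gives the lowest average quantization distortion. This is a minimal sketch under stated assumptions, not the paper's implementation: the codebooks are taken as already trained, the squared Euclidean distance stands in for whatever distortion measure the authors used on LPC vectors, and all names and two-dimensional vectors are hypothetical.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two feature vectors (assumed distortion measure)."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def average_distortion(vectors, codebook):
    """Average distance from each test vector to its nearest codeword."""
    return sum(min(squared_distance(x, c) for c in codebook) for x in vectors) / len(vectors)


def identify(vectors, codebooks):
    """Return the speaker whose codebook quantizes the test vectors with least distortion."""
    return min(codebooks, key=lambda speaker: average_distortion(vectors, codebooks[speaker]))


# Hypothetical codebooks of size 4, one per speaker.
codebooks = {
    "speaker_A": [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)],
    "speaker_B": [(3.0, 3.0), (3.0, 4.0), (4.0, 3.0), (4.0, 4.0)],
}

# Hypothetical feature vectors from an unknown speaker.
test_vectors = [(0.1, 0.9), (0.9, 0.2), (1.1, 1.0)]

predicted = identify(test_vectors, codebooks)
print(predicted)
```

A codebook of size 4 keeps the per-speaker model very small, which fits the abstract's emphasis on identification from only 50 pitch periods.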

Comparison of Vowel and Text-Based Cepstral Analysis in Dysphonia Evaluation (발성장애 평가 시 /a/ 모음연장발성 및 문장검사의 켑스트럼 분석 비교)

  • Kim, Tae Hwan;Choi, Jeong Im;Lee, Sang Hyuk;Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.26 no.2 / pp.117-121 / 2015
  • Background: Cepstral analysis, which is obtained from the Fourier transform of the spectrum, is known to be an effective indicator for analyzing voice disorders. Voice disorders have been evaluated using either sustained phonation of the vowel /a/ or continuous speech, but the former is limited in capturing hoarseness properly. This study aimed to compare the effectiveness of cepstral analysis between the sustained vowel /a/ and continuous speech. Methods: From March 2012 to December 2014, a total of 72 patients were enrolled in this study: 24 each with unilateral vocal cord palsy, vocal nodules, and vocal polyps. All patients rated their voice quality with the VHI (Voice Handicap Index) before and after treatment. Samples of the sustained vowel /a/ and of continuous speech (the first sentence of the autumn paragraph) were subjected to cepstral analysis, and pre-treatment and post-treatment values were compared. Results: The pre- and post-treatment values of CPP-a (cepstral peak prominence in the /a/ vowel sound) were 13.80 and 13.91 in vocal cord palsy, 16.62 and 17.99 in vocal cord nodule, and 14.19 and 18.50 in vocal cord polyp, respectively. The pre- and post-treatment values of CPP-s (cepstral peak prominence in text-based speech) were 11.11 and 12.09 in vocal cord palsy, 12.11 and 14.09 in vocal cord nodule, and 12.63 and 14.17 in vocal cord polyp. All 72 patients showed subjective improvement in VHI after treatment. CPP-a showed statistically significant improvement only in the vocal polyp group, whereas CPP-s showed statistically significant improvement in all three groups (p<0.05). Conclusion: In cepstral analysis, text-based speech is more representative of the voice disorder than the sustained vowel. Therefore, when the voice is analyzed acoustically with the cepstrum, both sustained phonation of the vowel /a/ and text-based speech should be assessed to obtain a more accurate result.
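
To make the idea behind the cepstrum concrete, the sketch below computes a real cepstrum (the inverse Fourier transform of the log magnitude spectrum) of a synthetic periodic signal and locates the cepstral peak, which falls at the pitch period. It is a toy illustration only: it does not compute CPP as used in the study (which also involves a regression line and a dB scale), and the signal, the search range, and all names are hypothetical. A plain DFT is used so the example needs only the standard library.

```python
import cmath
import math


def dft(x, inverse=False):
    """Plain O(n^2) discrete Fourier transform; adequate for short illustrative signals."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
        out.append(s / n if inverse else s)
    return out


def real_cepstrum(signal, eps=1e-9):
    """Inverse DFT of the log magnitude spectrum; eps avoids taking log(0)."""
    log_magnitude = [math.log(abs(v) + eps) for v in dft(signal)]
    return [v.real for v in dft(log_magnitude, inverse=True)]


# Hypothetical periodic signal: a decaying pulse repeated every 32 samples.
period = 32
signal = [0.8 ** (n % period) for n in range(256)]

cepstrum = real_cepstrum(signal)

# Search for the peak only in a plausible pitch-period range (in samples).
low, high = 20, 50
peak_quefrency = max(range(low, high), key=lambda q: cepstrum[q])
print(peak_quefrency)
```

A strongly periodic voice gives a prominent peak at the pitch period, while a noisy or irregular voice flattens it, which is the intuition behind using cepstral peak prominence as a dysphonia measure.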

Cepstral and spectral analysis of voices with adductor spasmodic dysphonia (내전형연축성 발성장애 음성에 대한 켑스트럼과 스펙트럼 분석)

  • Shim, Hee Jeong;Jung, Hun;Lee, Sue Ann;Choi, Byung Heun;Heo, Jeong Hwa;Ko, Do-Heung
    • Phonetics and Speech Sciences / v.8 no.2 / pp.73-80 / 2016
  • The purpose of this study was to analyze perceptual and spectral/cepstral measurements in patients with adductor spasmodic dysphonia (ADSD). Sixty gender- and age-matched participants (30 with ADSD and 30 controls) were recorded reading a sentence and sustaining the vowel /a/. The recordings were analyzed acoustically by measuring CPP, L/H ratio, mean CPP F0, and CSID, and auditory-perceptual ratings were obtained using GRBAS. The main results can be summarized as follows: (a) the CSID for connected speech was significantly higher than for the sustained vowel; (b) the G, R, and S ratings for connected speech were significantly higher than for the sustained vowel; (c) spectral/cepstral parameters were significantly correlated with the perceptual parameters; and (d) the ROC analysis showed that a CSID threshold of 13.491 achieved good classification of ADSD, with 86.7% sensitivity and 96.7% specificity. Spectral and cepstral analysis of connected speech is especially meaningful in cases where perceptual analysis and clinical evaluation alone are insufficient.
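
The sensitivity and specificity of a score threshold such as the one above can be computed as in the following minimal sketch. The scores are hypothetical, and so is the assumption that a score at or above the threshold is classified as ADSD; the abstract does not report individual values. With 30 participants per group, the reported 86.7% and 96.7% would be consistent with 26 of 30 patients and 29 of 30 controls being classified correctly, although the abstract does not state these counts.

```python
def sensitivity_specificity(patient_scores, control_scores, threshold):
    """Classify scores >= threshold as positive (assumed rule) and return both rates."""
    true_positives = sum(score >= threshold for score in patient_scores)
    true_negatives = sum(score < threshold for score in control_scores)
    return true_positives / len(patient_scores), true_negatives / len(control_scores)


# Hypothetical CSID-like scores, for illustration only.
patients = [15.0, 20.0, 10.0, 30.0]
controls = [5.0, 8.0, 14.0, 2.0]

sensitivity, specificity = sensitivity_specificity(patients, controls, threshold=13.491)
print(sensitivity, specificity)

# Counts that would reproduce the reported percentages with 30 per group (an inference).
print(round(100 * 26 / 30, 1), round(100 * 29 / 30, 1))
```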

Formant Trajectories of English Vowels Produced by American Males (미국인 남성이 발음한 영어 모음의 포먼트 궤적)

  • Yang, Byung-Gon
    • Phonetics and Speech Sciences / v.1 no.3 / pp.65-72 / 2009
  • Formant values are the most important acoustic correlates of English vowels. Classical studies on English vowels reported the first three formant values measured at a single timepoint on a sustained vowel segment. However, many recent studies have revealed that partial onset or offset segments carrying information on dynamic spectral changes may contribute to the exact identification of English vowels, with an accuracy almost comparable to that obtained from the whole vowel segment or word. The purpose of this study was to examine formant trajectories of nine English vowels collected by Hillenbrand et al. (1995). Acoustic analysis was made systematically by a Praat script at six equidistant timepoints over the vowel segment. Results showed that the first formant trajectories played an important role in distinguishing each vowel within the front- or back-vowel groups. The second formant trajectories of the back vowels varied more drastically than those of the front vowels. The third formant values were similar across vowels except for the high vowel /i/. In the vowel space on F1 by F2 axes, the formant trajectories of each vowel clearly showed a transition toward the locus of the following consonant /d/. Other acoustic data revealed that the vowels had some inherent duration or pitch values. From this study we can conclude that dynamic spectral changes are very important in specifying the acoustic characteristics of English vowels. Further studies on vowels and diphthongs in different contexts are desirable.

A Study on the Acoustic Characteristics of Sexy Voice (섹시한 음성의 음향학적 특징 연구)

  • Jeong Ok-Ran;Jo Sung-Mi
    • MALSORI / no.57 / pp.73-84 / 2006
  • The purpose of this study was to explore the acoustic characteristics of sexy voice. We measured acoustic parameters (fundamental frequency, jitter, shimmer, and nasalance) of a sustained vowel produced by 40 actors (20 males and 20 females) and 40 non-actors (20 males and 20 females). Digital audio recordings of the sustained vowel /a/ were made for acoustic analysis using Praat (version 4.1.9) and Nasal View (version 4.5). Twenty voice pathologists participated in the listening experiment and judged the degree of sexiness on a 7-point scale. The results showed that fundamental frequency, shimmer, and nasalance differed significantly between actors and non-actors. The acoustic parameters of sexy voice matched the perceptual aspects reported in a previous study: low fundamental frequency corresponded to low pitch, and high shimmer to a husky voice. On the other hand, the nasalance score did not match that of the previous study: decreased nasalance received a higher score on the sexiness scale judged by the listeners. It would be desirable to study voice quality by analyzing and controlling more acoustic and auditory parameters for practical applications in the future.

Acoustic characteristics of the sustained vowel phonation according to age groups (모음 연장 발성이 보이는 연령대별 음향음성학적 특성 연구)

  • Seo, Yoon-Jeong;Shin, Jiyoung
    • Phonetics and Speech Sciences / v.10 no.4 / pp.67-76 / 2018
  • This study was performed to investigate acoustic characteristics of sustained vowels produced by Seoul Korean speakers. For this study, 309 healthy adults were chosen as participants from the Korean Standard Speech Database. These subjects were divided into five chronological age groups (20s, 30s, 40s, 50s, 60-70s) and two gender groups (male and female). Fundamental frequency (f0), jitter, shimmer, and NHR (noise-to-harmonics ratio) were measured for 8 Korean vowels (/ɑ/, /æ/, /ʌ/, /e/, /o/, /u/, /ɯ/, /i/) using Praat. The results showed that the vowel type significantly affected all acoustic parameters. Gender affected f0, jitter, and NHR significantly. The mean f0 of female speakers was greater than that of male speakers, and the mean jitter and NHR of male speakers were greater than those of female speakers. Moreover, age affected shimmer and NHR significantly; in particular, the shimmer and NHR of elderly speakers were greater than those of young speakers.

Alterations of Mucosal Vibration of True Vocal Folds on Tongue-Tip Trill : Preliminary Study Using the Electroglottography (Trill 발성시 전기성문파 측정검사로 분석한 성대점막 진동의 변화 : 예비연구)

  • 진성민;반재호;김남훈;이경철;권기환;이용배
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.11 no.1 / pp.76-80 / 2000
  • Tongue-tip trill is a sound made by the tongue tip making contact with the alveolar ridge and oscillating rapidly as sound is produced. It is an exercise used by many singers to warm up the voice, and it is used as a method of voice rehabilitation for patients whose vocal folds are scarred postoperatively and for patients who present with a variety of disorders, particularly hypofunction and presbyphonia. We intended to investigate the mucosal vibration of the true vocal folds during tongue-tip trill by electroglottography and to find effective methods of tongue-tip trill. One adult male volunteer participated. Spectrography and electroglottography were performed 15 times, for more than 5 seconds each time, at the same pitch, in three conditions of phonation: sustained /a/ vowel; anterior trill, in which the tongue tip vibrated at the anterior portion of the alveolar ridge just behind the anterior teeth; and posterior trill, in which it vibrated at the palatal crest behind the transverse palatine fold. We measured the first and second formants to determine the position of the tongue indirectly, and calculated the speed quotient and the ratio of closing phase to closed phase. The speed quotients of posterior trill were higher than those of sustained /a/ vowel and anterior trill in 14 of the 15 trials. The ratio of closing phase to closed phase of posterior trill was lower than that of the others in 14 trials. The mucosa of the true vocal folds vibrates more effectively in posterior trill than in sustained /a/ vowel or anterior trill. Therefore, when tongue-tip trill is used as a method of voice rehabilitation, we suggest that posterior trill is better at producing effective mucosal vibration.