Browse > Article
http://dx.doi.org/10.13064/KSSS.2014.6.2.047

Spectral and Cepstral Analyses of Esophageal Speakers  

Shim, Hee-Jeong (한림대학교 대학원 언어병리청각학과)
Jang, Hyo-Ryung (한림대학교 대학원 언어병리청각학과)
Shin, Hee-Baek (한림대학교 대학원 언어병리청각학과)
Ko, Do-Heung (한림대학교 언어청각학부)
Publication Information
Phonetics and Speech Sciences / v.6, no.2, 2014 , pp. 47-54 More about this Journal
Abstract
The purpose of this study was to analyze spectral versus cepstral measurements in esophageal speakers. The comparison between the measurements in thirteen male esophageal speakers was compared with the control group of thirteen normal speakers using the sustained vowel /a/. The main results can be summarized as below: (a) the CPP and L/H ratio of the esophageal group were significantly lower than those of the control group (b) the CPP was significantly correlated with the spectral parameters such as jitter, shimmer, NHR and VTI, and (c) the ROC analysis showed that the threshold of 10.25dB for the CPP achieved a good classification for esophageal speakers, with 100% perfect sensitivity and specificity. Thus, it was known that cepstral-based acoustic measures such as CPP, may be more reliable predictors than other spectral-based acoustic measures such as jitter and shimmer. And it was found that cepstral-based acoustic measures were effective in distinguishing esophageal voice quality from normal voice quality. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation with laryngectomees.
Keywords
total laryngectomy; voice quality; esophageal speakers; cepstral analysis; CPP; L/H ratio; mean CPP Fo; ADSV;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Blood, G. W. (1984). Fundamental frequency and intensity measurements in laryngeal and alaryngeal speakers. Journal of Communication Disorders, 17, 319-324.   DOI
2 Stajner-Katusic, S., Horga, D., Musura, M., & Globlek, D. (2006). Voice and speech after laryngectomy. Clinical Linguistics & Phonetics, 20(2/3), 195-203.   DOI   ScienceOn
3 Maccallum, J., Cai, L., Zhou, L., Zhang, Y., & Jiang, J. (2007). Acoustic analysis of aperiodic voice: perturbation and nonlinear dynamic properties in esophageal phonation. Journal of Voice, 23(3), 283-290.
4 Moers, C., Mobius, B., Rosanowski, F., Noth, E., Eysholdt, U., & Haderlein, T. (2012). Vowel- and text-based cepstral analysis of chronic hoarseness. Journal of Voice, 26(4), 416-424.   DOI   ScienceOn
5 Noll, A. M. (1967). Cepstrum pitch determination. Journal of the Acoustic Society of America, 41, 293-309.   DOI
6 Kumar, B., Bhat, J., & Prasad, N. (2009). Cepstral analysis of voice in persons with vocal nudules. Journal of Voice, 24(6), 651-653.
7 Watts, C. R., & Awan, S. N. (2011). Use of spectral/cepstral analyses for differentiating normal from hypofunctional voices in sustained vowel and continuous speech contexts. Journal of Speech, Language, and Hearing Research, 54(6), 1525-1537.   DOI
8 Lowell, S. Y., Kelley, R. T., Awan, S. N., Colton, R. H., & Chan, N. H. (2012). Spectral- and cepstral-based acoustic features of dysphonic, strained voice quality. Annals of Otology, Rhinology and Laryngology, 121(8), 539-548.   DOI
9 Heman-Ackah, T., Michael, D., & Goding, G. (2002). The relationship between cepstral peak prominence and selected parameters of dysphonia, Journal of Voice, 16(1), 20-27.   DOI   ScienceOn
10 Awan, S., Roy, N., & Dromey, C. (2009). Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model. Clinical linguistics & phonetics, 23(11), 825-841.   DOI
11 Hillenbrand, J., Cleveland, R., & Erickson, R. (1994). Acoustic Correlates of Breathy Vocal Quality. Journal of Speech, Language, and Hearing Research, 37, 769-778.   DOI
12 Seo, I., & Seong, C. (2013). Voice quality of dysarthric speakers in connected speech. Phonetics and Speech Sciences, 5(4), 33-41. (서인효, 성철재 (2013). 연결발화에서 마비말화자의 음질 특성. 말소리와 음성과학, 5(4), 33-41.)   과학기술학회마을   DOI
13 Wolfe, V, Fitch J, & Cornell R. (1995). Acoustic prediction of severity in commonly occurring voice problems. Journal of Speech, Language & Hearing Research, 38, 273-279.   DOI
14 Shim, H., Jang, H., & Ko, D. (2013). Speech Intelligibility and Vowel Space Characteristics of Alaryngeal Speech. Phonetics and Speech Sciences, 5(4), 17-24. (심희정, 장효령, 고도흥 (2013). 무후두음성의 말 명료도와 모음 공간 특성. 말소리와 음성과학, 5(4), 17-24.)   과학기술학회마을   DOI
15 Timo, L. (2009). Long term average spectrum in screening of voice quality in speech: untrained male university students, Journal of Voice, Vol. 23, No. 6, 671-676.   DOI   ScienceOn
16 Kent, R. D., & Read, C. (2002). Acoustic Analysis of speech(2nd ed). Singular Thomson Learning.
17 Hillenbrand, J., & Houde, R. (1996). Acoustic correlates of breathy vocal quality dysphonic voices and continuous speech. Journal of Speech, Language, and Hearing Research, 39, 311-321.   DOI
18 Lee, M. (2011). A Study of Acoustic Measurement in Connected Speech with Dysphonia. Phonetics and Speech Sciences, 3(4), 109-115. (이명순 (2011). 음성장애 연속구어의 음향학적 분석. 말소리와 음성과학, 3(4), 109-115.)   과학기술학회마을
19 Choi, S. H., Choi, H. S., Kim, H. S., Lee, S. E., & Pyo, H. Y. (2003). Analysis of acoustic characteristics of vowel and consonants production study on speech proficiency in esophageal speech. Speech Sciences, 10(3), 7-27. (최성희, 최홍식, 김한수, 임성은, 이성은, 표화영 (2003). 식도발성의 숙련 정도에 따른 모음의 음향학적 특징과 자음 산출에 대한 연구. 음성과학, 10(3), 7-27.)   과학기술학회마을
20 Awan, S., & Roy, N. (2009). Outcomes measurement in voice disorders: application of an acoustic index of dysphonia severity. Journal of Speech, Language and Hearing Research, 52(2), 482-499.   DOI
21 Most, T., Tobin, Y., & Mimran, R. C. (2000). Acoustical and perceptual characteristics of esophageal tracheoesophageal speech production. Journal of Communication Disorder, 33(2), 165-181.   DOI   ScienceOn
22 Kang, S. K. Aerodynamic analysis of esophageal speech of the laryngectomees. Ph.D. dissertation Taegu University. (강수균 (2002). 후두적출자의 식도발성시 기류역학적 연구, 대구대학교 박사학위 논문.)
23 Pyo, H., Choi, H., Lim, S., & Choi, S. (1999). The comparison of the acoustic and aerodynamic characteristics of provox voice and esophageal voice produced by the same laryngectomee. Speech Sciences, 5(1), 121-139. (표화영, 최홍식, 임성은, 최성희. (1999). 동일 후적자가 산출하는 기관식도 발성과 식도 발성에 대한 음향학적 및 공기역학적 특성 비교. 음성과학, 5(1), 121-139.)   과학기술학회마을
24 Debruyne, F., Delaere, P., Wouters, J., & Uwents, P. (1994). Acoustic analysis of tracheo-oesophageal versus oesophageal speech. Journal of Laryngology and Otology, 108, 325-328.
25 Website, http://www.cancer.go.kr/, September 6, 2013.
26 Jang, H. R., Shim, H. J., & Ko, D. H. (2013). Acoustic Characteristics of Patients with Total Laryngectomees via Voice Rehabilitation Techniques. Phonetics and Speech Sciences, 5(4), 25-32. (장효령, 심희정, 고도흥 (2013). 후두적출술 환자의 발성법에 따른 음향학적 특성. 말소리와 음성과학, 5(4), 25-32.)   과학기술학회마을   DOI