Browse > Article
http://dx.doi.org/10.13064/KSSS.2022.14.2.011

Speech recognition rates and acoustic analyses of English vowels produced by Korean students  

Yang, Byunggon (Department of English Education, Pusan National University)
Publication Information
Phonetics and Speech Sciences / v.14, no.2, 2022 , pp. 11-17 More about this Journal
Abstract
English vowels play an important role in verbal communication. However, Korean students tend to experience difficulty pronouncing a certain set of vowels despite extensive education in English. The aim of this study is to apply speech recognition software to evaluate Korean students' pronunciation of English vowels in minimal pair words and then to examine acoustic characteristics of the pairs in order to check their pronunciation problems. Thirty female Korean college students participated in the recording. Speech recognition rates were obtained to examine which English vowels were correctly pronounced. To compare and verify the recognition results, such acoustic analyses as the first and second formant trajectories and durations were also collected using Praat. The results showed an overall recognition rate of 54.7%. Some students incorrectly switched the tense and lax counterparts and produced the same vowel sounds for qualitatively different English vowels. From the acoustic analyses of the vowel formant trajectories, some of these vowel pairs were almost overlapped or exhibited slight acoustic differences at the majority of the measurement points. On the other hand, statistical analyses on the first formant trajectories of the three vowel pairs revealed significant differences throughout the measurement points, a finding that requires further investigation. Durational comparisons revealed a consistent pattern among the vowel pairs. The author concludes that speech recognition and analysis software can be useful to diagnose pronunciation problems of English-language learners.
Keywords
speech recognition; formant trajectory; duration; English vowel; Korean students;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Yang, B. (2022). Measuring vowels. In R. A. Knight, & J. Setter (Eds.), The Cambridge handbook of phonetics (pp. 261-284). Cambridge, UK: Cambridge University Press.
2 Yang, B., & Whalen, D. H. (2015). Perception and production of English vowels by American males and females. Australian Journal of Linguistics, 35(2), 121-141.   DOI
3 Yang, B. (2010). Formant trajectories of English high tense and lax vowels produced by Korean and American speakers. Korean Journal of Linguistics, 35(2), 407-423.   DOI
4 Delattre, P. C., Liberman, A. M., & Cooper, F. S. (1955). Acoustic loci and transitional cues for consonants. Journal of the Acoustical Society of America, 27(4), 769-773.   DOI
5 De Decker, P. M., & Nycz, J. R. (2012). Are tense [ae]s really tense? The mapping between articulation and acoustics. Lingua, 122(7), 810-821.   DOI
6 Lee, S., & Rhee, S. C. (2019). The relationship between vowel production and proficiency levels in L2 English produced by Korean EFL learners. Phonetics and Speech Sciences, 11(2), 1-13.   DOI
7 van Rij, J. (2015). Overview of GAMM analysis of time series data. Retrieved from https://jacolienvanrij.com/Tutorials/GAMM.html
8 Boersma, P., & Weenink, D. (2021). Praat: Doing phonetics by computer (version 6.2) [Computer program]. Retrieved from http://www.praat.org/
9 Davenport, M., & Hannahs, S. J. (1998). Introducing phonetics and phonology. London, UK: Hodder Arnold.
10 Kennedy, R. (2022). The phonetics/phonology interface. In R. A. Knight, & J. Setter (Eds.), The Cambridge handbook of phonetics (pp. 682-706). Cambridge, UK: Cambridge University Press.
11 Yang, B. (2009). Formant trajectories of English vowels produced by American males. Phonetics and Speech Sciences, 1(3), 65-72.
12 Yang, B. (1990). Development of vowel normalization procedures: English and Korean (Doctoral dissertation). The University of Texas, Austin, TX.
13 Yang, B. (1996). A comparative study of American English and Korean vowels produced by male and female speakers. Journal of Phonetics, 24(2), 245-261.   DOI
14 Yang, B. (2006). Discrimination of synthesized English vowels by American and Korean listeners. Speech Sciences, 13(1), 7-27.
15 Yang, B. (2017). Google speech recognition of an English paragraph produced by college students in clear or casual speech styles. Phonetics and Speech Sciences, 9(4), 43-50.   DOI
16 Yang, B. (2019). A comparison of normalized formant trajectories of English vowels produced by American men and women. Phonetics and Speech Sciences, 11(1), 1-8.   DOI
17 Yang, B. (2020). An evaluation of Korean students' pronunciation of an English passage by a speech recognition application and two human raters. Phonetics and Speech Sciences, 12(4), 19-25.   DOI
18 R Core Team. (2021). R: A language and environment for statistical computing (version 4.1.0) [Computer software]. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/
19 Millett, P. (2021). Accuracy of speech-to-text captioning for students who are deaf or hard of hearing. Journal of Educational, Pediatric & (Re)Habilitative Audiology, 25, 1-13.
20 Nearey, T. (2006). English vowels. Linguistics 205 course notes of practical phonetics. Retrived from https://sites.ualberta.ca/~tnearey/Ling205/Week4/EnglishVowelsNarrow4Up.pdf
21 Soskuthy, M. (2017). Generalised additive mixed models for dynamic analysis in linguistics: A practical introduction. Retrieved from https://arxiv.org/abs/1703.05339v1
22 Weckwerth, J. (2022). Vowels. In R. A. Knight, & J. Setter (Eds.), The Cambridge handbook of phonetics (pp. 40-64). Cambridge, UK: Cambridge University Press.
23 Wood, S. N. (2006). Generalised additive models: An introduction with R. Boca Raton, FL: CRC Press.
24 Pickett, J. M. (1980). The sounds of speech communication: A primer of acoustic phonetics and speech perception (Perspectives in Audiology Series). Baltimore, MD: University Park Press.