Browse > Article
http://dx.doi.org/10.13064/KSSS.2020.12.4.019

An evaluation of Korean students' pronunciation of an English passage by a speech recognition application and two human raters  

Yang, Byunggon (Department of English Education, Pusan National University)
Publication Information
Phonetics and Speech Sciences / v.12, no.4, 2020 , pp. 19-25 More about this Journal
Abstract
This study examined thirty-one Korean students' pronunciation of an English passage using a speech recognition application, Speechnotes, and two Canadian raters' evaluations of their speech according to the International English Language Testing System (IELTS) band criteria to assess the possibility of using the application as a teaching aid for pronunciation education. The results showed that the grand average percentage of correctly recognized words was 77.7%. From the moderate recognition rate, the pronunciation level of the participants was construed as intermediate and higher. The recognition rate varied depending on the composition of the content words and the function words in each given sentence. Frequency counts of unrecognized words by group level and word type revealed the typical pronunciation problems of the participants, including fricatives and nasals. The IELTS bands chosen by the two native raters for the rainbow passage had a moderately high correlation with each other. A moderate correlation was reported between the number of correctly recognized content words and the raters' bands, while an almost a negligible correlation was found between the function words and the raters' bands. From these results, the author concludes that the speech recognition application could constitute a partial aid for diagnosing each individual's or the group's pronunciation problems, but further studies are still needed to match human raters.
Keywords
English pronunciation; evaluation; speech recognition; intelligibility; function word; content word;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Benner, M. L. (2020). Parts of speech. Retrieved from https://webapps.towson.edu/ows/ptsspch.htm
2 House, A. S. (1957). Analog studies of nasal consonants. Journal of Speech and Hearing Disorders, 22(2), 190-204.   DOI
3 IDP Education Canada. (2020). IELTS. Test format. Retrieved from https://ieltscanadatest.com/take-ielts/test-format/
4 Kang, S., & Ahn, H. (2013). An intelligibility-based approach to English pronunciation teaching: Evidence from [r] production. Language Research, 49(3), 631-646.
5 Levis, J. (2011). Assessing speech intelligibility: Experts listen to two students. In J. Levis, & K. LeVelle (Eds.), Proceedings of the 2nd Pronunciation in Second Language Learning and Teaching Conference (pp. 56-69). Ames, IA.
6 Levis, J., & LeVelle, K. (2011). Pronunciation and intelligibility: An overview of the conference. In. J. Levis, & K. LeVelle (Eds.), Proceedings of the 2nd Pronunciation in Second Language Learning and Teaching Conference (pp. 1-6). Ames, IA.
7 Naver. (2020). Papago. Retrieved from https://papago.naver.com/
8 Loukina, A., Lopez, M., Evanini, K., Suendermann-Oeft, D., Ivanov, A., & Zechner, K. (2015, September). Pronunciation accuracy and intelligibility of non-native speech. Proceedings of Interspeech 2015 (pp. 1917-1921). Dresden, Germany.
9 Luce, P., & Pisoni, D. B. (1998). Recognizing spoken words: The neighborhood activation model. Ear & Hearing, 19(1), 1-36.   DOI
10 Miller, N. (2013). Measuring up to speech intelligibility. International Journal of Language & Communication Disorders, 48(6), 601-612.   DOI
11 O'Shea, J. D. (2013). Function word lists. Retrieved from https://semanticsimilarity.files.wordpress.com/2013/08/jim-oshea-fwlist264.pdf
12 Nolan, C. (2012). Fire alarm intelligibility. Paper presented in 2012 CFAA-Annual Ontario Technical Seminar. Retrieved from http://www.cfaa.ca/Files/flash/ontario/ATS2012/FireAlarmIntelligibility.pdf
13 Nordquist, R. (2020). Definition and examples of function words in English. Retrieved from http://thoughtco.com/function-wordgrammar-1690876
14 Ohala, J. (1990). The phonetics and phonology aspects of assimilation. In J. Kingston, & M. E. Beckman (Eds.), Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech (pp. 258-275). Cambridge, UK: Cambridge University Press.
15 R Core Team. (2020). R: A language and environment for statistical computing (version 3.6.2) [Computer software]. Vienna, Austria: R Foundation for Statistical Computing, Retrieved from https://www.R-project.org/
16 Rochester Institute of Technology. (2020). Rainbow passage. Retrieved from https://www.rit.edu/ntid/slpros/media/rainbow
17 Smith, L. E., & Nelson, C. L. (2006). World Englishes and issues of intelligibility. In B. B. Kachru, Y. Kachru, & C. L. Nelson (Eds.), The handbook of World Englishes (pp. 428-445). Malden, MA: Blackwell.
18 Yang, B. (2017). Google speech recognition of an English paragraph produced by college students in clear or casual speech styles. Phonetics and Speech Sciences, 9(4), 43-50.   DOI
19 Vitevitch, M., & Luce, P. (2004). A web-based interface to calculate phonotactic probability for words and nonwords in English. Behavior Research Methods, Instruments, & Computers, 36(3), 481-487.   DOI
20 Yang, B. (2005). A study of English consonants identified by college students. Speech Sciences, 12(3), 139-151.
21 Yang, B., & Whalen, D. (2015). Perception and production of English vowels by American males and females. Australian Journal of Linguistics, 35(2), 121-141.   DOI