http://dx.doi.org/10.9708/jksci.2019.24.02.009

A Computer-Assisted Pronunciation Training System for Correcting Pronunciation of Adjacent Phonemes  

Lee, Jaesung (School of Computer Science and Engineering, Chung-Ang University)
Abstract
A Computer-Assisted Pronunciation Training (CAPT) system is considered a useful tool for students who have received elementary-level English pronunciation education, especially those who find it difficult to correct their pronunciation in front of others or who cannot receive face-to-face training. A conventional CAPT system shows a word to the user, the user pronounces it, and the system then provides phoneme or audio feedback based on that pronunciation. In this paper, we propose a CAPT system that lets the user practice how pronunciation varies with the positions of adjacent phonemes. To achieve this, the proposed system recommends a series of words chosen by focusing on adjacent phonemes, for simplicity and clarity. Experimental results showed that word recommendation considering adjacent phonemes improves pronunciation accuracy.
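The idea of recommending words by adjacent phonemes can be illustrated with a minimal sketch. It is not the system described in the paper: the toy lexicon, the ARPAbet-like symbols, and the function names `contexts` and `recommend` are all assumptions made for illustration. For a target phoneme, the sketch groups words by the phonemes immediately before and after the target and returns one practice word per distinct context.

```python
from collections import defaultdict

# Hypothetical toy lexicon (assumption): word -> phoneme sequence.
LEXICON = {
    "see": ["S", "IY"],
    "sue": ["S", "UW"],
    "sat": ["S", "AE", "T"],
    "miss": ["M", "IH", "S"],
    "bus": ["B", "AH", "S"],
    "tea": ["T", "IY"],
}


def contexts(phonemes, target):
    """Yield (previous, next) neighbours of each occurrence of target.

    None marks a word boundary.
    """
    for i, phoneme in enumerate(phonemes):
        if phoneme == target:
            prev = phonemes[i - 1] if i > 0 else None
            nxt = phonemes[i + 1] if i + 1 < len(phonemes) else None
            yield (prev, nxt)


def recommend(target, lexicon):
    """Return one word per distinct adjacent-phoneme context of target."""
    by_context = defaultdict(list)
    for word in sorted(lexicon):  # sorted for a deterministic choice
        for ctx in contexts(lexicon[word], target):
            by_context[ctx].append(word)
    # Keep the alphabetically first word for each context.
    return {ctx: words[0] for ctx, words in by_context.items()}


if __name__ == "__main__":
    for ctx, word in recommend("S", LEXICON).items():
        print(ctx, word)
```

In this sketch a learner who struggles with "S" would be offered words that place it before different vowels and at the end of a word, so that each adjacent-phoneme context is practiced at least once. A real system would draw on a full pronunciation dictionary and on the learner's recognized errors rather than a hand-written lexicon.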
Keywords
Educational Data Mining; Computer-Assisted Pronunciation Training; Automatic Feedback; Word Recommendation; Adjacent Phonemes