Browse > Article
http://dx.doi.org/10.13067/JKIECS.2021.16.3.519

A Study on the Recognition of English Pronunciation based on Artificial Intelligence  

Lee, Cheol-Seung (Dept. of AI Convergence, Kwangju women's University)
Baek, Hye-Jin (Dept. of Liberal Arts, Kwangju women's University)
Publication Information
The Journal of the Korea institute of electronic communication sciences / v.16, no.3, 2021 , pp. 519-524 More about this Journal
Abstract
Recently, the fourth industrial revolution has become an area of interest to many countries, mainly in major advanced countries. Artificial intelligence technology, the core technology of the fourth industrial revolution, is developing in a form of convergence in various fields and has a lot of influence on the edutech field to change education innovatively. This paper builds an experimental environment using the DTW speech recognition algorithm and deep learning on various native and non-native data. Furthermore, through comparisons with CNN algorithms, we study non-native speakers to correct them with similar pronunciation to native speakers by measuring the similarity of English pronunciation.
Keywords
AI; DTW; CNN; English Pronunciation; English Language Training;
Citations & Related Records
연도 인용수 순위
  • Reference
1 H. Jeon, H. Chung, B. Kang, and Y. Lee, "Survey of Recent Research in Education based on Artificial Intelligence," J. of the Korea Institute of Electronics and Telecommunications Trends (ETRI), vol. 36, no. 1, Feb. 2021, pp. 71-80.
2 T. Kim, M. Ryu, and S. Han, "Framework Research for AI Education for Elementary and Middle School Students," J. of The Korean Association of Artificial Intelligence Education, vol. 1, no. 1, 2020, pp. 31-42.
3 E. Kim, and W. Son, "STT, Scheduling, Automatic subtitle generation, STT post-processing, Sentence analysis," J. of the Korea Institute of Electronic Communication Sciences, vol. 16, no. 1, Feb. 2021, pp. 81-88.   DOI
4 Y. Jeong, and G. Choi, "Efficient iris recognition using deep-learning convolution neural network(CNN)," J. of the Korean Data & Information Science Society, vol. 30, no. 2, June 2019, pp. 349-363.   DOI
5 F. Nazir, M. N. Majeed, M. A. Ghazanfar, and M. Maqsood, "Mispronunciation Detection Using Deep Convolutional Neural Network Features and Transfer Learning-Based Model for Arabic Phonemes," J. of the IEEE Access, vol. 7, Apr. 2019, pp. 52589-52608.   DOI
6 S. Kim, and M. Park, " A Study on Time-series Clustering Analysis based on Dynamic Time Warping," J. of the Korean Data Analysis Society, vol. 20, no. 5, 2018, pp. 2319-2332.   DOI
7 K. Heo, and D. Lim, "Noise reduction using patch-based CNN in images," J. of the Korean Data Analysis Society, vol. 20, no. 5, 2018, pp. 2319-2332.   DOI
8 M. Cho, "A Study on the History, Classification and Development Direction of Artificial Intelligence," J. of the Korea Institute of Electronic Communication Sciences, vol. 16, no. 2, Apr. 2021, pp. 307-312.   DOI
9 W. K. Leung, X. Liu, and H. Meng, "CNN-RNN-CTC Based End-to-end Mispronunciation Detection and Diagnosis," In Proc. IEEE Intf. Conf. Acoustics, Speech and Signal (ICASSP), Brighton, UK, May 2019.
10 B. Kang and O. Kwon, "DNN-based acoustic modeling for speech recognition fo native and foreign speakers," J. of the Phonetics and Speech Sciences, vol. 9, no. 2. 2017, pp. 95-101.
11 J. Kim, "The Development and Application of the Web-based English Pronunciation Learning Material Focused on Suprasegmental Features," J. of the Ewha Education, vol. 13, 2003, pp. 443-457..