Browse > Article
http://dx.doi.org/10.5626/KTCP.2016.22.11.591

Speech Recognition of the Korean Vowel 'ㅜ' Based on Time Domain Bulk Indicators  

Lee, Jae Won (Sungshin Women's Univ.)
Publication Information
KIISE Transactions on Computing Practices / v.22, no.11, 2016 , pp. 591-600 More about this Journal
Abstract
Computing technologies are increasingly applied to most casual human environment networks, as computing technologies are further developed. In addition, the rapidly increasing interest in IoT has led to the wide acceptance of speech recognition as a means of HCI. In this study, we present a novel method for recognizing the Korean vowel 'ㅜ', as a part of a phoneme based Korean speech recognition system. The proposed method involves analyses of bulk indicators calculated in the time domain instead of analysis in the frequency domain, with consequent reduction in the computational cost. Four elementary algorithms for detecting typical waveform patterns of 'ㅜ' using bulk indicators are presented and combined to make final decisions. The experimental results show that the proposed method can achieve 90.1% recognition accuracy, and recognition speed of 0.68 msec per syllable.
Keywords
speech recognition; Korean phoneme; vowel; waveform; bulk indicator;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 Y. Y. Seo, J. D. Song, J. H. Lee, "Phoneme Segmentation in Consideration of Speech feature in Korean Speech Recognition," Journal of Korean Society for Internet Information, Vol. 2, No. 1, pp. 31-38, 2001. (in Korean)
2 M. J. Kim and C. H. Kweon, "An Automatic Segmentation System Based on HMM and Correction Algorithm," Speech Sciences, Vol. 9, No. 4, pp. 265-274, 2002. (in Korean)
3 J. W. Lee, "Speech Recognition of the Korean Vowel 'ㅐ' Based on Time Domain Sequence Patterns," KIISE Transactions on Computing Practices, Vol. 21, No. 11, pp. 713-720, 2015. (in Korean)   DOI
4 Y. K. Lee, "Speech Interface Technology and Service Trend under the Smart Phone Environment," Information & Communications Magazine, Vol. 29, No. 4, pp. 3-9, 2012. (in Korean)
5 F. Brugnara et al., "Automatic segmentation and labeling of speech based on hidden Markov model," Speech Communication, Vol. 12, pp. 357-370, 1993.   DOI
6 J. ZhNF, H. Yu, N. Ma, Z. Li, "The Phoneme Automatic Segmentation Algorithms Study of Tibetan Lhasa Words Continuous Speech Stream," Proc. of the 2nd International Conference On Systems Engineering and Modeling, pp. 578-581, 2013.
7 G. Kiss, D. Sztaho, K. Vicsi, "Language independent automatic speech segmentation into phoneme-like units on the base of acoustic distinctive features," 2013 IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom), pp. 579-582, 2013.
8 R. A. Brietion, B. M. G. Cheetham, M. C. Hall, "A comparison of distance measures for speech segmentation in variable frame rate speech vocoding," Proc. of the IEEE Colloquium, pp. 6/1-6/5, 1990.
9 KOCCA, Culture Technology(CT) in-depth Report, Nov. 2011.
10 D. K. Kim, C. G. Jeong, and H. Jeong, "Hierarchy and Modularity in Time-Delay Neural Networks for Korean Phoneme Recognition using HMM," IEEK, Vol. 16, No. 2, pp. 81-84, 1994.
11 H. Jung, "Korean Speech Recognition Using Neural Networks," Korean Institute of Information Scientists and Engineers, pp. 63-82, 1993
12 G. Hinton, L. Deng, D. Yu, G. E. Dahl, "Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups," Signal Processing Magazine, IEEE, Vol. 29, No. 6, pp. 82-97, 2012.
13 J. H. Lee, J. W. Lee, and J. Lee, "Korean Phonemes 'ㅅ', 'ㅈ', 'ㅊ' Recognition based on Sign Distribution Volatility," Communications of the Korean Institute of Information Scientists and Engineers, Vol. 19, pp. 377-382, 2013. (in Korean)
14 J. W. Lee, "Speech Recognition of Korean Phonemes 'ㅅ', 'ㅈ', 'ㅊ' based on Volatility and Turning Points," KIISE Transactions on Computing Practices, Vol. 20, No. 11, pp. 579-585, 2014. (in Korean)   DOI
15 S. K. Choi, J. W. Lee, and J. Lee, "Korean Vowel 'ㅏ' Recognition based on Wave Sequence Detection," Proc. of the Digital Contents Society Joint Conference 2013, Vol. 14, pp. 577-579, 2013. (in Korean)
16 W. Roh and J. Lee, "Implementation of Korean Vowel 'ㅏ' Recognition based on Common Feature Extraction of Waveform Sequence," KIISE Transactions on Computing Practices, Vol. 20, No. 11, pp. 567-572, 2014. (in Korean)   DOI
17 Y. Lee, "Phoneme Segmentation Using Phoneme Combination and Formant Scaling in Korean," Master Thesis, Department of Computer Engineering, Inha University, Incheon, Korea, 2003. (in Korean)