Browse > Article

Voice Recognition Based on Adaptive MFCC and Neural Network  

Bae, Hyun-Soo (영남대학교 전자공학과)
Lee, Suk-Gyu (영남대학교 전자공학과)
Publication Information
Abstract
In this paper, we propose an enhanced voice recognition algorithm using adaptive MFCC(Mel Frequency Cepstral Coefficients) and neural network. Though it is very important to extract voice data from the raw data to enhance the voice recognition ratio, conventional algorithms are subject to deteriorating voice data when they eliminate noise within special frequency band. Differently from the conventional MFCC, the proposed algorithm imposed bigger weights to some specified frequency regions and unoverlapped filterbank to enhance the recognition ratio without deteriorating voice data. In simulation results, the proposed algorithm shows better performance comparing with MFCC since it is robust to variation of the environment.
Keywords
Speech; Recognition; Noise; MFCC; Filterbank; Neural network;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Chang-Young Lee, "Improvements on MFCC by elaboration of the filter banks and windows", Speech Sciences, Vol. 14, No.4, pp. 131-144, 2007.
2 F.F. Li, T.J. Cox, "A Neural network model for speech intelligibility quantification", Applied soft computing, Vol. 7, No.1, pp. 145-155, 2007.   DOI   ScienceOn
3 M.H. Kostepen, G. Kurnar, "Speech recognition using back-propagation neural networks", IEEE Region 10 International Conference on EC3-Energy, Computer, Communication and Control System, Vol. 2, pp. 144-148, 1991.
4 Y.Ephraim. "Statistical model based speech enhancement systems", Proc. IEEE, Vol. 80, No.10, pp. 1524-1555, 1992.
5 J.B. Allen, "How do humans process and recognize speech?", IEEE Transactions on Speech and Audio Processing, 2(4), 1994.
6 T. Zeppenfeld and A.H. Waibel, "A hybrid neural network, dynamic programming word spotter", In Proceedings of IEEE International Conference on Acoustics Speech and Signal Processing, Vol. 2, pp. 77-80, 1992.
7 S. Boll "A spectral subtraction algorithm of acoustic noise in speech", IEEE International Conference on ICASSP '79, Vol. 4, pp. 200-203, 1979.
8 Chul-Ho Park, Keun-Sung Bea. "Performance analysis of noisy speech recognition depending on parameters for noise and signal power estimation in MMSE-STSA based speech engancement", 말소리, No.57, pp. 153-164, 2006.
9 Young-Chu Songl "Effective noise suppression in edge region using modified wiener filter", The transactions of the Korean Institute of Electrical Engineers D/D 2003, Vol. 52, No.3, pp. 173-180, 2003.
10 B. Widrow. et al. "Adaptive noise cancelling, principles and applications", Proc. Of IEEE 63(12), pp, 1692-1716, 1975.   DOI
11 Chul-Hee Han, Hong-Goo Kang, Hwang Young-Soo, Youn Dea-Hee "A microphone array beamformer for the performance enhancement of speech recognizer in car", The journal of the acoustical society of Korea, Vol. 24, No.7, pp. 423-430, 2005.   과학기술학회마을
12 Won-Ho Shin, Tae-Young Yang, Weon-Goo Kim, Dea-Hee Youn, Young- Joo Seo, "Speech recognition using noise robust features and spectral subtraction", The Journal of the Acoustical Society of Korea, Vol. 15, No.5, pp. 38-43, 1969.
13 Miki Kazuhiko, Nishiura Takanobu, Nakamura Satoshi "Speech recognition based on HMM decomposition and composition method with a microphone array in noisy reverberant environments", Electronics & Communications in Japan. Part 2, Electronic, Vol. 85, No.9, pp. 13-22.
14 정용주 "A Study on noisy speech recognition using discriminative training for PMC algorithm", The Journal of the Acoustical Society of Korea, Vol. 19, No.2, pp. 83-89, 2000.
15 Wang F-M, Kabal P, Ramachandran R.P, O'Shaughnessy D, "Frequency domain adaptive postfiltering for enhancement of noisy speech", Speech Communication, Vol. 12 No.1, pp. 41-56, 1993.   DOI   ScienceOn
16 Sun-Mi Kang, "잡음 환경하에서의 음성인식에 관한 연구", Journal of Institute of Industrial Technology, pp. 301-318, 1997.
17 B.H. Nitsch, "A Frequency-selective stepfactor controlfor an adaptive filter algorithm working in the frequency domain", Signal processing, the official publication of the European Association for Signal Processing(EURASIP), Vol. 80. No.9, pp. 1733-1745, 2000.
18 Q.C. Liu, B. Champagne, D.K.C. Ho, "Simple design of oversampled uniform DFT filter banks with applications to subband acoustic echo cancellation", Signal processing, Vol. 80, No.5, pp. 831-847, 2000.   DOI   ScienceOn
19 Li Shang, Hashimoto Hideo , Wu Xiaohua, Takahashi, Nobuaki, Takebe, Tauyoshi, "Adaptive IIR bandpass decimation filter for single sinusoid detection", Electrinics and communications in Japan. Part 3, Fundamental electronic science, Vol. 83, No.7, pp. 91-101, 2000.   DOI   ScienceOn
20 L.H. Tey, P.L. So, Y.C Chu, "Adaptive neural network control of active filters", Electric Powersystems Research, Vol. 74, No.1, pp. 37-56, 2005.   DOI   ScienceOn