Intra-and Inter-frame Features for Automatic Speech Recognition |
Lee, Sung Joo
(SW.Content Research Laboratory, ETRI)
Kang, Byung Ok (SW.Content Research Laboratory, ETRI) Chung, Hoon (SW.Content Research Laboratory, ETRI) Lee, Yunkeun (SW.Content Research Laboratory, ETRI) |
1 | B. Milner, "A Comparison of Front-End Configurations for Robust Speech Recognition," Proc. ICASSP, Orlando, FL, USA, vol. 1, May 13-17, 2002, pp. 797-800. |
2 | S.J. Lee et al., "Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition," ETRI J., vol. 32, no. 5, Oct. 2010, pp. 801-809. DOI ScienceOn |
3 | S. Furui, "Speaker-Independent Isolated Word Recognition Using Dynamic Features of Speech Spectrum," IEEE Trans. Acoust., Speech Signal Process., vol. 34, no. 1, Feb. 1986, pp. 52-59. DOI |
4 | S. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," IEEE Trans. Acoust. Speech Signal Process., vol. 28, no. 4, Aug. 1980, pp. 357-366. DOI |
5 | H. Hermansky, "Perceptual Linear Prediction (PLP) Analysis of Speech," J. Acoust. Soc. America, vol. 87, no. 4, Apr. 1990, pp. 1738-1752. DOI |
6 | W.H. Abdulla, "Auditory Based Feature Vectors for Speech Recognition Systems," Advances in Communications And Software Technologies, WSEAS ed., Athens, Greece: WSEAS Press, 2002, pp. 231-236. |
7 | D.-S. Kim, S.-Y. Lee, and R.M. Kil, "Auditory Processing of Speech Signals for Robust Speech Recognition in Real-World Noisy Environments," IEEE Trans. Speech Audio Process., vol. 7, no. 1, Jan. 1999, pp. 55-69. DOI ScienceOn |
8 | S. Young et al., The HTK Book (for HTK version 3.4), Cambridge, England: Cambridge University Engineering Department, 2006. |