Browse > Article
http://dx.doi.org/10.7236/JIIBC.2017.17.1.199

Efficient Compensation of Spectral Tilt for Speech Recognition in Noisy Environment  

Cho, Jungho (Dept. of Digital Electronics, Dongseoul University)
Publication Information
The Journal of the Institute of Internet, Broadcasting and Communication / v.17, no.1, 2017 , pp. 199-206 More about this Journal
Abstract
Environmental noise can degrade the performance of speech recognition system. This paper presents a procedure for performing cepstrum based feature compensation to make recognition system robust to noise. The approach is based on direct compensation of spectral tilt to remove effects of additive noise. The noise compensation scheme operates in the cepstral domain by means of calculating spectral tilt of the log power spectrum. Spectral compensation is applied in combination with SNR-dependent cepstral mean compensation. Experimental results, in the presence of white Gaussian noise, subway noise and car noise, show that the proposed compensation method achieves substantial improvements in recognition accuracy at various SNR's.
Keywords
speech recognition; spectral tilt; cepstral mean; spectral compensation;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 P. J. Moreno, Speech Recognition in Noisy Environments, Ph. D, Dissertation, Carnegie Mellon University, 1996.
2 H. Hermansky, "RASTA processing of speech," IEEE Trans. Speech Audio processing, vol. 2, pp. 578-589, Oct. 1994. DOI: https://doi.org/10.1109/89.326616   DOI
3 M. J. Gales, S. Young, "Robust speech recognition using parallel model combination," IEEE Trans. Speech Audio processing, vol. 4. pp. 352-359, Sep. 1996.   DOI
4 J. Y. Ahn, Y. S. Kim, S. H. Kim, K. I. Hur, "A Study on Voice Recognition Pattern matching level for vehicle ECU control," The Journal of The Institute of Internet, Broadcasting and Communication (JIIBC), Vol. 10, No. 1, pp.75-80, Feb. 2010.
5 S. V. Vaseghi and B. P. Milner, "Noise compensation methods for hidden Markov model speech recognition in adverse environments," IEEE Trans. Speech and Audio Processing, vol. 5, No. 1, pp. 11-21, Jan. 1997.   DOI
6 D. C. Popescu and I. Zeljkovic, "Kalman filtering of colored noise for speech enhancement," ICSLP'96, Philadelphia, vol. 1, pp.426-429, Oct. 1996.
7 S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
8 D. Naik, "Pole-filtered cepstral mean subtraction," ICSLP'95, Detroit, vol. 1, pp. 157-160, May, 1995.
9 L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, 1978.
10 J. Deller, Jr, J. Proakis and J. Hansen, Discrete-Time Processing of Speech Signals, Macmillan Publishing Co, New York, 1993.
11 V. Goncharoff, E. VonColln, and R. Morris, "Efficient calculation of spectral tilt from various LPC parameters," Proc. IASTED, pp. 60-63, Nov. 1995.
12 A. Oppenheim and D. Johnson, "Discrete representation of signals," Proc. of IEEE, vol. 60, no. 6, pp. 681-691, June, 1972.   DOI
13 P. A. Regalia, S. K. Mitra and P. P. Vaidyanathan, "The digital all-pass filter: A versatile signal processing building block," Proc. of IEEE, vol. 76, no. 1, pp. 19-37, Jan. 1988.   DOI