[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7236/JIIBC.2017.17.1.199

Efficient Compensation of Spectral Tilt for Speech Recognition in Noisy Environment

Cho, Jungho (Dept. of Digital Electronics, Dongseoul University)

Publication Information

The Journal of the Institute of Internet, Broadcasting and Communication / v.17, no.1, 2017 , pp. 199-206 More about this Journal

Abstract

Environmental noise can degrade the performance of speech recognition system. This paper presents a procedure for performing cepstrum based feature compensation to make recognition system robust to noise. The approach is based on direct compensation of spectral tilt to remove effects of additive noise. The noise compensation scheme operates in the cepstral domain by means of calculating spectral tilt of the log power spectrum. Spectral compensation is applied in combination with SNR-dependent cepstral mean compensation. Experimental results, in the presence of white Gaussian noise, subway noise and car noise, show that the proposed compensation method achieves substantial improvements in recognition accuracy at various SNR's.

Keywords

speech recognition; spectral tilt; cepstral mean; spectral compensation;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	P. J. Moreno, Speech Recognition in Noisy Environments, Ph. D, Dissertation, Carnegie Mellon University, 1996.
2	H. Hermansky, "RASTA processing of speech," IEEE Trans. Speech Audio processing, vol. 2, pp. 578-589, Oct. 1994. DOI: https://doi.org/10.1109/89.326616 DOI
3	M. J. Gales, S. Young, "Robust speech recognition using parallel model combination," IEEE Trans. Speech Audio processing, vol. 4. pp. 352-359, Sep. 1996. DOI
4	J. Y. Ahn, Y. S. Kim, S. H. Kim, K. I. Hur, "A Study on Voice Recognition Pattern matching level for vehicle ECU control," The Journal of The Institute of Internet, Broadcasting and Communication (JIIBC), Vol. 10, No. 1, pp.75-80, Feb. 2010.
5	S. V. Vaseghi and B. P. Milner, "Noise compensation methods for hidden Markov model speech recognition in adverse environments," IEEE Trans. Speech and Audio Processing, vol. 5, No. 1, pp. 11-21, Jan. 1997. DOI
6	D. C. Popescu and I. Zeljkovic, "Kalman filtering of colored noise for speech enhancement," ICSLP'96, Philadelphia, vol. 1, pp.426-429, Oct. 1996.
7	S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
8	D. Naik, "Pole-filtered cepstral mean subtraction," ICSLP'95, Detroit, vol. 1, pp. 157-160, May, 1995.
9	L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, 1978.
10	J. Deller, Jr, J. Proakis and J. Hansen, Discrete-Time Processing of Speech Signals, Macmillan Publishing Co, New York, 1993.
11	V. Goncharoff, E. VonColln, and R. Morris, "Efficient calculation of spectral tilt from various LPC parameters," Proc. IASTED, pp. 60-63, Nov. 1995.
12	A. Oppenheim and D. Johnson, "Discrete representation of signals," Proc. of IEEE, vol. 60, no. 6, pp. 681-691, June, 1972. DOI
13	P. A. Regalia, S. K. Mitra and P. P. Vaidyanathan, "The digital all-pass filter: A versatile signal processing building block," Proc. of IEEE, vol. 76, no. 1, pp. 19-37, Jan. 1988. DOI

KSCI

Efficient Compensation of Spectral Tilt for Speech Recognition in Noisy Environment 잡음 환경에서 음성인식을 위한 스펙트럼 기울기의 효과적인 보상 방법

Efficient Compensation of Spectral Tilt for Speech Recognition in Noisy Environment