Browse > Article
http://dx.doi.org/10.6109/jkiice.2016.20.9.1673

Background Noise Classification in Noisy Speech of Short Time Duration Using Improved Speech Parameter  

Choi, Jae-Seung (Department of Electronic Engineering, Silla University)
Abstract
In the area of the speech recognition processing, background noises are caused the incorrect response to the speech input, therefore the speech recognition rates are decreased by the background noises. Accordingly, a more high level noise processing techniques are required since these kinds of noise countermeasures are not simple. Therefore, this paper proposes an algorithm to distinguish between the stationary background noises or non-stationary background noises and the speech signal having short time duration in the noisy environments. The proposed algorithm uses the characteristic parameter of the improved speech signal as an important measure in order to distinguish different types of the background noises and the speech signals. Next, this algorithm estimates various kinds of the background noises using a multi-layer perceptron neural network. In this experiment, it was experimentally clear the estimation of the background noises and the speech signals.
Keywords
Speech Recognition; Stationary Noise; Non-Stationary Noise; Noisy Environment; Improved Speech Parameter;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, J. Sandberg, and M. Hansson-Sandsten, "Comparing spectrum estimators in speaker verification under additive noise degradation," IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4769-4772, March 2012.
2 R. Saeidi, J. Pohjalainen, T. Kinnunen, and Paavo Alku, "Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification," IEEE Signal Processing Letters, vol. 17, no. 6, pp. 599-602, June 2010.   DOI
3 R. Su, X. Liu, and L. Wang, "Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 1, pp. 102-114, Jan. 2015.
4 J. S. Choi, "Speech and Noise Recognition System by Neural Network," The Korea Institute of Electronic Communication Sciences, vol. 5, no. 4, pp. 357-362, Aug. 2010.
5 N. Moritz, Jorn Anemuller, and B. Kollmeier, "An Auditory Inspired Amplitude Modulation Filter Bank for Robust Feature Extraction in Automatic Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 11, pp. 1926-1937, Nov. 2015.   DOI
6 J. S. Choi, "Noise Reduction Algorithm in Speech by Wiener Filter," The Korea Institute of Electronic Communication Sciences, vol. 8, no. 8, pp. 1293-1298, Aug. 2013.   DOI
7 S. Furui, "Speaker-Independent Isolasted Word Recognition Using Dynamic Features of Speech Spectrum," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 34, no. 1, pp. 52-59, Feb. 1986.   DOI
8 J. S. Choi, "A Wiener Filter Algorithm of Noise Subtraction Based on Threshold Detection," Korean Institute of Information Technology, vol. 13, no. 7, pp. 51-56, July 2015.
9 X. Zhang, Y. Guo, Xuemei Hou, "A speech Recognition Method of Isolated Words Based on Modified LPC Cepstrum," IEEE International Conference on Granular Computing, pp. 481-484, Nov. 2007.
10 S. K. Pal, S. Mitra, "Multilayer perceptron, fuzzy sets, and classification," IEEE Transaction on Neural Networks, vol. 3, no. 5, pp. 683-697, Sep. 1992.   DOI
11 D. Rumelhart, G. Hinton and R. Williams, "Learning representations by back-propagation errors," Nature, vol. 323, pp. 533-536, Oct. 1986.   DOI
12 H. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR2000 on Automatic Speech Recognition: Challenges for the Next Millennium, Paris, France, Oct. 2000.