Browse > Article
http://dx.doi.org/10.6109/jicce.2012.10.2.162

Speech Processing System Using a Noise Reduction Neural Network Based on FFT Spectrums  

Choi, Jae-Seung (Department of Electronic Engineering, Silla University)
Abstract
This paper proposes a speech processing system based on a model of the human auditory system and a noise reduction neural network with fast Fourier transform (FFT) amplitude and phase spectrums for noise reduction under background noise environments. The proposed system reduces noise signals by using the proposed neural network based on FFT amplitude spectrums and phase spectrums, then implements auditory processing frame by frame after detecting voiced and transitional sections for each frame. The results of the proposed system are compared with the results of a conventional spectral subtraction method and minimum mean-square error log-spectral amplitude estimator at different noise levels. The effectiveness of the proposed system is experimentally confirmed based on measuring the signal-to-noise ratio (SNR). In this experiment, the maximal improvement in the output SNR values with the proposed method is approximately 11.5 dB better for car noise, and 11.0 dB better for street noise, when compared with a conventional spectral subtraction method.
Keywords
Speech processing; Neural network; Amplitude and phase spectrums; Background noise;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D. Pearce and H. G. Hirsch, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," Proceedings of the 6th International Conference on Spoken Language Processing, Beijing, China, pp. 29-32, 2000.
2 L. Y. Sui, X. W. Zhang, J. J. Huang, and B. Zhou, "An improved spectral subtraction speech enhancement algorithm under nonstationary noise," Proceedings of International Conference on Wireless Communications and Signal Processing, Nanjing, China, pp. 1-5, 2011.
3 K. Daqrouq, I. N. Abu-Isbeih, and M. Alfauri, "Speech signal enhancement using neural network and wavelet transform," Proceedings of the 6th International Multi-Conference on Systems, Signals and Devices, Djerba, Tunisia, pp. 1-6, 2009.
4 W. G. Knecht, M. E. Schenkel, and G. S. Moschytz, "Neural network filters for speech enhancement," IEEE Transactions on Speech and Audio Processing, vol. 3, no. 6, pp. 433-438, 1995.   DOI   ScienceOn
5 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 33, no. 2, pp. 443-445, 1985.   DOI
6 R. Okamoto, Y. Takahashi, H. Saruwatari, and K. Shikano, "MMSE STSA estimator with nonstationary noise estimation based on ICA for high-quality speech enhancement," Proceedings of IEEE International Conference on Acoustics Speech and Signal Processing, Dallas, TX, pp. 4778-4781, 2010.
7 J. S. Choi and S. J. Park, "Speech enhancement system based on auditory system and time-delay neural network," Proceedings of the 8th International Conference on Adaptive and Natural Computing Algorithms, Warsaw, Poland, pp. 153-160, 2007.
8 J. S. Choi, "An adaptive speech enhancement system based on noise level estimation and lateral inhibition," Acta Acustica united with Acustica, vol. 93, no. 4, pp. 632-644, 2007.
9 C. Glackin, L. Maguire, and L. McDaid, "Feature extraction from spectro-temporal signals using dynamic synapses, recurrency, and lateral inhibition," Proceedings of International Joint Conference on Neural Networks, Barcelona, Spain, pp. 1-6, 2010.
10 D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero, "A minimum-mean-square-error noise reduction algorithm on Melfrequency cepstra for robust speech recognition," Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, pp. 4041-4044, 2008.
11 S. Jeong, H. Yang, and M. Hahn, "Two-channel noise reduction for robust speech recognition in car environments," Electronics Letters, vol. 44, no. 17, pp. 1042-1043, 2008.   DOI   ScienceOn
12 S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, 1979.   DOI
13 J. Lim, A. Oppenheim, and L. Braida, "Evaluation of an adaptive comb filtering method for enhancing speech degraded by white noise addition," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 26, no. 4, pp. 354-358, 1978.   DOI