[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.13067/JKIECS.2021.16.3.465

Nonlinear Speech Enhancement Method for Reducing the Amount of Speech Distortion According to Speech Statistics Model

Choi, Jae-Seung (Division of Smart Electrical and Electronic Engineering, Silla University)

Publication Information

The Journal of the Korea institute of electronic communication sciences / v.16, no.3, 2021 , pp. 465-470 More about this Journal

Abstract

A robust speech recognition technology is required that does not degrade the performance of speech recognition and the quality of the speech when speech recognition is performed in an actual environment of the speech mixed with noise. With the development of such speech recognition technology, it is necessary to develop an application that achieves stable and high speech recognition rate even in a noisy environment similar to the human speech spectrum. Therefore, this paper proposes a speech enhancement algorithm that processes a noise suppression based on the MMSA-STSA estimation algorithm, which is a short-time spectral amplitude method based on the error of the least mean square. This algorithm is an effective nonlinear speech enhancement algorithm based on a single channel input and has high noise suppression performance. Moreover this algorithm is a technique that reduces the amount of distortion of the speech based on the statistical model of the speech. In this experiment, in order to verify the effectiveness of the MMSA-STSA estimation algorithm, the effectiveness of the proposed algorithm is verified by comparing the input speech waveform and the output speech waveform.

Keywords

Speech Recognition; Speech Enhancement Algorithm; Noise Suppression; MMSA-STSA Estimation; Statistics Model;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	J. Choi, "Independent Component Analysis based on Frequency Domain Approach Model for Speech Source Signal Extraction," J. of the Korea Institute of Electronic Communication Sciences, vol. 15, no. 5, Oct. 2020, pp. 807-812. DOI
2	S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. on Acoustic Speech Signal Processing, vol. ASSP-27, no. 2, Apr. 1979, pp. 113-120. DOI
3	Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," IEEE Trans. on Speech and Audio Processing, vol. ASSP-32, no. 6, Dec. 1984, pp. 1109-1121. DOI
4	J. Lim and A. V. Oppenheim, "All-pole modeling of degraded speech," IEEE Trans. ASSP, vol. 26, no. 3, 1978, pp. 197-210.
5	F. Asano, S. Ikeda, M. Ogawa, H. Asoh, and N. Kitawaki, "Combined approach of array processing and independent component analysis for blind separation of acoustic signals," IEEE Trans. on Speech and Audio Processing, vol. 11, no. 3, May 2003, pp. 204-215. DOI
6	J. Choi, "An Adaptive Speech Enhancement System Based on Noise Level Estimation and Lateral Inhibition," ACTA Acustica United with Acustica, vol. 93, no. 4, 2007, pp. 632-644.
7	J. H. L. Hansen and M. A. Clements, "Constrained Iterative Speech Enhancement with Application to Speech Recognition," IEEE Transactions on Signal Processing, vol. 39, no. 4, Apr. 1991, pp. 795-805. DOI
8	H. Lee, "Acoustic Feedback and Noise Cancellation of Hearing Aids by Deep Learning Algorithm," J. of the Korea Institute of Electronic Communication Sciences, vol. 14, no. 6, Dec. 2019, pp. 1249-1256. DOI
9	C. Lee, "Dimensionality Reduction in Speech Recognition by Principal Component Analysis," J. of the Korea Institute of Electronic Communication Sciences, vol. 8, no. 9, Sept. 2013, pp. 1299-1305. DOI
10	M. S. Kavalekalam, M. G. Christensen, F. Gran, and J. B. Boldt, "Kalman filter for speech enhancement in cocktail party scenarios using a codebook-based approach," 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, Mar. 2016, pp. 191-195.
11	H. Gustafsson, S. Nordholm, and I. Claesson, "Spectral subtraction with adaptive averaging of the gain function," 6th European Conference on Speech Communication and Technology(Eurospeech'99), Budapest, Hungary, Sept. 1999, pp. 2599-2602.
12	X. Dang and T. Nakai, "Noise Reduction using Modified Phase Spectra and Wiener Filter," 2011 IEEE International Workshop on Machine Learning for Signal Processing, Sept. 2011, pp. 1-5.
13	H. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition system under noisy conditions," Proc. ISCA ITRW Workshop on Automatic Speech Recognition, Paris, France, 2000.

KSCI

Nonlinear Speech Enhancement Method for Reducing the Amount of Speech Distortion According to Speech Statistics Model 음성 통계 모형에 따른 음성 왜곡량 감소를 위한 비선형 음성강조법

Nonlinear Speech Enhancement Method for Reducing the Amount of Speech Distortion According to Speech Statistics Model