Browse > Article

Speech Enhancement based on Smoothed Global Soft Decision  

Jo, Q-Haing (School of Electronic and Electrical Engineering, Inha University)
Park, Yun-Sik (School of Electronic and Electrical Engineering, Inha University)
Chang, Joon-Hyuk (School of Electronic and Electrical Engineering, Inha University)
Publication Information
Abstract
In this paper, we propose an improved global soft decision for speech enhancement in noise environments. From an examination of statistical model-based speech enhancement, it is shown that the global soft decision has a fundamental drawback at the offset region of speech signals. To overcome the drawback, we apply a new speech enhancement method based on a smoothed Global likelihood ratio to the global soft decision. Performances of the proposed method are evaluated by subjective tests under various environments and yield better results compared with the reported speech enhancement method.
Keywords
Speech Enhancement; Global Soft Decision; Smoothed Global Likelihood Ratio;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J.-H. Chang and N. S. Kim, 'Speech enhancement : new approaches to soft decision,' IEICE Trans. Inf. and Syst., vol. 27, E84-D, pp. 1231-1240, Sep. 2001
2 F. Beritelli, S. Casale, and A. Cavallaro, 'A robust voice activity detector for wireless communications using soft computing,' IEEE Journal on Selectied Areas in Communications, vol. 16, no. 9, pp. 1818-1829, Dec. 1998   DOI   ScienceOn
3 J.-H. Chang, N. S. Kim and S. K. Mitra, 'Voice activity detection based on multiple statistical models,' IEEE Trans. Signal Processing, vol. 54, no. 6, pp. 1965-1976, June 2006   DOI   ScienceOn
4 I. Cohen, 'Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator,' IEEE Signal Processing Letters, vol. 9, no. 4, pp. 113-116, Apr. 2002   DOI   ScienceOn
5 TIA/EIA/IS-127, 'Enhanced variable rate codec, speech service option 3 for wideband spectrum digital systems,' 1996
6 R. J. McAulary and M. L. Malpass, 'Speech enhancement using a soft-decision noise suppression filter,' IEEE Trans. Acoust., Speech, Signal Processing, vol.28, pp. 137-145, Apr. 1980   DOI
7 J.-H. Chang, N. S. Kim and S. K. Mitra, 'A statistical model-based V/UV decision under background noise environments,' IEICE Trans. on Info. and Systs., vol. E87-D, no. 12, pp. 2885-2887, Dec. 2004
8 J.-H. Chang, J. W. Shin and N. S. Kim, 'Voice activity detection employing generalized Gaussian distribution,' Electronics Letters, vol. 40, no. 24, pp.1561-1562, Nov. 2004   DOI   ScienceOn
9 J.-H. Chang and N. S. Kim, 'Voice activity detection based on complex Laplacian model,' Electronics Letters, vol. 39, no. 7, pp. 632-634, Apr. 2003   DOI   ScienceOn
10 Y. D. Cho and A. Kondoz, 'Analysis and improvement of a statistical model-based voice activity detector,' IEEE Signal Processing Letters, vol. 8, no. 10, pp. 276-278, Oct. 2001   DOI   ScienceOn
11 J.-H. Chang, 'Warped discrete cosine transformbased noisy speech enhancement,' IEEE Trans. Circuit and Systems II, vol. 52, issue 9, pp. 535-539, Sept. 2005
12 Y. Ephraim and D. Malah, 'Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator' IEEE Trans. Acoust., Speech, Signal Processing, vol. 32, no. 6, pp. 1109-1121, Dec. 1984   DOI
13 O. Cappe, 'Elimination of musical noise phenomenon with the Ephraim and Malah noise suppressor,' IEEE Trans. Speech and Audio Processing, vol. 2, no. 2, pp. 345-349, Apr. 1994   DOI   ScienceOn
14 J. Sohn, N. S. Kim and W. Sung, 'A statistical model-based voice activity detection,' IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, Jan. 1999
15 J.-H. Chang, 'Perceptual weighting filter for robust speech modification,' Signal Processing, vol. 86, Issue 5, pp. 1089-1093, May 2006   DOI   ScienceOn
16 I. Cohen and B. Berdugo, 'Speech enhancement for non-stationary noise environments,' Signal Processing, vol 81, pp. 2403-2418, Nov. 2001   DOI   ScienceOn
17 J.-H. Chang and N. S. Kim, 'A new structural approach in system identification with generalized analysis-by-synthesis for Robust Speech Coding,' IEEE Trans. Speech and Audio Processing, vol. 14, no. 3, pp. 747-751, May 2006   DOI   ScienceOn
18 E. Nemer, R. Goubran, and S. Mahmoud, 'Robust voice activity detection using higherorder statistics in the LPC Residual domain,' IEEE Trans. Speech and Audio Processing, vol. 9, no. 3, pp. 217-231, Mar. 2001   DOI   ScienceOn
19 J.-H. Chang and N. S. Kim, 'Distorted speech rejection for automatic speech recognition in wireless communication,' IEICE Trans. Info. and Systs., vol. E87-D, no. 7, pp. 1978-1981, July 2005
20 N. S. Kim and J.-H. Chang, 'Spectral enhancement based on global soft decision', IEEE Signal Processing Letters, vol. 7, no. 5, pp. 108-110, May 2000   DOI   ScienceOn
21 I. Cohen and B. Berdugo, 'Noise estimation by minima controlled recursive averaging for robust speech enhancement,' IEEE Signal Processing Letters, vol. 9, no. 1, pp. 12-15, Jan. 2002   DOI   ScienceOn
22 J. Sohn and W. Sung, 'A voice activity detector employing soft decision based noise spectrum adaptation,' Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, pp. 365-368, 1998