Global Soft Decision Using Probabilistic Outputs of Support Vector Machine for Speech Enhancement  

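Jo, Q-Haing (School of Electronic and Electrical Engineering, Inha University)
Chang, Joon-Hyuk (School of Electronic and Electrical Engineering, Inha University)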
Abstract
In this paper, we propose a novel speech enhancement technique using global soft decision (GSD) based on the probabilistic outputs of a support vector machine (SVM). In general, speech enhancement algorithms that apply soft decision (SD) to gain modification and noise power estimation perform better than those employing hard decision. In particular, the global speech absence probability (GSAP), which is known to be an effective measure of speech absence in each frame, has been adopted in SD-based speech enhancement methods. For this reason, we introduce a new GSAP estimated from the probabilistic output of the SVM using a sigmoid function. The performance of the proposed algorithm is evaluated with PESQ and MOS tests under various noise environments, and it yields better results than the conventional GSD scheme.
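The mapping from an SVM output to a GSAP can be illustrated with a small sketch. The abstract does not give the exact formulation, so the following assumes a Platt-style sigmoid, P(speech | f) = 1 / (1 + exp(a f + b)), applied to the per-frame SVM decision value f, with the GSAP taken as its complement. The function names, the sign convention (positive f means speech present), and the values of a and b are illustrative assumptions; in practice a and b would be fitted on labeled training data.

```python
import math


def platt_probability(decision_value, a, b):
    """Platt-style sigmoid: P(speech | f) = 1 / (1 + exp(a * f + b))."""
    z = a * decision_value + b
    # Evaluate 1 / (1 + exp(z)) without overflowing for large |z|.
    if z >= 0:
        e = math.exp(-z)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(z))


def global_speech_absence_probability(decision_value, a=-2.0, b=0.0):
    """Hypothetical GSAP: complement of the speech-presence probability.

    a and b are placeholder values, not taken from the paper.
    With a < 0, a larger decision value gives a lower GSAP.
    """
    return 1.0 - platt_probability(decision_value, a, b)


if __name__ == "__main__":
    # Decision values for three hypothetical frames.
    for f in (-3.0, 0.0, 3.0):
        print(f, global_speech_absence_probability(f))
```

A frame on the decision boundary (f = 0) gets a GSAP of 0.5 under these placeholder parameters, and the value moves smoothly toward 0 or 1 as the frame moves away from the boundary, which is what lets the gain modification and noise power update be weighted softly rather than switched on and off.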
Keywords
Speech enhancement; Support vector machine; Global soft decision