DOI QR코드

DOI QR Code

Global Soft Decision Based on Improved Speech Presence Uncertainty Tracking Method Incorporating Spectral Gradient

스펙트럼 변이 기반의 향상된 음성 존재 불확실성 추적 기법을 이용한 Global Soft Decision

  • 김종웅 (한양대학교 융합전자공학부) ;
  • 장준혁 (한양대학교 융합전자공학부)
  • Received : 2012.12.11
  • Accepted : 2013.01.28
  • Published : 2013.05.31

Abstract

In this paper, we propose a novel speech enhancement method to improve the performance of the conventional global soft decision which is based on the spectral gradient method applied to the ratio of a priori speech absence and presence probability value (q). Conventional global soft decision scheme used a fixed value of q in accordance with the hypothesis assumed, but the proposed algorithm is a technique for improving the speech absence probability which is applied adaptively variable value of q according to the speech presence or absence in the previous two frames and the conditions of the spectral gradient value. Experimental results show that the proposed improved global soft decision method based on the spectral gradient method yields better results compared to the conventional global soft decision technique based on the performance criteria of the ITU-T P. 862 PESQ (Perceptual Evaluation of Speech Quality).

본 논문에서는 기존의 global soft decision 기법에서 음성 부재 확률을 구할 때의 음성 부재와 존재에 대한 a priori 확률값의 비(q)에 스펙트럼 변이 기법을 적용한 음성 향상 기법을 제안한다. 기존의 global soft decision 방법은 음성 부재 확률을 구하기 위해 가정한 가설에 따라 고정된 q 값을 사용하였지만, 본 논문에서 제안한 알고리즘은 기존의 고정된 값에 직전 2 프레임에서의 음성 존재 여부와 스펙트럼 변이 값의 상태 조건에 따라 적응적으로 q 값이 가변되도록 하여 음성 부재 확률을 향상시키는 기법이다. 제안된 방법의 성능 평가를 위해 ITU-T P.862 PESQ(Perceptual Evaluation of Speech Quality)를 이용하여 평가하였고, 그 결과 제안된 스펙트럼 변이 기법을 적용한 방법이 기존의 global soft decision 방법보다 향상된 결과를 보여주었다.

Keywords

References

  1. S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. on Acoust., Speech, Signal Processing, 27, 113-120 (1979). https://doi.org/10.1109/TASSP.1979.1163209
  2. J. S. Lim and A. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech," IEEE Trans. on Acoust., Speech, Signal Processing, 67, 1583-1604 (1979).
  3. R. J. McAulary and M. L. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. on Acoust., Speech, Signal Processing, 28, 137-145 (1980). https://doi.org/10.1109/TASSP.1980.1163394
  4. Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. on Acoust., Speech, Signal Processing, 32, 1109-1121 (1984). https://doi.org/10.1109/TASSP.1984.1164453
  5. N. S. Kim and J.-H. Chang, "Spectral enhancement based on global soft decision," IEEE Signal Processing Letters, 7, 108-110 (2000). https://doi.org/10.1109/97.841154
  6. D. Malah, R. Cox and A.J. Accardi, "Tracking speech-presence uncertainty to improve speech enhancement in nonstationary noise environments," Proc. IEEE Int. Conf. Acoust. Speech Signal Process., 789-792 (1999).
  7. W. Lee, J.-H. Song, and J.-H. Chang, "Minima-controlled speech presence uncertainty tracking method for speech enhancement," Signal Processing, 91, 155-161 (2011). https://doi.org/10.1016/j.sigpro.2010.06.019
  8. S.-K. Kim and J.-H. Chang, "Voice activity detection based on conditional MAP criterion incorporating the spectral gradient," Signal Processing, 92, 1699-1705 (2012). https://doi.org/10.1016/j.sigpro.2012.01.005
  9. J.-M. Kum and J.-H. Chang, "Improved global soft decision incorporating second-order conditional MAP in speech enhancement," IEICE Transactions on Information and Systems, 93, 1652-1655 (2010).
  10. ITU-T P.862, Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs, 2001.
  11. Y. Hu and P. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio Speech Language Process., 16, 229-238 (2008). https://doi.org/10.1109/TASL.2007.911054

Cited by

  1. Improved Speech-Presence Uncertainty Estimation Based on Spectral Gradient for Global Soft Decision-Based Speech Enhancement vol.E96.A, pp.10, 2013, https://doi.org/10.1587/transfun.E96.A.2025