DOI QR코드

DOI QR Code

심리음향 특성을 이용한 음성 향상 알고리즘

A Speech Enhancement Algorithm based on Human Psychoacoustic Property

  • 전유용 (인하대학교 전자공학과) ;
  • 이상민 (인하대학교, 전자공학과, 인하대학교 전자정보공동연구소)
  • 투고 : 2000.04.15
  • 심사 : 2010.05.07
  • 발행 : 2010.06.01

초록

In the speech system, for example hearing aid as well as speech communication, speech quality is degraded by environmental noise. In this study, to enhance the speech quality which is degraded by environmental speech, we proposed an algorithm to reduce the noise and reinforce the speech. The minima controlled recursive averaging (MCRA) algorithm is used to estimate the noise spectrum and spectral weighting factor is used to reduce the noise. And partial masking effect which is one of the human hearing properties is introduced to reinforce the speech. Then we compared the waveform, spectrogram, Perceptual Evaluation of Speech Quality (PESQ) and segmental Signal to Noise Ratio (segSNR) between original speech, noisy speech, noise reduced speech and enhanced speech by proposed method. As a result, enhanced speech by proposed method is reinforced in high frequency which is degraded by noise, and PESQ, segSNR is enhanced. It means that the speech quality is enhanced.

키워드

참고문헌

  1. Cohen I., "Noise estimation by Minima Controlled Recursive Averaging for Robust Speech Enhancement," IEEE signal processing letter, Vol. 9, no. 1, 2002, pp.12-15. https://doi.org/10.1109/97.988717
  2. Cohen I., "Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging," IEEE Transactions on Speech and Audio Processing, vol. 11, no. 5, pp. 466-475, Sept. 2003. https://doi.org/10.1109/TSA.2003.811544
  3. N. Fan, J. Rosca, R. Balan, "Speech noise estimation using enhanced minima controlled recursive averaging," in Proc. ICASSP, Honolulu, Hawaii, U.S.A., pp. 581-584, Apr. 2007.
  4. V. Stouten, H. V. hamme, P. Wambacq, "Application of minimum statistics and minima controlled recursive averaging methods to estimate a cepstral noise model for robust ASR,"in Proc. ICASSP, Toulouse, France, pp. 765-768, May. 2006.
  5. Young Woo Lee, Sang Min Lee, Yoon Sang Ji, Jong Shill Lee, Young Joon Chee, Sung Hwa Hong, Sun I. Kim, In Young Kim, "An Efficient Speech Enhancement Algorithm for Digital Hearing Aids Based on Modified Spectral Subtraction and Companding," IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, Volume E90-A, Issue 8, pp.1628-1635, August 2007 https://doi.org/10.1093/ietfec/e90-a.8.1628
  6. Jong-Mo Kum, Yun-Sik Park and Joo-Hyuk Chang, "Speech Enhancement Based on Minima Controlled Recursive Averaging Incorporating Conditional Maximim a Posteriori Criterion," ICASSP Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4417-4420, 2009
  7. Boll S., "A spectral subtraction algorithm for suppression of acoustic noise in speech," Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '79., Vol.4, pp.200-203, 1979
  8. Hu H.T. Yu C., "Adaptive noise spectral estimation for spectral subtraction speech enhancement," Signal Processing, IET, Vol.1 , No.3, pp.156-163 , 2007 https://doi.org/10.1049/iet-spr:20070008
  9. Zheng Wentao, Cao Zhigang, "Speech Enhancement Based On MMSE-STSA Estimation And Residual Noise Reduction," TENCON '91.1991 IEEE Region 10 International Conference on EC3-Energy, Computer, Communication and Control Systems, Vol.3, pp.265-268, 1991
  10. Udrea R.M., Ciochina S., "Speech enhancement using spectral over-subtraction and residual noise reduction," Signals, Circuits and Systems, 2003. SCS 2003. International Symposium on , Vol.1, pp.165-168, 2003
  11. Hasan T., Hasan M.K., "Suppression of Residual Noise From Speech Signals Using Empirical Mode Decomposition," Signal Processing Letters, IEEE, Vol.16, No.1, pp.2-5, 2009 https://doi.org/10.1109/LSP.2008.2008452
  12. Jong Won Shin and Nam Soo Kim, "Perceptual Reinforcement of Speech Signal Based on Partial Specific Loudness," IEEE SIGNAL PROCESSING LETTERS, VOL.14, NO.11, pp.887-890, NOVEMBER 2007 https://doi.org/10.1109/LSP.2007.900222
  13. B. C. J. Moore, B. R. Glasberg, and T. Baer, "A model for the prediction of thresholds, loudness, and partial loudness," J. Audio Eng. Soc., vol.45, no. 4, pp. 224–240, Apr. 1997
  14. ISO 226 2003 Acoustics - Normal equal-loundnesslevel contours / By International Organization for Standardization (ISO). 2nd ed. Geneva: ISO, 2003.
  15. Beerends, John G., Hekstra, Andries P., Rix, Antony W., Hollier, Michael P., "Perceptual Evaluation of Speech Quality (PESQ) The New ITU Standard for End-to-End Speech Quality Assessment Part II: Psychoacoustic Model," J. Audio Eng. Society Volume 50 Issue 10 pp. 765-778; October 2002
  16. Quackenbush, S.R. "Objective Measures of Speech Quality," Prentice-Hall, NJ, 1988