A Speech Enhancement Algorithm based on Human Psychoacoustic Property

Jeon, Yu-Yong; Lee, Sang-Min

doi:10.5370/KIEE.2010.59.6.1120

The Transactions of The Korean Institute of Electrical Engineers (전기학회논문지)

Volume 59, Issue 6, pp. 1120-1125, 2010. pISSN 1975-8359, eISSN 2287-4364

The Korean Institute of Electrical Engineers (대한전기학회)

Korean title: 심리음향 특성을 이용한 음성 향상 알고리즘 (A Speech Enhancement Algorithm Using Psychoacoustic Characteristics)

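Jeon, Yu-Yong (Department of Electronic Engineering, Inha University);
Lee, Sang-Min (Department of Electronic Engineering, Inha University; Electronics and Information Joint Research Institute, Inha University)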

Received : 2000.04.15
Accepted : 2010.05.07
Published : 2010.06.01

https://doi.org/10.5370/KIEE.2010.59.6.1120

Abstract

In speech systems such as hearing aids and speech communication devices, speech quality is degraded by environmental noise. To enhance speech degraded in this way, this study proposes an algorithm that reduces the noise and reinforces the speech. The minima controlled recursive averaging (MCRA) algorithm is used to estimate the noise spectrum, and a spectral weighting factor is used to reduce the noise. The partial masking effect, one of the properties of human hearing, is then introduced to reinforce the speech. We compared the waveform, spectrogram, Perceptual Evaluation of Speech Quality (PESQ) score, and segmental signal-to-noise ratio (segSNR) of the original speech, the noisy speech, the noise-reduced speech, and the speech enhanced by the proposed method. The speech enhanced by the proposed method was reinforced in the high-frequency range, which had been degraded by noise, and both PESQ and segSNR improved, indicating that speech quality was enhanced.
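
The abstract reports segSNR as one of its objective measures but does not give the formula or settings used. As a rough illustration only, the sketch below computes a commonly used form of segmental SNR: the per-frame SNR in dB between a clean reference and a processed signal, clamped to a fixed range and averaged over frames. The function name `segmental_snr`, the frame length of 256 samples, the non-overlapping frames, and the clamping range of -10 dB to 35 dB are assumptions chosen for the example, not values taken from the paper.

```python
import math


def segmental_snr(clean, processed, frame_len=256, lo_db=-10.0, hi_db=35.0):
    """Mean per-frame SNR in dB between a clean reference and a processed signal.

    frame_len, lo_db and hi_db are illustrative defaults (assumptions),
    not settings reported in the paper.
    """
    if len(clean) != len(processed):
        raise ValueError("signals must have the same length")

    eps = 1e-12  # guards against division by zero and log10(0)
    frame_snrs = []

    # Walk over non-overlapping frames; a trailing partial frame is ignored.
    for start in range(0, len(clean) - frame_len + 1, frame_len):
        ref = clean[start:start + frame_len]
        out = processed[start:start + frame_len]

        signal_energy = sum(x * x for x in ref)
        error_energy = sum((x - y) ** 2 for x, y in zip(ref, out))

        snr_db = 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))

        # Clamp so that silent or perfectly matched frames do not dominate the mean.
        frame_snrs.append(min(max(snr_db, lo_db), hi_db))

    if not frame_snrs:
        raise ValueError("signal is shorter than one frame")

    return sum(frame_snrs) / len(frame_snrs)
```

Called with a clean reference and, in turn, the noisy, noise-reduced, and enhanced signals, a function like this yields one number per condition, which is the kind of comparison the abstract describes. A higher value means the processed signal is closer to the reference on a frame-by-frame basis.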

