DOI QR코드

DOI QR Code

Low-Complexity Speech Enhancement Algorithm Based on IMCRA Algorithm for Hearing Aids

보청기를 위한 IMCRA 기반 저연산 음성 향상 알고리즘

  • Received : 2017.11.18
  • Accepted : 2017.11.29
  • Published : 2017.12.31

Abstract

In this paper, we proposed a low-complexity speech enhancement algorithm based on a improved minima controlled recursive averaging (IMCRA) and log minimum mean square error (logMMSE). The IMCRA algorithm track the minima value of input power within buffers in local window and identify the speech presence using ratio between input power and its minima value. In this process, many number of operations are required. To reduce the number of operations of IMCRA algorithm, minima value is tracked using time-varying frequency-dependent smoothing based on speech presence probability. The proposed algorithm enhanced speech quality by 2.778%, 3.481%, 2.980% and 2.162% in 0, 5, 10 and 15dB SNR respectively and reduced computational complexity by average 9.570%.

본 논문에서는 향상된 최소값 제어 재귀 평균 (improved minima controlled recursive averaging, IMCRA) 알고리즘과 로그 최소값 평균 제곱 오차 (log minimum mean square error, logMMSE)를 기반으로 한 저연산 음성 향상 알고리즘을 제안한다. IMCRA 알고리즘은 버퍼를 이용하여 일정 구간에서 입력 신호 전력의 최소값을 추적하고 최소값과 입력 신호의 비율을 통해 음성 존재를 확인한다. 이러한 과정에서 많은 연산이 필요하며 연산량을 줄이기 위해서 음성 존재 확률을 기반으로한 주파수 밴드별 시변 스무딩으로 최소값을 추적한다. 제안된 알고리즘은 0dB, 5dB, 10dB 그리고 15dB 신호 대 잡음비에서 평균 2.778%, 3.481%, 2.980% 그리고 2.162% 음성 품질이 향상되었으며, 평균 9.570% 연산량이 감소한 것을 확인하였다.

Keywords

References

  1. Kochkin, Sergei. "Customer satisfaction with single and multiple microphone digital hearing aids." Hearing Review 11 pp. 24-34, 2000.
  2. Boll, Steven. "Suppression of acoustic noise in speech using spectral subtraction." IEEE Transactions on acoustics, speech, and signal processing 27.2 pp. 113-120, 1979. https://doi.org/10.1109/TASSP.1979.1163209
  3. Ephraim, Yariv, and David Malah. "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator." IEEE Transactions on Acoustics, Speech, and Signal Processing 32.6 pp. 1109-1121, 1984. https://doi.org/10.1109/TASSP.1984.1164453
  4. Ephraim, Yariv, and David Malah. "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator." IEEE Transactions on Acoustics, Speech, and Signal Processing 33.2 pp. 443-445, 1985. https://doi.org/10.1109/TASSP.1985.1164550
  5. Cohen, Israel, and Baruch Berdugo. "Noise estimation by minima controlled recursive averaging for robust speech enhancement." IEEE signal processing letters 9.1 pp. 12-15, 2002. https://doi.org/10.1109/97.988717
  6. Cohen, Israel. "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging." IEEE Transactions on speech and audio processing, Vol.11, Issue 5 pp. 466-475, 2003. DOI:10.1109/TSA.2003.811544.
  7. Ephraim, Yariv, and David Malah. "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator." IEEE Transactions on Acoustics, Speech, and Signal Processing 33.2 pp. 443-445, 1985. https://doi.org/10.1109/TASSP.1985.1164550
  8. Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoustics, Speech, Sig. Process., vol. ASSP-32, no. 6, pp. 1190-1121, Dec. 1984.
  9. Hu, Yi, and Philipos C. Loizou. "Subjective comparison of speech enhancement algorithms." Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on. vol. 1, IEEE, 2006.
  10. Jang, H. S., et al. "Development of Korean standard sentence lists for sentence recognition tests." Audiology 4.2 pp. 161-177, 2008.
  11. Hu, Yi, and Philipos C. Loizou. "Evaluation of objective quality measures for speech enhancement." IEEE Transactions on audio, speech, and language processing 16.1 pp. 229-238, 2008. https://doi.org/10.1109/TASL.2007.911054