Binary Mask Criteria Based on Distortion Constraints Induced by a Gain Function for Speech Enhancement

Kim, Gibak;

IEIE Transactions on Smart Processing and Computing

Volume 2 Issue 4
/
Pages.197-202
/
2013
/
2287-5255(eISSN)

The Institute of Electronics and Information Engineers (대한전자공학회)

Binary Mask Criteria Based on Distortion Constraints Induced by a Gain Function for Speech Enhancement

Kim, Gibak (School of Electrical Engineering, Soongsil University)

Received : 2013.05.13
Accepted : 2013.06.12
Published : 2013.08.31

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Large gains in speech intelligibility can be obtained using the SNR-based binary mask approach. This approach retains the time-frequency (T-F) units of the mixture signal, where the target signal is stronger than the interference noise (masker) (e.g., SNR > 0 dB), and removes the T-F units, where the interfering noise is dominant. This paper introduces two alternative binary masks based on the distortion constraints to improve the speech intelligibility. The distortion constraints are induced by a gain function for estimating the short-time spectral amplitude. One binary mask is designed to retain the speech underestimated (T-F) units while removing the speech overestimated (T-F)units. The other binary mask is designed to retain the noise overestimated (T-F) units while removing noise underestimated (T-F) units. Listening tests with oracle binary masks were conducted to assess the potential of the two binary masks in improving the intelligibility. The results suggested that the two binary masks based on distortion constraints can provide large gains in intelligibility when applied to noise-corrupted speech.

IEIE Transactions on Smart Processing and Computing

Binary Mask Criteria Based on Distortion Constraints Induced by a Gain Function for Speech Enhancement

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)