http://dx.doi.org/10.7776/ASK.2009.28.1.070

Improvement of Speech Intelligibility in Noisy Environments  

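Yoon, Jae-Yul (Department of Electronic Engineering, Kwangwoon University)
Kim, Jung-Hoe (Multimedia Lab, Samsung Electronics)
Oh, Eun-Mi (Multimedia Lab, Samsung Electronics)
Park, Ho-Chong (Department of Electronic Engineering, Kwangwoon University)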
Abstract
In speech communication in noisy environments, speech intelligibility is seriously degraded by the masking effect of ambient noise. This paper proposes a new method to improve speech intelligibility in such environments. Based on the perception theory that the temporal envelope plays a major role in determining intelligibility, the proposed method applies a novel operation that enhances the fluctuation of the band-wise temporal envelope, and it also includes pitch enhancement to improve speech naturalness. In addition, a new subjective evaluation scheme employing binaural listening is proposed in order to measure performance more reliably. Subjective performance measured with the proposed scheme shows that the proposed method improves both intelligibility and naturalness in various environments, and that a function parameter can control the trade-off between intelligibility and naturalness.
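The abstract does not spell out the envelope operation, so the following is only a minimal sketch of what "enhancing the fluctuation of the band-wise temporal envelope" could look like, not the authors' method. It assumes brick-wall FFT band splitting, a frame-wise RMS envelope, and a log-domain expansion around the band's mean level. The function names, the band edges, `frame_len`, and `alpha` are all hypothetical choices made for illustration; `alpha` merely plays a role analogous to the trade-off parameter mentioned above. Pitch enhancement is not covered.

```python
import numpy as np

def split_bands(x, fs, edges):
    """Split x into band signals using brick-wall FFT masks (edges in Hz).
    Assumption: a real system would likely use a proper filter bank."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    bands = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (freqs >= lo) & (freqs < hi)
        bands.append(np.fft.irfft(spectrum * mask, n=len(x)))
    return bands

def expand_envelope(band, frame_len, alpha):
    """Scale each frame so that its log-RMS deviates from the band's
    mean log-RMS by a factor alpha (alpha > 1 increases fluctuation)."""
    n_frames = len(band) // frame_len
    frames = band[: n_frames * frame_len].reshape(n_frames, frame_len)
    rms = np.sqrt(np.mean(frames ** 2, axis=1)) + 1e-12  # avoid log(0)
    log_env = np.log(rms)
    target = log_env.mean() + alpha * (log_env - log_env.mean())
    gain = np.exp(target - log_env)
    out = band.copy()  # samples past the last full frame stay unchanged
    out[: n_frames * frame_len] = (frames * gain[:, None]).ravel()
    return out

def enhance(x, fs, edges=(0, 500, 1000, 2000, 4000), frame_len=160, alpha=1.5):
    """Apply the envelope expansion per band and sum the bands again.
    All default values are illustrative assumptions."""
    bands = split_bands(x, fs, edges)
    return sum(expand_envelope(b, frame_len, alpha) for b in bands)

# Usage: a 300 Hz tone with a slow 4 Hz amplitude modulation.
fs = 8000
t = np.arange(fs) / fs
x = (1.0 + 0.5 * np.sin(2 * np.pi * 4 * t)) * np.sin(2 * np.pi * 300 * t)
y = enhance(x, fs, alpha=1.5)
```

With `alpha=1.0` the gains are all one and the signal passes through essentially unchanged; larger values deepen the envelope modulation, which in a real system would have to be balanced against naturalness.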
Keywords
Speech intelligibility; Noisy environments; Temporal envelope; Pitch; Speech quality