
An Improved VAD Algorithm Employing Speech Enhancement Preprocessing and Threshold Updating  

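Yoon-Chang Lee (Signal Processing Laboratory, Department of Electronics and Information Engineering, Korea University)
Sang-Sik Ahn (School of Electronics and Information Engineering, Korea University)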
Abstract
In this paper, we propose an improved statistical model-based voice activity detection (VAD) algorithm and a threshold update method. We first improve the signal-to-noise ratio (SNR) with a speech enhancement preprocessing algorithm that combines the power subtraction method with a matched filter, and then apply the enhanced signal to the log-likelihood ratio (LLR) test optimum decision rule to improve performance even in low-SNR conditions. We also propose an adaptive threshold update method, which previous papers have not addressed. Extensive computer simulations demonstrate the performance improvement of the proposed VAD algorithm, which employs the proposed speech enhancement preprocessing algorithm and adaptive threshold update method, under various background noise environments. Finally, we verify our results by comparison with ITU-T G.729 Annex B.
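The decision chain outlined in the abstract can be illustrated with a minimal sketch. It assumes the per-bin LLR form commonly used in statistical model-based VAD (as in the Sohn and Sung entry of the reference list), with the a priori SNR obtained by power subtraction. The matched filter stage is omitted, and the threshold update shown here (exponential smoothing of the LLR over noise-only frames plus a fixed margin) is a hypothetical placeholder, not the update rule proposed in the paper. The names `frame_llr`, `detect`, `eta`, `margin`, and `beta` are illustrative assumptions.

```python
import math

def frame_llr(power, noise_var):
    """Mean log-likelihood ratio over the frequency bins of one frame.

    power     -- per-bin power spectrum |X_k|^2 of the noisy frame
    noise_var -- per-bin noise variance estimate
    """
    total = 0.0
    for p, n in zip(power, noise_var):
        gamma = p / n                  # a posteriori SNR
        xi = max(gamma - 1.0, 0.0)     # a priori SNR by power subtraction
        total += gamma * xi / (1.0 + xi) - math.log1p(xi)
    return total / len(power)

def detect(frames, noise_var, eta=0.5, margin=0.5, beta=0.9):
    """Return one boolean per frame (True = speech).

    The threshold update below is an assumption for illustration only:
    the LLR of noise-only frames is smoothed and the threshold is kept
    a fixed margin above that smoothed value.
    """
    decisions = []
    noise_llr = None
    for power in frames:
        llr = frame_llr(power, noise_var)
        speech = llr > eta
        decisions.append(speech)
        if not speech:
            if noise_llr is None:
                noise_llr = llr
            else:
                noise_llr = beta * noise_llr + (1.0 - beta) * llr
            eta = noise_llr + margin   # hypothetical adaptive threshold
    return decisions

if __name__ == "__main__":
    noise = [1.0, 1.0]
    frames = [[1.0, 1.0], [10.0, 10.0], [1.0, 1.0]]
    print(detect(frames, noise))
```

In this toy input the middle frame has ten times the noise power in every bin, so its LLR is far above the threshold, while the two noise-level frames yield an LLR of zero.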
Keywords
Voice activity detection; Threshold update; Speech enhancement;
Citations & Related Records
  • Reference
1 ITU-T Recommendation, 'G.729 Annex B: A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70,' Nov. 1996
2 Yoon-Chang Lee, Sang-Sik Ahn, 'An improved voice activity detection algorithm employing speech enhancement preprocessing,' IEICE Transactions on Fundamentals, Vol. E84-A, No. 6, Jun. 2001
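3 Yoon-Chang Lee, Sang-Sik Ahn, 'A voice detection method using a matched filter,' Abstracts of the Korean Institute of Communication Sciences Summer Conference, Vol. 25, pp. 6, Aug. 2002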
4 http://spib.rice.edu, Rice Univ., DSP group.
5 Yong Duk Cho, Ahmet M. Kondoz, 'Analysis and improvement of a statistical model-based voice activity detector,' IEEE Signal Processing Letters, Vol. 8, Issue 10, pp. 276-278, Oct. 2001
6 Ahmet M. Kondoz, 'Digital speech coding for low bit rate communications systems,' John Wiley & Sons, pp. 337-341
7 H. G. Hirsch, C. Ehrlicher, 'Noise estimation techniques for robust speech recognition,' ICASSP-95, pp. 153-156, May 1995
8 Jin Yang, 'Frequency domain noise suppression approaches in mobile telephone system,' ICASSP-93, Vol. 2, pp. 363-366, 1993
9 Jongseo Sohn, Wonyong Sung, 'A statistical model-based voice activity detection,' IEEE Signal Processing Letters, Vol. 6, No. 1, 1999
10 http://www.sipro.com, Sipro Lab., Telecom Inc