Robust Voice Activity Detection in Noisy Environment Using Entropy and Harmonics Detection  

Choi, Gab-Keun (Computer Engineering Department, Kwangwoon University)
Kim, Soon-Hyob (Computer Engineering Department, Kwangwoon University)
Abstract
This paper presents an end-point detection method for improving speech recognition rates. The proposed method separates speech and non-speech regions using the spectral entropy and the harmonic detection of speech. End-point detection based on the entropy of the speech spectral energy performs well in high-SNR environments (SNR 15 dB). In low-SNR environments (SNR 0 dB), however, the threshold level separating speech from noise varies, so precise end-point detection is difficult. This paper therefore introduces an end-point detection method that uses both speech spectral entropy and harmonics. Experiments show better performance than conventional entropy-based methods.
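The entropy feature mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common definition of spectral entropy (Shannon entropy of the frame's normalized power spectrum). The abstract does not give the authors' exact formulation, so the function names, the naive DFT, and the fixed threshold below are all hypothetical. The harmonic detection step is not sketched, because the abstract does not describe how it is performed.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of one real-valued frame."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(frame):
    """Shannon entropy (in bits) of the normalized power spectrum."""
    power = power_spectrum(frame)
    total = sum(power)
    if total == 0.0:
        # Assumption: treat an all-zero frame as maximally flat.
        return math.log2(len(power))
    probs = [p / total for p in power]
    return -sum(p * math.log2(p) for p in probs if p > 0.0)


def is_speech_frame(frame, entropy_threshold):
    """Hypothetical decision rule: low entropy suggests structured speech."""
    return spectral_entropy(frame) < entropy_threshold
```

A frame whose energy is concentrated in a few spectral peaks, as in voiced speech, yields a low entropy, while a broadband noise frame yields a value close to the maximum of log2(number of bins). A fixed threshold such as the one passed to `is_speech_frame` is the part that the abstract reports as unreliable at 0 dB SNR, which is the motivation for adding harmonic information.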
Keywords
Voice Activity Detection; End-Point Detection; Speech Recognition