Browse > Article
http://dx.doi.org/10.7776/ASK.2013.32.2.131

Signal Subspace-based Voice Activity Detection Using Generalized Gaussian Distribution  

Um, Yong-Sub (전남대학교 전자컴퓨터공학부)
Chang, Joon-Hyuk (한양대학교 전자전기공학부)
Kim, Dong Kook (전남대학교 전자컴퓨터공학부)
Abstract
In this paper we propose an improved voice activity detection (VAD) algorithm using statistical models in the signal subspace domain. A uncorrelated signal subspace is generated using embedded prewhitening technique and the statistical characteristics of the noisy speech and noise are investigated in this domain. According to the characteristics of the signals in the signal subspace, a new statistical VAD method using GGD (Generalized Gaussian Distribution) is proposed. Experimental results show that the proposed GGD-based approach outperforms the Gaussian-based signal subspace method at 0-15 dB SNR simulation conditions.
Keywords
Voice activity detection; Signal subspace; Generalized gaussian distribution;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 K. C. Ryu and D. K. Kim, "Statistical voice activity detector based on signal subspace model," (In Korean), J. Acoust. Soc. Kr. 27, 372-378 (2008).
2 D. K. Kim and J.H. Chang, "A subspace approach based on embedded prewhitening for voice activity detection," J. Acoust. Soc. Am. 130, 304-310 (2011).   DOI
3 Y. S. Um, J. H. Chang, and D. K. Kim "An improved VAD approach based on generalized Gaussian distribution in signal subspace domain," (In Korean), J. Acoust. Soc. Kr. 2(s) 30, 136-139 (2011).   DOI   ScienceOn
4 S. Gazor and W. Zhang "Speech probability distribution," IEEE Sig. Proc. Lett. 10, 204-207 (2003).   DOI   ScienceOn
5 Y. Hu and P. C. Loizou, "A generalized subspace approach for enhancing speech corrupted by colored noise," IEEE Trans. Speech Audio Proc. 11, 334-341 (2003).   DOI   ScienceOn
6 A. G. Glen, L. M. Leemis, and D. R. Barr, "Order statistics in goodness-of-fit testing," IEEE Trans. Reliab. 50, 209-213 (2001).   DOI   ScienceOn
7 S. Nadarajah, "A generalized Gaussian distribution," J. Appl. Stat. 32, 685-694 (2005).   DOI   ScienceOn
8 J. S. Sohn, N. S. Kim, and W. Y. Sung, "A statistical model-based voice activity detection," IEEE Sig. Proc. Lett. 6, 1-3 (1999).
9 Y. D. Cho and A. Kondoz, "Analysis and improvement of a statistical model-based voice activity detector," IEEE Sig. Proc. Lett. 8, 276-278 (2001).   DOI   ScienceOn
10 J. H. Chang, N. S. Kim, and S. K. Mitra, "Voice activity detection based on multiple statistical models," IEEE Trans. Signal Proc. 54, 1965-1976 (2006).   DOI   ScienceOn
11 J. H. Chang, J. W. Shin, and N. S. Kim, "Voice activity detector employing generalized Gaussian distribution," IEE Electronics Lett. 40, 1561-1563 (2004).   DOI   ScienceOn
12 J. W. Shin, J. H. Chang, and N. S. Kim, "Statistical modeling of speech signals based on generalized Gamma distribution," IEEE Sig. Proc. Lett. 12, 258-261 (2005).   DOI   ScienceOn
13 S. Gazor and W. Zhang, "A Soft Voice Activity Detector Based on a Laplacian-Gaussian Model," IEEE Trans. Speech and Audio Proc. 11, 498-505 (2003).   DOI   ScienceOn