Browse > Article
http://dx.doi.org/10.7776/ASK.2007.26.1.016

A New Statistical Voice Activity Detector Based on UMP Test  

Jang, Keun-Won (전남대학교 전자컴퓨터 공학부)
Chang, Joon-Hyuk (인하대학교 전자전기 공학부)
Kim, Dong-Kook (전남대학교 전자컴퓨터 공학부)
Abstract
Voice activity detectors (VADs) are important in wireless communication and speech signal processing. In the conventional VAD methods. an expression for the likelihood ratio test (LRT) based on statistical models is derived. Then, speech or noise is decided by comparing the value of the expression with a threshold. We propose a new method with the modified decision rule based on the Gaussian distribution and the uniformly most power (UMP) test. This method requires the distribution of the absolute value of the incoming speech signal. Then we can obtain the final decision through the relation between the Rayleigh distributions. This VAD method can detect speech without a priori signal-to-noise ratio (SNR) which is required in the conventional VAD algorithms. Additionally, in the various VAD performance tests, the proposed VAD method is shown to be more effective than the traditional scheme.
Keywords
Voice activity detection; Gaussian distribution; UMP test; Rayleigh distribution; Likelihood ratio test; A priori SNR;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J. -H. Chang, J. W. Shin and N. S. Kim 'Voice Activity Detector Employing Generalized Gaussian Distribution,' IEEE Electronics Lett. 40 (24) 1561 - 1563, Nov. 2004   DOI   ScienceOn
2 J. -H. Chang, N. S. Kim and S. K. Mitra 'Voice Activity Detection Based on Multiple Statistical Models' IEEE Trans. Signal Processing, 54 (6) 1965 - 1976, June 2006   DOI   ScienceOn
3 R. Martin, 'Noise Power Spectral Density Estimation Based on Optimal Srroothinq and Minimum Statistics,' IEEE Trans. Speech and Audio Processing 9 (5) 504 - 512, Jul. 2001   DOI   ScienceOn
4 Y. D. Cho and A Kondoz 'Analysis and Improvement of A Statistical Model-Based Voice Activity Detector,' IEEE Signal Processing, Lett. 8 (10) 27&-278, Oct. 2001
5 A.Varga and H.J.M. Steeneken, 'Assessment for Automatic Speech Recognition: II.NOISEX-92: A Database and An Experiment to Study The Effect of Additive Noise on Speech Recognition Systems,' Speech Communication, 12 (3) 247-251, Ju1.1993   DOI   ScienceOn
6 L. R. Rabiner and M. R. Sambur, 'Voiced-Unvoiced-Silence Detection Using Itakura LPC Distance Measure,' Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing. 2 323-326, May 1977
7 J. -H. Chang and N. S. Kim, 'Voice Activity Detection Based on Complex Laplacian Model,' IEEE Electronics Lett., 39 (7) 632 - 634, April 2003   DOI   ScienceOn
8 A Dvis, S. Nordholm and R. Togneri, 'Statistical Voice Activity Detection Using Low-Variance Spectrum Estimation and an Adaptive Threshold,' IEEE Trans. Audio, Speech, and Language Processing, 14 (2) 412-424, March 2006   DOI   ScienceOn
9 J. S. Sohn, N. S. Kim and W. Y. Sung, 'A Statistical Model-Based Voice Activity Detection,' IEEE Signal Process, Lett., 6 (1) 1-3, 1999
10 Y. Ephraim and D. Malah, 'Speech Enhancement Using A Minimum Mean-square Error Short-time Spectral Amplitude Estimator'' IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP-32, 1109-1121, Dec. 1984   DOI
11 S. M. Kay Fundamentals of Statistical Signal Processing, (Volume 2: Detection Theory, Prentice Hall. 1998)
12 N. S. Kim, and J. -H. Chang, 'Spectral Enhancement Based on Global Soft Decision,' IEEE Signal Process. Lett.. 7 (5) 108-110, 2000   DOI   ScienceOn
13 P. Vary and R. Martin. Digital Speech Transmission: Enhancement, Coding and Error Concealment, (John Wiley & Sons Inc., 2006)