Browse > Article
http://dx.doi.org/10.9717/kmms.2013.16.9.1005

Voice Activity Detection Algorithm using Wavelet Band Entropy Ensemble Analysis in Car Noisy Environments  

Lee, G.H. (경북대학교 대학원 의용생체공학과)
Lee, Y.J. (경북대학교 대학원 의용생체공학과)
Kim, M.N. (경북대학교 의학전문대학원 의공학교실)
Publication Information
Abstract
Voice activity detection is very important process that voice activity separated form noisy speech signal for speech enhance. Over the past few years, many studies have been made on voice activity detection, but it has poor performance in low signal to noise ratio environment or fickle noise such as car noise. In this paper, it proposed new voice activity detection algorithm using ensemble variance based on wavelet band entropy and soft thresholding method. We conduct a survey in a lot of signal to noise ratio environment of car noise to evaluate performance of the proposed algorithm and confirmed performance of the proposed algorithm.
Keywords
Voice Activity Detection; Entropy; Wavelet Band; Ensemble; Car Noise;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 L. Rabiner and B.H. Juang, Fundmentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ, 1993.
2 D.G. Ha, S.J. Cho, G.G. Jin, and O.K. Shin, "Voice Activity Detection Based on Signal Energy and Entropy-difference in Noisy Environments," Journal of the Korean Society of Marine Engineering, Vol. 32, No. 5, pp. 768-774, 2008.   과학기술학회마을   DOI   ScienceOn
3 J. Ramiirez, J.C. Segura, C. Beniitez, A. de la- Torre, and A. Rubio, "An Effective Subband OSF-based VAD with Noise Reduction for Robust Speech Recognition," IEEE Trans. on Speech and Audio Processing, Vol. 13, No. 6, pp. 1119-1129, 2005.   DOI   ScienceOn
4 R. Gemello, F. Mana, and R. De Mori, "A Modified Ephraim-Malah Noise Suppression Rule for Automatic Speech Recognition," Proc. ICASSP 2004, Vol. 1, pp. 957-960, 2004.
5 P. Teng and Y. Jia "Voice Activity Detection Via Noise Reducing using Non-Negative Sparse Coding," IEEE Signal Processing Letters, Vol. 20, Issue 5, pp. 475-478, 2013.   DOI   ScienceOn
6 Shi-Wen Deng and Ji-Qing Han, "Statistical Voice Activity Detection Based on Sparse Representation Over Learned Dictionary," Digital Signal Processing, Vol. 23, Issue 4, pp. 1228- 1232, 2013.   DOI   ScienceOn
7 M. Asgari, A. Sayadian, M. Farhadloo, and E.A. Mehrizi, "Voice Activity Detection using Entropy in Spectrum Domain," Telecommunication Networks and Applications Conference, pp. 407-410, 2008.
8 C.E. Shannon, "A Mathematical Theory of Communication," Bell System Technical Journal, Vol. 27, pp. 379-423, 1948.   DOI
9 J. Ramirez, J.C. Segura, C.Benitez, L. Garcia, and A. Rubio, "Statistical Voice Activity Detection using a Multiple Observation Likelihood Ratio Test," IEEE Signal Processing Letter , Vol. 12, No. 10, pp. 689-692, 2005.   DOI   ScienceOn
10 H.K. Kim, S.W. Lee, and J.K. Hong, "Noise Reduction using Spectral Subtraction in the Discrete Wavelet Transform Domain," Journal of the Korea Multimedia Society, Vol. 4, No. 4, pp. 306-315, 2001.
11 J.I. Agbinya, "Discrete Wavelet Transform Techniques in Speech Processing," IEEE TENCON. Digital Signal Processing Applications, Vol. 2, pp. 514-519, 1996.   DOI
12 S.H. Lee and D.H. Yoon, "EEG Signal Compression by Multi-scale Wavelets and Coherence Analysis and Denoising by Continuous Wavelets Transform," Journal of the Institute of Electronics Engineers of Korea, Vol. 41-SP, No. 3, pp. 221-229, 2004.   과학기술학회마을
13 S. Mallat and S. Zhong, "Caracterization of Signals from Multiscale Edges," IEEE Trans. on Information Theory, Vol. 38, No. 2, pp. 710- 732, 1992.
14 K.S. Bae, "Detecttion of Glottal Closure Instant for Voice Speech using Wavelet Transform," Speech Sciences, Vol. 7, No. 3, pp. 164-176, 2000.
15 G.H. Lee, P.U. Kim, Y.J. Lee, and M.N. Kim, "Detection of the First and Second Heart Sound using Three-order Shannon Energy Difference," Journal of the Korea Multimedia Society, Vol. 14, No. 7, pp. 884-894, 2011.   과학기술학회마을   DOI   ScienceOn