Browse > Article
http://dx.doi.org/10.9717/kmms.2014.17.3.312

Voice Activity Detection Algorithm using Fuzzy Membership Shifted C-means Clustering in Low SNR Environment  

Lee, G.H. (경북대학교 대학원 의용생체공학과)
Lee, Y.J. (경북대학교 대학원 의용생체공학과)
Cho, J.H. (경북대학교 IT대학 전자공학부)
Kim, M.N. (경북대학교 의학전문대학원 의공학교실)
Publication Information
Abstract
Voice activity detection is very important process that find voice activity from noisy speech signal for noise cancelling and speech enhancement. Over the past few years, many studies have been made on voice activity detection, it has poor performance for speech signal of sentence form in a low SNR environment. In this paper, it proposed new voice activity detection algorithm that has beginning VAD process using entropy and main VAD process using fuzzy membership shifted c-means clustering. We conduct an experiment in various SNR environment of white noise to evaluate performance of the proposed algorithm and confirmed good performance of the proposed algorithm.
Keywords
Speech Detection; Fuzzy Clustering; Fuzzy Membership Shift;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 L. Rabiner and B.H. Juang, Fundmentals of Speech Recognition, Prentice Hall, Englewood Cliffs, NJ, 1993.
2 ETSI, ETSI standard doc ETSI ES 202 050 v1.1.1, 2002.
3 G.K. Choi and S.H. Kim, "Voice Activity Detection Method using Psycho-Acoustic Model based on Speech Energy Maximization in Noisy Environments," Journal of the Acoustical Society of Korea, Vol. 28, No. 5, pp. 447-453, 2009.
4 D.G. Ha, S.J. Cho, G.G. Jin, and O.K, Shin, "Voice Activity Detection based on Signal Energy and Entropy-difference in Noisy Environments," Journal of the Korean Society of Marine Engineering, Vol. 32, No. 5, pp. 768-774, 2008.   과학기술학회마을   DOI   ScienceOn
5 M. Asgari, A. Sayadian, M. Farhadloo, and E.A. Mehrizi, "Voice Activity Detection using Entropy in Spectrum Domain," Telecommunication Networks and Applications Conference, pp. 407-410, 2008.
6 R. Gemello, F. Mana, and R.D. Mori, "A Modified Ephraim-Malah Noise Suppression Rule for Automatic Speech Recognition," Proc. the ICASSP 2004, Vol. 1, pp. 957-960, 2004.
7 P. Teng and Y. Jia "Voice Activity Detection Via Noise Reducing using Non-Negative Sparse Coding," IEEE Signal Processing Letters, Vol. 20, Issue 5, pp. 475-478, 2013.   DOI   ScienceOn
8 S.W. Deng and J.Q. Han, "Statistical Voice Activity Detection based on Sparse Rep-resentation over Learned Dictionary," Digital Signal Processing, Vol. 23, Issue 4, pp. 1228- 1232, 2013.   DOI   ScienceOn
9 C.E. Shannon, "A Mathematical Theory of Communication," ACM SIGMOBILE Mobile Computing and Communications Review, Vol. 5, No. 1, pp. 3-55, 2001.
10 G,H, Lee, P.U. Kim, Y.J. Lee, and M.N. Kim, "Detection of The First And Second Heart Sound using Three-order Shannon Energy Difference," Journal of the Korea Multimedia Society, Vol. 14, No. 7, pp. 884-894, 2011.   과학기술학회마을   DOI
11 J.W. Jung and J.T. Lee, "A Study on the Fault Current Discrimination using Enhanced Fuzzy C-Means Clustering," Transactions of KIEE, Vol. 57, No. 11, pp. 2102-2107, 2008.   과학기술학회마을
12 J.N. Chio, S.K. Oh, and H.K. Kim, "Genetic Optimization of Fuzzy C-Means Clustering- Based Fuzzy Neural Network," Transactions of KIEE, Vol. 57, No. 3, pp. 466-472, 2008.
13 J. Ramíirez, J.C. Segura, C. Beníitez, A. de la- Torre, and A.. Rubio, "An Effective Subband OSF-based VAD with Noise Reduction for Robust Speech Recognition," IEEE Trans. on Speech and Audio Processing, Vol. 13, No. 6, pp. 1119-1129, 2005.   DOI   ScienceOn