Browse > Article
http://dx.doi.org/10.5391/JKIIS.2011.21.2.254

Detection and Recognition Method for Emergency and Non-emergency Speech by Gaussian Mixture Model  

Cho, Young-Im (수원대학교 컴퓨터학과)
Lee, Dae-Jong (충북대학교 전기전자컴퓨터공학부)
Publication Information
Journal of the Korean Institute of Intelligent Systems / v.21, no.2, 2011 , pp. 254-259 More about this Journal
Abstract
For the emergency detecting in general CCTV environment of our daily life, the monitoring by only images through CCTV information occurs some problems especially in cost as well as man power. Therefore, in this paper, for detecting emergency state dynamically through CCTV as well as resolving some problems, we propose a detection and recognition method for emergency and non-emergency speech by GMM. The proposed method determine whether input speech is emergency or non-emergency speech by global GMM. If emergeny speech, local GMM is performed to classify the type of emergency speech. The proposed method is tested and verified by emergency and non-emergency speeches in various environmental conditions.
Keywords
Emergency; GMM; Speech enhancement; MFCC; CCTV;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 유장희, 문기영, 조현숙, “지능형 영상보안 기술현황 및 동향,” 전자통신동햑분석, 제23권, 4호, pp. 80-89, 2008.   과학기술학회마을
2 S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans., ASSP, vol. 37, no. 2, pp. 113-120, 1979.
3 Doclo, S., Rong Dong, Klasen, T.J., Wouters, J., Haykin, S., Moonen, M., "Extension of the multi-channel Wiener filter with ITD cues for noise reduction in binaural hearing aids," Applications of Signal Processing to Audio and Acoustics, vol. 16, no. 16, pp 70-73, 2005.
4 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," in Proc. ICASSP. vol. ASSP-32, no. 6, pp. 1109-1121, 1984.
5 R. Martin, "Spectral subtraction based on minimum statistics," in Proc. EISOPCO. pp. 1182-1185, 1994.
6 R. Marin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans, Speech Audio Process, vol. 9, no. 5, pp. 504-512, 2001.   DOI
7 J. S. Erkelens and Richard Heusdens, "Tracking of nonstationary noise based on data-driven recursive noise power estimation," IEEE Trans. Audio, Speech and Language Processing, vol. 16, no. 6, pp. 1112-1123. 2008.   DOI
8 Rabiner and Sambur, "An algorithm for determining the endpoints of isolated utterances," The bell system technical journal, vol. 54, no. 2, pp. 297-315, 1975,   DOI
9 Ethem Alpayd, "Soft vector quantization and the EM algorithm," Neural Networks, vol. 11, pp. 467-477, Issue 3, 1998.   DOI
10 P. Dhanalakshmi., S. Palanivel, V. Ramalingam, “Classification of audio signals using AANN and GMM,” Applied soft computing, vol. 11, No. 1, pp. 716-723,2011.   DOI   ScienceOn
11 http://www.sitec.or.kr