[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5762/KAIS.2012.13.2.817

Estimation of Optimal Mixture Number of GMM for Environmental Sounds Recognition

Han, Da-Jeong (Division of Electronic and Computer Engineering, Chonnam University)
Park, Aa-Ron (Division of Electronic and Computer Engineering, Chonnam University)
Baek, Sung-June (Division of Electronic and Computer Engineering, Chonnam University)

Publication Information

Journal of the Korea Academia-Industrial cooperation Society / v.13, no.2, 2012 , pp. 817-821 More about this Journal

Abstract

In this paper we applied the optimal mixture number estimation technique in GMM(Gaussian mixture model) using BIC(Bayesian information criterion) and MDL(minimum description length) as a model selection criterion for environmental sounds recognition. In the experiment, we extracted 12 MFCC(mel-frequency cepstral coefficients) features from 9 kinds of environmental sounds which amounts to 27747 data and classified them with GMM. As mentioned above, BIC and MDL is applied to estimate the optimal number of mixtures in each environmental sounds class. According to the experimental results, while the recognition performances are maintained, the computational complexity decreases by 17.8% with BIC and 31.7% with MDL. It shows that the computational complexity reduction by BIC and MDL is effective for environmental sounds recognition using GMM.

Keywords

Gaussian mixture model; BIC; MDL; Bayesian information criterion; environmental sounds recognition;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	G. McLachlan., D. Peel., "Finite Mixture Models," A wiley-interscience publication, 2000.
2	S. S. Chen and P. S. Gopalkrishana, "Speaker, enviroment, and channel change detection and clustering via the Bayesian information criterion," Proceedings of the IEEE Interational Conference on vol.2, pp.645-648, 1998.
3	National Information Society Agency Information Strategy Planning Division, "Paradigm shift in the era of smart vision and ICT strategy", National Information Society Agency, 2010.
4	J.Rissanen., "modeling by shortest data description," Automatica, vol.14, pp.465-471, 1978. DOI ScienceOn
5	Il-Young Hong, "Context-aware software, Now mind you should read beyond gesture," Korea IT Industry Promotion Agency, 2008.
6	Jun-Qyu Park, Seong-Joon Baek, "Improvement of Environmental Sounds Recognition by Post Processing", the Korea Contents Society vol. 10, pp.31-39, 2010. 과학기술학회마을 DOI
7	S. Chu, S. Narayana, C.-C. J. Kuo, and M. J. Mataric, "Where am I? Scene recognition for mobile robots using audio features," in Proc. ICME, 2006.
8	S. Chu, S. Narayanan, and C.-C. Jay Kuo "Environmental Sound Recognition With Time-Frequency Audio Features," IEEE Trans. on Audio, Speech, and Language Processing, Vol.17, No.6, pp.1-16, 2009. DOI
9	Richard O.Duda, Peter E.Hart, David G.Stork, Pattern Classification, John Wiley & Sons, 2001
10	Burnham, Kenneth P, and David R. Anderson, Model selection and Multimodal Inference : A Practical Information-Theoretic Approach Seconded. New York : Springer-Verlag, 2002

KSCI

Estimation of Optimal Mixture Number of GMM for Environmental Sounds Recognition 환경음 인식을 위한 GMM의 혼합모델 개수 추정

Estimation of Optimal Mixture Number of GMM for Environmental Sounds Recognition