[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.17661/jkiiect.2017.10.1.85

Sound Model Generation using Most Frequent Model Search for Recognizing Animal Vocalization

Ko, Youjung (Department of Computer Engineering, Hanbat National University)
Kim, Yoonjoong (Department of Computer Engineering, Hanbat National University)

Publication Information

The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.10, no.1, 2017 , pp. 85-94 More about this Journal

Abstract

In this paper, I proposed a sound model generation and a most frequent model search algorithm for recognizing animal vocalization. The sound model generation algorithm generates a optimal set of models through repeating processes such as the training process, the Viterbi Search process, and the most frequent model search process while adjusting HMM(Hidden Markov Model) structure to improve global recognition rate. The most frequent model search algorithm searches the list of models produced by Viterbi Search Algorithm for the most frequent model and makes it be the final decision of recognition process. It is implemented using MFCC(Mel Frequency Cepstral Coefficient) for the sound feature, HMM for the model, and C# programming language. To evaluate the algorithm, a set of animal sounds for 27 species were prepared and the experiment showed that the sound model generation algorithm generates 27 HMM models with 97.29 percent of recognition rate.

Keywords

Animal Vocalization Recognition; MFCC; Most Frequent Model Search Algorithm; HMM; Sound Model Generation;

Citations & Related Records

Reference

1	C. Lee, Y. Lee, Z. Ren, "Automatic Recognition of Bird Songs Using Cepstral Coefficients", Journal of Information Technology and Applications Vol. 1 No. 1, May, pp.17-23, 2006
2	D. Mane, Rashmi R.A., S. L. Tade, "Identification & Detection System for Animals from their Vocalization", International Journal of Advanced Computer Research, vol. 3. pp.352 - 357. 2013
3	D. Mitrovic and M. Zeppelzauer, "Discrimination and retrieval of animal sounds," IEEE Multimedia Modelling Conference, 2006.
4	G. G. and Z. Li., "Content-based classification and retrieval by support vector machines," IEEE Transactions on Neural Networks, vol. 14, pp. 29 - 215, 2003.
5	H. Chen, C. Huang, Y. Chen, C. Chen, and S. Chien, "An Intelligent Nocturnal Animal Vocalization Recognition System", International Journal of Computer and Communication Engineering, Vol. 4, No. 1, pp.39 - 45, 2015 DOI
6	I. S. Hong, Y. J. Ko, H. S. Shin, Y. J. Kim, "Emotion Recognition from Korean Language using MFCC, HMM, and Speech Speed", The 12th International Conference on Multimedia Information Technology and Applications(MITA2016), pp.12-15, 2016
7	Chou, C,. and Liu, P,. (2009). "Bird Species Recognition by Wavelet Transformation of a Section of Birdsong", Proceeding Of symposia and workshop on ubiquitous, Autonomies and Trusted Computing. pp 189-193.
8	Z. Le-Qing, "Insect sound recognition based on MFCC and PNN", 2011 International Conference on Multimedia and Signal Processing, pp. 42-46, 2011
9	L. Rabiner and B. H. Juang. Fundamentals of speech recognition. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1993.
10	Hidden Markov Model Toolkit, http://htk.eng.cam.ac.uk/. (accessed Jan., 10, 2017)
11	S. Young, etal, "The HTK Book (for HTK Version 3.4)",Cambridge University Engineering Department, 2009
12	MFCC(Mel-Frequency Cepstral Coefficients) Algorithm, https://en.wikipedia.org/wiki/Mel-frequency_cepstrum, (accessed Jan., 26,2017)
13	Baum-Welch Algorithm, https://en.wikipedia.org/wiki/Baum-Welch_algorithm, (accessed Jan., 26,2017)
14	Viterbi Algorithm, https://en.wikipedia.org/wiki/Viterbi_algorithm, (accessed Jan., 26,2017)

KSCI

Sound Model Generation using Most Frequent Model Search for Recognizing Animal Vocalization 최대 빈도모델 탐색을 이용한 동물소리 인식용 소리모델생성

Sound Model Generation using Most Frequent Model Search for Recognizing Animal Vocalization