Browse > Article

Feature Selection for Multi-Class Genre Classification using Gaussian Mixture Model  

Moon, Sun-Kuk (연세대학교 전기전자공학과 디지털신호처리 연구실)
Choi, Tack-Sung (연세대학교 전기전자공학과 디지털신호처리 연구실)
Park, Young-Cheol (연세대학교 컴퓨터정보통신공학부)
Youn, Dae-Hee (연세대학교 전기전자공학과 디지털신호처리 연구실)
Abstract
In this paper, we proposed the feature selection algorithm for multi-class genre classification. In our proposed algorithm, we developed GMM separation score based on Gaussian mixture model for measuring separability between two genres. Additionally, we improved feature subset selection algorithm based on sequential forward selection for multi-class genre classification. Instead of setting criterion as entire genre separability measures, we set criterion as worst genre separability measure for each sequential selection step. In order to assess the performance proposed algorithm, we extracted various features which represent characteristics such as timbre, rhythm, pitch and so on. Then, we investigate classification performance by GMM classifier and k-NN classifier for selected features using conventional algorithm and proposed algorithm. Proposed algorithm showed improved performance in classification accuracy up to 10 percent for classification experiments of low dimension feature vector especially.
Keywords
Feature selection algorithm; Music Information Retrieval; Genre Classification; Gaussian Mixture Model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 G. Tzanetakis, P. Cook, 'Musical Genre Classification of audio signals', IEEE Transaction on Speech and Audio Processing, vol. 10, No. 5, pp. 293-302, 2002   DOI   ScienceOn
2 G. Peeters, 'Automatic classification of large musical instrument databases using hierarchical classifiers with inertia ratio maximization,' Proc. 115th AES Convention, New York, Oct. 2003
3 S. Essid, G. Richard, B. David, 'Instrument rec ognition in polyphonic music based on automatic taxonomies,' IEEE Trans. on Audio, Speech and Language Processing, vol. 14, No. 1, pp. 68-80, Jan. 2006   DOI   ScienceOn
4 Beth Logan, 'Mel Frequency Cepstral Coefficients for music modeling,' Proceedings of the First International Symposium on Music Information Retrieval (ISMIR), 2000
5 E. scheirer, M. Slaney, 'Construction and evaluation of a robust multifeature speech/music discriminator,' Proc. Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), pp. 1331-1334, 1997
6 G. Peeters, 'A large set of audio features for sound description (similarity and classification) in the CUIDADO project,' CUIDADO I.S.T. Project Report, 2004
7 J. J. Burred, A. Lerch, 'A hierarchical approach to automatic musical genre classification,' Proc. 6th Int. Conference on Digital Audio Effects, London, UK, September 2003
8 http://ismir2004.ismir.net/genre_contest/index.htm, 2004
9 E. Wold, T. Blum, D. Keislar, and J. Wheaton, 'Content-based classification, search and retrieval of audio', IEEE Multimedia, vol. 3, No. 3, pp. 27-36, 1996   DOI   ScienceOn
10 S. Theodoridis, K. Koutroumbas, 'Pattern recognition (third edition),' Academic Press, 2006
11 T. Hastie, R. Tibshirani, J. Friedman, 'The elements of statistical learning - data mining, inference, and prediction,' Springer, 2000
12 F. J. Ferri, P. Pudil, M. Hatef, J. Kittler, 'Comparative study of techniques for large-scale feature selection,' Gelsema, E.S., Kanal, L.N. (Eds.), Pattern Recognition in Practice vo. IV, pp. 403-413, 1994
13 D.-N. Jiang, L. Lu, H.-J. Zhang, J.-H. Tao, and L.-H. Cai. 'Music type classification by spectral contrast feature,' In Proceedings of IEEE International Conference on Multimedia and Expo (ICME02), Lausanne Switzerland, Aug 2002
14 E. Scheirer and M. Slaney, 'Construction and evaluation of a robust multifeature speech/music discriminator,' Proc. Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), pp. 1331- 1334, 1997
15 T. Tolenen and M. Karjalainen, 'A computationally efficient multipitch analysis model,' IEEE Trans. Speech Audio Processing, vol. 8, pp.708-716, Nov. 2000   DOI   ScienceOn
16 D. A. Reynolds, R. C. Rose, 'Robust text-independent speaker identification using Gaussian mixture speaker models,' IEEE Transaction on Speech and Audio Processing, vol. 3, No. 1, pp. 72-83, January 1995   DOI   ScienceOn