[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2008.27.3.111

New Automatic Taxonomy Generation Algorithm for the Audio Genre Classification

Choi, Tack-Sung (연세대학교 전기전자공학과)
Moon, Sun-Kook (연세대학교 전기전자공학과)
Park, Young-Cheol (연세대학교 컴퓨터정보통신공학부)
Youn, Dae-Hee (연세대학교 전기전자공학과)
Lee, Seok-Pil (전자부품연구원(KETI) 디지털미디어 연구센터)

Publication Information

The Journal of the Acoustical Society of Korea / v.27, no.3, 2008 , pp. 111-118 More about this Journal

Abstract

In this paper, we propose a new automatic taxonomy generation algorithm for the audio genre classification. The proposed algorithm automatically generates hierarchical taxonomy based on the estimated classification accuracy at all possible nodes. The estimation of classification accuracy in the proposed algorithm is conducted by applying the training data to classifier using k-fold cross validation. Subsequent classification accuracy is then to be tested at every node which consists of two clusters by applying one-versus-one support vector machine. In order to assess the performance of the proposed algorithm, we extracted various features which represent characteristics such as timbre, rhythm, pitch and so on. Then, we investigated classification performance using the proposed algorithm and previous flat classifiers. The classification accuracy reaches to 89 percent with proposed scheme, which is 5 to 25 percent higher than the previous flat classification methods. Using low-dimensional feature vectors, in particular, it is 10 to 25 percent higher than previous algorithms for classification experiments.

Keywords

Feature selection algorithm; Genre classification; Hierarchy; Taxonomy; Wrapper algorithm;

Citations & Related Records

Reference

1	L. Lu and H. Zhang, "Content analysis for audio classification and segmentation," IEEE Trans. on Speech and Audio Process., 10(5), 504-516, Sep. 2002 DOI ScienceOn
2	Tao Li and Mitsunori Ogihara, "Music genre classification with taxonomy," Proc. Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), 197-200, 2005
3	E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," Proc. Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), 1331-1334, 1997
4	T. Tolenen and M. Karjalainen, "A computationally efficient multipitch analysis model," IEEE Trans. Speech, Audio Process, 8(6), 708-716, Nov. 2000 DOI ScienceOn
5	F. Pachet and D. Cazaly,"A taxonomy of musical genres," Proc. Content-based Multimedia Information Access (RIAO), Paris, France, 2000
6	D. A. Reynolds and R. C. Rose, "Robust test-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech, Audio Process., 3(1), 47-60, Nov. 1996
7	Huan Liu and Lei Yu, "Toward integrating feature selection algorithmsfor classification and clustering," IEEE Trans. on Knowledge and Data Eng., 17(4), April 2005
8	Juan Jose Burred and Alexander Lerch, "A hierarchical approach to automatic musical genre classification," Proc. of the 6th Int. Conference on Digital Audio Effects (DAFX-03), London, UK, Sept. 8-11, 2003
9	G. Tzanetakis and P. Cook, "Musical Genre Classification of audio signals", IEEE Trans. on Speech and Audio Process., 10(4), 293-302, July 2002 DOI ScienceOn
10	C. Yang, Database retrieval based on spectral similarity, (Stanford Univ. Database Group, Stanford, CA, Tech, Rep. 2001-14, 2001)
11	Beth Logan, "Mel Frequency Cepstral Coefficients for music modeling," in Proc. of the First International Symposium on Music Information Retrieval (ISMIR), 2000
12	V. Vapnik,"The nature of statistical learning theory,"New York; Springer-Verlag, 1995
13	S.Essid, G.Richard, and B.David, "Instrument Recognition in Polyphonic Music Based on Automatic taxonomies," IEEE Trans. Audio, Speech, and Lang. Process., 14(1), 68-80, Jan. 2006 DOI ScienceOn
14	S-Y. Kung and J-N. Hwang, "Neural networks for intelligent multimedia processing," Proceedingsof the IEEE, 86(6), 1244-1272, June 1998
15	P. A. Devijver and J. Kitter, Pattern Recognition: A statistical approach. (New York, Prentice-Hall, 1982)
16	G. Peeters, "A large set of audio fetures for sound description (similarity and classification) in the CUIDADO project," CUIDADO I.S.T. Project Report, 2004
17	S. Essid, G. Richard and B. David, "Musical instrument recognition based on class pairwise feature selection," Proc. 5th Int. Conf. Music Information Retrieval (ISMIR), Barcelona, Spain, Oct. 2004
18	D.-N. Jiang, L. Lu, H.-J. Zhang, J.-H. Tao, and L.-H. Cai, "Music type classification by spectral contrast feature,"Proc. of IEEE Int. Conf. on Multimedia and Expo (ICME02), Lausanne Switzerland, Aug, 2002
19	J.-J. Aucouturier and F. Pachet, "Representing music genre: A state of the Art," J. of New Music Research, 32(1), 83-93, 2003 DOI ScienceOn
20	http://ismir2004.ismir.net/genre_contest/index.htm

KSCI

New Automatic Taxonomy Generation Algorithm for the Audio Genre Classification 음악 장르 분류를 위한 새로운 자동 Taxonomy 구축 알고리즘

New Automatic Taxonomy Generation Algorithm for the Audio Genre Classification