Browse > Article

Feature-Vector Normalization for SVM-based Music Genre Classification  

Lim, Shin-Cheol (Department of Information and Communication Engineering, Sejong University)
Jang, Sei-Jin (Korea Electronics Technology Institute)
Lee, Seok-Pil (Korea Electronics Technology Institute)
Kim, Moo-Young (Department of Information and Communication Engineering, Sejong University)
Publication Information
Abstract
In this paper, Mel-Frequency Cepstral Coefficient (MFCC), Decorrelated Filter Bank (DFB), Octave-based Spectral Contrast (OSC), Zero-Crossing Rate (ZCR), and Spectral Contract/Roll-Off are combined as a set of multiple feature-vectors for the music genre classification system based on the Support Vector Machine (SVM) classifier. In the conventional system, feature vectors for the entire genre classes are normalized for the SVM model training and classification. However, in this paper, selected feature vectors that are compared based on the One-Against-One (OAO) SVM classifier are only used for normalization. Using OSC as a single feature-vector and the multiple feature-vectors, we obtain the genre classification rates of 60.8% and 77.4%, respectively, with the conventional normalization method. Using the proposed normalization method, we obtain the increased classification rates by 8.2% and 3.3% for OSC and the multiple feature-vectors, respectively.
Keywords
Music genre classification; MFCC; OSC; DFB; SVM;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 N. Scaringella, G. Zoia, and D. Mlynek, "Automatic genre classification of music content: A survey," IEEE Signal Process., vol. 23, no. 2, pp. 133-141, 2006.
2 G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. Speech Audio Process., vol. 10, no. 5, pp. 293-302, 2002.   DOI   ScienceOn
3 D. N. Jiang, L. Lu, H. J. Zhang, J. H. Tao, and L. H. Cai, "Music type classification by spectral contrast feature," in Proc. IEEE Int. Conf. Multimedia and Expo, vol. 1, pp. 113-116, 2002.
4 C-H. Lee, J-L. Shih, K-M. Yu, and J-M Su, "Automatic music genre classification using modulation spectral contrast feature," in Proc. IEEE Int. Conf. Multimedia and Expo, pp.204-207, 2007.
5 이성훈, 배건성, "Text-dependent한 화자인식 시스템에 관한 연구," 대한전자공학회, 대한전자공학회 학술대회 논문집, 제7권, 제1호, 312-317쪽, 1994년 1월
6 J. Jung, K. Kim, and M. Y. Kim, "Noise robust speaker identification based on the advanced missing feature theory," Electronics Letters, vol. 46, no. 14, pp. 1027-1029, 2010.   DOI   ScienceOn
7 임정수, 송지현, 장준혁, "SVM의 미세조정을 통한 음성/음악분류 성능향상," 대한전자공학회 전자공학회 논문지-SP , 제47권 SP편 제2호, 141-148쪽, 2011월 3월
8 GTZAN Genre Collection Database, "http://marsyas.info/download/data_sets"
9 임신철, 장세진, 이석필, 김무영, "Decorrelated Filter Bank를 이용한 음악 장르 분류 시스템," 한국음향학회지, 제30권, 제2호, 100-106쪽, 2011년 2월