A Study on the Robust Content-Based Musical Genre Classification System Using Multi-Feature Clustering

Multi-Feature Clustering을 이용한 강인한 내용 기반 음악 장르 분류 시스템에 관한 연구

  • Yoon Won-Jung (Dept. of Computer Science and Statistics, Dankook University) ;
  • Lee Kang-Kyu (Dept. of Computer Science and Statistics, Dankook University) ;
  • Park Kyu-Sik (Dept. of Computer Science and Statistics, Dankook University)
  • 윤원중 (단국대학교 컴퓨터과학 및 통계학과) ;
  • 이강규 (단국대학교 컴퓨터과학 및 통계학과) ;
  • 박규식 (단국대학교 컴퓨터과학 및 통계학과)
  • Published : 2005.05.01

Abstract

In this paper, we propose a new robust content-based musical genre classification algorithm using multi-feature clustering(MFC) method. In contrast to previous works, this paper focuses on two practical issues of the system dependency problem on different input query patterns(or portions) and input query lengths which causes serious uncertainty of the system performance. In order to solve these problems, a new approach called multi-feature clustering(MFC) based on k-means clustering is proposed. To verify the performance of the proposed method, several excerpts with variable duration were extracted from every other position in a queried music file. Effectiveness of the system with MFC and without MFC is compared in terms of the classification accuracy. It is demonstrated that the use of MFC significantly improves the system stability of musical genre classification performance with higher accuracy rate.

본 논문에서는 multi-feature clustering(MFC) 방법을 이용한 강인한 내용 기반 음악 장르 분류 알고리즘을 제안한다. 기존 연구와 비교하여 본 논문에서는 입력 질의 패턴(또는 구간)과 입력 질의 길이의 변화에 따라 나타나는 불안정한 시스템 성능을 개선하는데 노력하였고, k-means clustering 기법에 기반한 multi-feature clustering(MFC)이라는 새로운 알고리즘을 제안하였다. 제안된 시스템의 성능을 검증하기 위해 질의 음악 파일의 서로 다른 여러 구간에서 질의 길이를 다변화하여 음악 특징 계수를 추출하였고, MFC 방법을 사용한 시스템과 MFC 방법을 사용하지 않은 시스템에 대한 장르 분류 성공률을 비교하여 제안 알고리즘의 성능을 비교${\cdot}$분석하였다. 모의실험 결과 MFC 방법을 사용한 시스템의 장르 분류 성공률이 높게 나타났고, 시스템의 안정성 역시 높게 나타났다.

Keywords

References

  1. E. Wold, T. Blum, D. Keislar, and J. Wheaton, 'Content-based classification, search, and retrieval of audio,' IEEE Multimedia, vol.3, no. 2, 1996 https://doi.org/10.1109/93.556537
  2. J. Foote, 'Content-based retrieval of music and audio,' in Proc.SPIE Multimedia Storage Archiving Systems II, vol. 3229, C.C.J. Kuo et al., Eds., 1997, pp. 138-147
  3. G. Tzanetakis and P. Cook, 'Musical genre classification of audio signals,' IEEE Trans. on Speech and Audio Processing, vol. 10, no. 5, pp. 293-302, July 2002 https://doi.org/10.1109/TSA.2002.800560
  4. T. Li, M. Ogihara and Q. Li, 'A comparative study on content-based music genre classification,' in Proc. of the 26th annual internal ACM SIGIR, pp. 282-289, ACM Press, July 2003 https://doi.org/10.1145/860435.860487
  5. D. N. Jiang, L. Lu and H. J. Zhang, 'Music type classification by spectra contrast features,' in Proc. ICME2002, 2002, pp. 113-116 https://doi.org/10.1109/ICME.2002.1035731
  6. J. J. Burred and A. Lerch, 'A hierarchical approach to automatic musical genre classification,' in Proc. DAFx03, 2003, pp. 308-311
  7. G. Guo and S. Z. Li, 'Content-based audio classification and retrieval by support vector machine,' IEEE Trans. on neural networks, vol. 14, no. 1, pp. 209-215, Jan. 2003 https://doi.org/10.1109/TNN.2002.806626
  8. J. Foote et al, 'An overview of audio information retrieval,' ACM-Springer Multimedia Systems, vol. 7, no. 1, pp. 2-11, Jan. 1999 https://doi.org/10.1007/s005300050106
  9. Y. Wang, Z. Liu and J. Huang, 'Multimedia content analysis: using both audio and visual clues,' IEEE Signal Proc. Mag., Nov. 2000 https://doi.org/10.1109/79.888862
  10. S. Blackburn, 'Content based retrieval and navigation of music, 1999, Mini-thesis, University of Southampton
  11. S. Z. Li, 'Content-based classification and retrieval audio using the nearest feature line method,' IEEE Trans. on Speech Audio Processing, vol. 8, pp.619-625, Sept. 2000 https://doi.org/10.1109/89.861383
  12. T. Zhang and C. Kuo, 'Hierarchical system for content-based audio classification and retrieval,' Proceedings of SPIE98, vol.3527, pp. 398-409, Boston, Nov., 1998 https://doi.org/10.1117/12.325832
  13. M. Liu and C. Wan, 'A study on content-based classification retrieval of audio database,' Proc. of the International Database Engineering & Applications Symposium, pp. 339-345. 2001 https://doi.org/10.1109/IDEAS.2001.938102
  14. R. Duda, P. Hart and D. Stork, Pattern Classification, 2nd Ed., Wiley-Interscience Publication, 2001