Sequence-based Similar Music Retrieval Scheme

시퀀스 기반의 유사 음악 검색 기법

  • 전상훈 (고려대학교 전자전기공학과) ;
  • 황인준 (고려대학교 전기전자전파공학과)
  • Published : 2009.06.30

Abstract

Music evokes human emotions or creates music moods through various low-level musical features. Typical music clip consists of one or more moods and this can be used as an important criteria for determining the similarity between music clips. In this paper, we propose a new music retrieval scheme based on the mood change patterns of music clips. For this, we first divide music clips into segments based on low level musical features. Then, we apply K-means clustering algorithm for grouping them into clusters with similar features. By assigning a unique mood symbol for each cluster, we can represent each music clip by a sequence of mood symbols. Finally, to estimate the similarity of music clips, we measure the similarity of their musical mood sequence using the Longest Common Subsequence (LCS) algorithm. To evaluate the performance of our scheme, we carried out various experiments and measured the user evaluation. We report some of the results.

음악은 다양한 하위 레벨 음악 특징을 통하여 인간의 감정을 유발시키거나 음악적 무드를 만들어낸다. 보통 음악은 하나 이상의 무드로 구성되며 이것은 음악간 유사도를 결정하는 데 주요한 단서로 사용된다. 본 논문에서는 음악의 무드 변화 패턴을 기반으로 하는 새로운 음악 검색 기법을 제안한다. 이를 위해서, 우선 모든 음악에 대해 유사한 하위 레벨 특징을 가지는 세그먼트로 나누고, K-means 군집화 알고리즘을 적용하여 유사한 특징을 가지는 클러스터로 그룹화한다. 각 클러스터에 대해 유일한 무드 심볼을 정의하고 나면, 각 음악의 무드 변화 패턴은 일련의 무드 심볼 시퀀스로 표현이 가능하다. 마지막으로 음악간 유사도를 측정하기 위해서 longest common subsequence (LCS)알고리즘을 적용한다. 제안된 검색 기법의 성능을 측정하기 위해 다양한 실험과 사용자 만족도 조사를 수행하고 결과를 분석한다.

Keywords

References

  1. Sanghoon Jun, Seungmin Rho, Byeong-jun Han, Eenjun Hwang, "A Fuzzy Inference-based Music Emotion Recognition System," Visual Information Engineering, pp.673-677, Aug. 2008
  2. Hoashi, Zeitler, Inoue, "Implementation of relevance feedback for content-based music retrieval based on user preferences," ACM SIGIR, pp.385-286, 2002
  3. Hoashi, Matsumoto, Inoue, "Personalization of user profiles for content-based music retrieval based on relevance feedback," ACM Multimedia, pp.110-119, 2003
  4. Carol Krumhansl, Cognitive Foundations of Musical Pitch, Oxford University Press, 1990
  5. S. Rho, B. Han, E. Hwang and M. Kim, "MUSEMBLE: A Novel Music Retrieval System with Automatic Voice Query Transcription and Reformulation," Journal of Systems and Software, Vol. 81(7), pp. 1065-1080, 2008 https://doi.org/10.1016/j.jss.2007.05.038
  6. Hung-Chen Chen and Arbee L. P. Chen, "A Music Recommendation System Based on Music and User Grouping," Journal of Intelligent Information Systems, Vol. 24(3), pp. 113-132, 2005 https://doi.org/10.1007/s10844-005-0319-3
  7. A. Ghias, et al., "Query by humming - musical information retrieval in an audio database," Proceedings of ACM Multimedia 95, pp.231-236, 1995
  8. Lie Lu, Dan Liu, Hong-Jiang Zhang, "Automatic Mood Detection and Tracking of Music Audio Signals," IEEE Transactions on Audio, Speech and Audio Processing, vol.14, no.1, pp. 5-18, Jan. 2006 https://doi.org/10.1109/TSA.2005.860344
  9. Yazhong F,Yueting Z, Yunhe P, "Music information retrieval by detecting mood via computational media aesthetics," IEEE/WIC International Conference on, pp235-241, Oct. 2003
  10. P.N. Juslin, J.A. Sloboda, Music and Emotion: Theory and research, New York: Oxford University Press, 2001.
  11. L. Bergroth, H. Hakonen, T. Raita, "A Survey of Longest Common Subsequence Algorithms," in Proceedings of the 7th International Symposium, 2000.
  12. S. Daniel, Hirschberg, "Algorithms for the Longest Common Subsequence Problem," Journal of ACM, vol.24, no.4, Oct. 1977.
  13. R.A. Wagner and M.J. Fischer, "The String-toString Correction Problem," Journal of the ACM, vol.21, no.1, pp 168-173, Jan. 1975 https://doi.org/10.1145/321796.321811
  14. Tzanetakis, G. and P. Cook, "Musical genre classification of audio signals.," IEEE Transactions on Audio, Speech and Audio Processing, 10(5):293-302. 2002 https://doi.org/10.1109/TSA.2002.800560
  15. L.R. Rabiner, B.-H, Juang, Fundamentals of Speech Recognition, Prentice Hall, New Jersey, 1993.
  16. Young-tae Kim and Tat-Seng Chua, "Retrieval of News Video Using Video Sequence Matching," Multimedia Modelling Conference, 2005. MMM 2005. Proceedings of the 11th International, pp. 68-75, 12. Jan. 2005
  17. Daniel P.W. Ellis, Graham E. Poliner, "Identifying 'Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking," IEEE Conferences on Acoustics, Speech, and Signal Processing, vol.IV, pp.1429-1432, 2007