Automatic Music Summarization Using Similarity Measure Based on Multi-Level Vector Quantization  

Kim, Sung-Tak (The School of Engineering at Information and Communications University)
Kim, Sang-Ho (The School of Engineering at Information and Communications University)
Kim, Hoi-Rin (The School of Engineering at Information and Communications University)
Abstract
Music summarization is a technique that automatically extracts the most important and representative segments of a piece of music. In this paper, we propose and evaluate a technique that provides a repeated part of the music as its summary. To extract a repeated segment, the proposed algorithm uses a weighted sum of similarity measures based on multi-level vector quantization, and produces either a fixed-length or an optimal-length summary. Two similarity measures are proposed: a count-based measure and a distance-based measure. The count-based measure uses the number of identical codewords, and the distance-based measure uses the Mahalanobis distance between features that have the same codeword at the same position in the segments. Fixed-length summaries are evaluated by measuring the overlap ratio between manually labeled repeated parts and automatically generated ones. Optimal-length summaries are evaluated by calculating how much of the repeated parts of the music the automatically generated summary includes. In our experiments, the optimal-length summary captured the repeated parts more effectively, relative to summary length, than the fixed-length summary.
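As a rough illustration of the two similarity measures and their weighted combination, the following Python sketch compares two equal-length segments of feature vectors. It is not the authors' implementation; the abstract does not give the exact formulas. The function names, the position-wise reading of the count-based measure, the mapping 1 / (1 + d) from Mahalanobis distance to similarity, the mixing parameter `alpha`, and the per-level weights are all assumptions made for this example.

```python
import numpy as np

def quantize(features, codebook):
    """Assign each frame to its nearest codeword (Euclidean distance)."""
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.argmin(axis=1)

def count_similarity(codes_a, codes_b):
    """Assumed count-based measure: fraction of positions with the same codeword."""
    return float(np.mean(codes_a == codes_b))

def distance_similarity(feat_a, feat_b, codes_a, codes_b, inv_cov):
    """Assumed distance-based measure.

    For positions where both segments share a codeword, compute the
    Mahalanobis distance d between the two feature vectors and map it
    to a similarity 1 / (1 + d). Non-matching positions contribute 0.
    """
    match = codes_a == codes_b
    if not match.any():
        return 0.0
    diff = feat_a[match] - feat_b[match]
    sq = np.einsum("ij,jk,ik->i", diff, inv_cov, diff)
    d = np.sqrt(np.maximum(sq, 0.0))
    return float(np.sum(1.0 / (1.0 + d)) / len(codes_a))

def segment_similarity(feat_a, feat_b, codebooks, level_weights, inv_cov, alpha=0.5):
    """Weighted sum over VQ levels of the two measures (alpha is hypothetical)."""
    total = 0.0
    for codebook, weight in zip(codebooks, level_weights):
        codes_a = quantize(feat_a, codebook)
        codes_b = quantize(feat_b, codebook)
        count = count_similarity(codes_a, codes_b)
        dist = distance_similarity(feat_a, feat_b, codes_a, codes_b, inv_cov)
        total += weight * (alpha * count + (1.0 - alpha) * dist)
    return total

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    seg_a = rng.normal(size=(20, 4))          # 20 frames, 4-dim features
    seg_b = seg_a + 0.1 * rng.normal(size=(20, 4))
    # Two levels: a coarse codebook and a finer one (random, for illustration only).
    codebooks = [rng.normal(size=(2, 4)), rng.normal(size=(8, 4))]
    score = segment_similarity(seg_a, seg_b, codebooks, [0.5, 0.5], np.eye(4))
    print(f"similarity: {score:.3f}")
```

With level weights that sum to 1, this score lies between 0 and 1, and a segment compared with itself scores exactly 1. A summarizer built on such a score would slide a candidate segment over the piece and keep the segment most similar to the rest of the music.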
Keywords
Music summarization; Similarity measure; Multi-level vector quantization; Fixed-length music summary; Optimal-length music summary; Mahalanobis distance