Browse > Article
http://dx.doi.org/10.9717/kmms.2012.15.1.018

Design and Implementation of Matching Engine for QbSH System Based on Polyphonic Music  

Park, Sung-Joo (전자부품연구원 디지털미디어연구센터)
Chung, Kwang-Sue (광운대학교 전자통신공학과)
Publication Information
Abstract
This paper proposes a matching engine of query-by-singing/humming (QbSH) system which retrieves the most similar music information by comparing the input data with the extracted feature information from polyphonic music like MP3. The feature sequences transcribed from polyphonic music may have many errors. So, to reduce the influence of errors and improve the performance, the chroma-scale representation, compensation and asymmetric DTW (Dynamic Time Warping) are adopted in the matching engine. The performance of various distance metrics are also investigated in this paper. In our experiment, the proposed QbSH system achieves MRR (Mean Reciprocal Rank) of 0.718 for 1000 singing/humming queries when searching from a database of 450 polyphonic musics.
Keywords
QbSH; Matching Engine; Polyphonic Music; Feature Extraction;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 A. Uitdenbogerd and J. Zobel, "Melodic Matching Techniques for Large Music Database," Proc. ACM Int. Conf. on Multimedia, pp. 57-66, 1999.
2 Haus, G. and Pollstri. E, "An Audio Front end for Query-by-Humming Systems," Proc. Int. Symp. Music Information Retrieval , pp. 65-72, 2001.
3 I. Cohen, "Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging," IEEE Trans. on Speech and Audio Processing, Vol.11, No.5, pp. 466-475, 2003.   DOI   ScienceOn
4 김기출, 박성주, 이석필, 김무영, "선형 보간법을 이용한 시간과 주파수 조합영역에서의 피치 추정 방법," 전자공학회논문지, 제47권, 제5호, pp. 100-108, 2010.
5 윤제열, 이석필, 서경학, 박호종, "하모닉 구조를 이용한 다성 음악의 주요 멜로디 검출," 전자공학회논문지, 제47권, 제5호, pp. 109-116, 2010.
6 H. M. Yu, W. H. Tsai, and H. M. Wang, "A Query by-Singing System for Retrieving Karaoke Music," IEEE Trans. on Multimedia, Vol.10, No.8, pp. 1626-1637, 2008.   DOI   ScienceOn
7 J. S. R. Jang and H. R. Lee, "A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming," IEEE Trans. on Audio, Speech, and Language Processing, Vol.16, No.2, pp. 350-358, 2008.   DOI   ScienceOn
8 Y. Zhu and D. Shasha, "Warping Indexes with Envelope Transforms for Query by Humming," Proc. Int. Conf. on Management of Data, pp. 181-192, 2003.
9 X. Nguyen, M. J. Wainwright, and M. I. Jordan, "On Divergences, Surrogate Loss Functions and Decentralized Detection Department of Statistics," Tech. Rep. 695, Dept of Statistics, Univ. of California at Berkeley, 2005.
10 J. S. R. Jang, N. J. Lee, and C. L. Hsu, "Simple But Effective Methods for QbSH at MIREX 2006," Proc. Int. Symp. Music Information Retrieval , pp. 5-7, 2006.
11 M. Ryynanen and A. Klapuri, "Query by Humming of MIDI and Audio using Locality Sensitive Hashing," Proc. Int. Conf. Acoustic, Speech and Signal Processing, pp. 2249-2252, 2008.
12 A. Duda, A. N¨urnberger, and S. Stober, "Towards Query by Humming/Singing on Audio Databases," Proc. Int. Symp. Music Information Retrieval, pp. 331-334, 2007.
13 A. Ghias, J Logan, and D Chamberlin, "Query by Humming : Musical Information Retrieval in an Audio Database," Proc. ACM Int. Conf. on Multimedia, pp. 231-236, 1995.
14 G. Tzanetakis, "Automatic Genre Classification of Audio Signals," IEEE Trans. on Speech and Audio Processing, Vol.10, No.5, pp. 293-302, 2001.
15 D. Jang, M. Jin, and C. D. Yoo, "Music Genre Classification using Novel Features and a Weighted Voting Method," Proc. Int. Conf. on Multimedia and Expo, pp. 1377-1380, 2008.
16 G. Poliner, D. Ellis, A. Ehmann, E. Gomez, S. Streich, and B. Ong, "Melody Transcription from Music Audio: Approaches and Evaluation," IEEE Trans. on Audio, Speech, Language Processing, Vol.15, No.4, pp. 1247-1256, 2007.   DOI   ScienceOn
17 S. Jo and C. D. Yoo, "Melody Extraction from Polyphonic Audio Based on Particle Filter," Proc. Int. Symp. Music Information Retrieval, pp. 357-362, 2010.
18 D. P. W. Ellis and G. E. Poliner, "Identifying Cover Songs with Chroma Features and Dynamic Programming Beat Tracking," Proc. Int. Conf. Acoustic, Speech and Signal Processing, Vol.4, pp. 1429-1432, 2007.
19 S. W. Hainsworth and M. D. Macleod, "Particle Filtering Applied to Musical Tempo Tracking," EURASIP Journal on Applied Signal Processing, Vol.2004, Issue15, pp. 2385-2395, 2004.   DOI
20 J. S. Seo, M. Jin, S. Lee, D. Jang, S. Lee, and C. D. Yoo, "Audio Fingerprinting Based on Normalized Spectral Subband Moments," IEEE Signal Processing Letters, Vol.13, Issue4, pp. 209-212, 2006.   DOI   ScienceOn
21 허태관, 조황원, 남기표, 이재현, 이석필, 박성주, 박강령, "내용 기반 음원 검출 시스템 구현 에 관한 연구," 한국멀티미디어학회논문지, 제 12권, 제11호, pp. 1581-1592, 2009.
22 H. Sakoe and S. Chiba, "Dynamic Programming Algorithm Optimization for Spoken Word Recognition," IEEE Trans. on Acoustics, Speech and Signal Processing, Vol.ASSP-26, No.1, pp. 43-49, 1978.