Browse > Article
http://dx.doi.org/10.7776/ASK.2006.25.4.151

Audio Fingerprint Extraction Method Using Multi-Level Quantization Scheme  

Song Won-Sik (한국정보통신대학교)
Park Man-Soo (한국정보통신대학교)
Kim Hoi-Rin (한국정보통신대학교)
Abstract
In this paper, we proposed a new audio fingerprint extraction method, based on Philips' music retrieval algorithm, which uses the energy difference of neighboring filter-bank and probabilistic characteristics of music. Since Philips method uses too many filter-banks in limited frequency band, it may cause audio fingerprints to be highly sensitive to additive noises and to have too high correlation between neighboring bands. The proposed method improves robustness to noises by reducing the number of filter-banks while it maintains the discriminative power by representing the energy difference of bands with 2 bits where the quantization levels are determined by probabilistic characteristics. The correlation which exists among 4 different levels in 2 bits is not only utilized in similarity measurement. but also in efficient reduction of searching area. Experiments show that the proposed method is not only more robust to various environmental noises (street, department, car, office, and restaurant), but also takes less time for database search than Philips in the case where music is highly degraded.
Keywords
Audio Fingerprint; Probabilistic Characteristics; Quantization; Energy Difference of Neighboring Filter-Banks;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J. Herre, E. Allamanche, and O. Helimuth, 'Robust matching of audio signals using spectral flatness features, ' Proc. of Workshop on Applications of Signal Processing to Audio and Acoustics2001, IEEE, 127-130, 2001
2 E. Allamanche, J. Herre, and O. Helimuth, 'Content-based Identification of Audio Material Using MPEG-7 Low Level Description, 'Proc. of ISMIR2001, 197-204, 2001
3 Jonathan T. Foote, 'Content-Based Retrieval of Music and Audio,' Proc. of SPIE, Multimedia Storage and Archiving Systems II, 3229, 138-147, 1997
4 AudibleMagic, http://audiblemagic.com
5 ShazamEntertainment, http://www.shazam.com
6 Gracenote, http://www.gracenote.com
7 Haitsma J., Kalker T. and Oostveen J., 'Robust Audio Hashing for Content Identification,' Proc. the Content Based Multimedia Indexing2001, 2001
8 Mansoo Park et al., 'Content-based Music information Retrieval using Pitch Histogram of Band Pass Filter Signal,' Proc. of AIRS2004, 245-248, 2004
9 J.A. Haitsma and T. Kalker, 'A Highly Robust Audio Fingerprinting System,' Proc. ISMIR2002, 144-148, 2002
10 P.J.O. Doets and R.L. Lagendijk, 'Theoretical Modeling Of A Robust Audio Fingerprinting System,' Fourth IEEE Benelux Signal Processing Symposium, 2004