[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5392/IJoC.2015.11.4.001

Conjoined Audio Fingerprint based on Interhash and Intra hash Algorithms

Kim, Dae-Jin (Electronic Engineering Daejin University)
Choi, Hong-Sub (Electronic Engineering Daejin University)

Publication Information

International Journal of Contents / v.11, no.4, 2015, pp. 1-6

Abstract

In practice, the most important performance parameters for a music information retrieval (MIR) service are the robustness of the fingerprint in real noise environments and the recognition accuracy when query clips are matched against an entry in the database. To satisfy these conditions, we propose a conjoined fingerprint algorithm for use in massive MIR services. The conjoined fingerprint scheme uses interhash and intrahash algorithms to produce a fingerprint that is robust in real noise environments. Because the interhash and intrahash algorithms are masked in the predominant pitch estimation, a compact fingerprint can be produced through their relationship. Experimental comparisons showed that our algorithm outperformed existing algorithms, namely the sub-mask and Philips algorithms, in real noise environments.

Keywords

Music Information Retrieval; Conjoined Fingerprint; Interhash; Intrahash

Reference

1	Wooram Son, Hyun-Tae Cho, Kyoungro Yoon, and Seok-Pil Lee, “Sub-fingerprint Masking for a Robust Audio Fingerprinting System in Real-noise Environment for Portable Consumer Devices,” IEEE Transactions on Consumer Electronics, vol. 56, no. 1, 2010, pp. 156-160.
2	J. Song, S. Bae, and K. Yoon, “Mid-level music melody representation of polyphonic audio for query-by-humming system,” International Symposium on Music Information Retrieval, 2002.
3	J. Song, S. Bae, and K. Yoon, “Query by humming: matching humming query to polyphonic audio,” IEEE International Conference on Multimedia and Expo, 2002.
4	J. Chen, K. Paliwal, and S. Nakamura, “Cepstrum derived from Differentiated Power Spectrum for Robust Speech Recognition,” Speech Communication, vol. 41, 2003, pp. 469-484.
5	H.-Y. Jung, “Filtering of Filter-Bank Energies for Robust Speech Recognition,” ETRI Journal, vol. 26, no. 3, 2004, pp. 273-276.
6	P. Cano, E. Batlle, T. Kalker, and J. Haitsma, “A Review of Audio Fingerprinting,” J. VLSI Signal Processing Systems for Signal Image Video Technology, vol. 41, no. 3, 2005, pp. 271-284.
7	J. Haitsma and T. Kalker, “A Highly Robust Audio Fingerprinting System,” Proc. of the 3rd Int. Symposium on Music Information Retrieval, 2002, pp. 144-148.
8	Mansoo Park, Hoi-Rin Kim, and Seung Hyun Yang, “Frequency Temporal Filtering for a Robust Audio Fingerprinting Scheme in Real-Noise Environments,” ETRI Journal, vol. 28, no. 4, 2006, pp. 509-512.