DOI QR코드

DOI QR Code

A Comparative Analysis of Music Similarity Measures in Music Information Retrieval Systems

  • Gurjar, Kuldeep (Dept. of Computer Science, Kangwon National University) ;
  • Moon, Yang-Sae (Dept. of Computer Science, Kangwon National University)
  • 투고 : 2017.11.13
  • 심사 : 2017.12.18
  • 발행 : 2018.02.28

초록

The digitization of music has seen a considerable increase in audience size from a few localized listeners to a wider range of global listeners. At the same time, the digitization brings the challenge of smoothly retrieving music from large databases. To deal with this challenge, many systems which support the smooth retrieval of musical data have been developed. At the computational level, a query music piece is compared with the rest of the music pieces in the database. These systems, music information retrieval (MIR systems), work for various applications such as general music retrieval, plagiarism detection, music recommendation, and musicology. This paper mainly addresses two parts of the MIR research area. First, it presents a general overview of MIR, which will examine the history of MIR, the functionality of MIR, application areas of MIR, and the components of MIR. Second, we will investigate music similarity measurement methods, where we provide a comparative analysis of state of the art methods. The scope of this paper focuses on comparative analysis of the accuracy and efficiency of a few key MIR systems. These analyses help in understanding the current and future challenges associated with the field of MIR systems and music similarity measures.

키워드

E1JBB0_2018_v14n1_32_f0001.png 이미지

Fig. 1. Functionality of an MIR system.

E1JBB0_2018_v14n1_32_f0002.png 이미지

Fig. 2. Mechanism of feature extraction process.

E1JBB0_2018_v14n1_32_f0003.png 이미지

Fig. 3. Tree based representation of musical features.

E1JBB0_2018_v14n1_32_f0004.png 이미지

Fig. 4. Music genre classification.

Table 1. Comparative description of similarity measures

E1JBB0_2018_v14n1_32_t0001.png 이미지

Table 2. Application areas of MIR

E1JBB0_2018_v14n1_32_t0002.png 이미지

Table 3. Accuracy results from Robine et al. [30] during MIREX 2005

E1JBB0_2018_v14n1_32_t0003.png 이미지

참고문헌

  1. J. S. Downie, "Music information retrieval," Annual Review of Information Science and Technology, vol. 37, no. 1, pp. 295-340, 2003.
  2. M. A. Casey, R. Veltkamp, M. Goto, M. Leman, C. Rhodes, and M. Slaney, "Content-based music information retrieval: current directions and future challenges," Proceedings of the IEEE, vol. 96, no. 4, pp. 668- 696, 2008. https://doi.org/10.1109/JPROC.2008.916370
  3. M. Kassler, "Toward musical information retrieval," Perspectives of New Music, vol. 4, no. 2, pp. 59-67, 1966. https://doi.org/10.2307/832213
  4. M. Kassler, "MIR: a simple programming language for musical information retrieval," in The Computer and Music. Ithaca, NY: Cornell University Press, 1970, pp. 299-327.
  5. H. B. Lincoln, "Some criteria and techniques for developing computerized thematic indices," in Elektronische Datenverarbeitung in der Musikwissenschaft. Regensburg, Germany: Gustave Bosse Verlag, 1967, pp. 57-62.
  6. D. Deutsch, "Octave generalization and tune recognition," Perception & Psychophysics, vol. 11, no. 6, pp. 411- 412, 1972. https://doi.org/10.3758/BF03206280
  7. D. A. Byrd, "Music notation by computer," Ph.D. dissertation, Indiana University, Ann Arbor, MI, 1984.
  8. M. Mongeau and D. Sankoff, "Comparison of musical sequences," Computers and the Humanities, vol. 24, no. 3, pp. 161-175, 1990. https://doi.org/10.1007/BF00117340
  9. J. Foote, "An overview of audio information retrieval," Multimedia Systems, vol. 7, no. 1, pp. 2-10, 1999. https://doi.org/10.1007/s005300050106
  10. D. Byrd and T. Crowford, "Problems of music information retrieval in the real world," Information Processing & Management, vol. 38, no. 2, pp. 249-272, 2002. https://doi.org/10.1016/S0306-4573(01)00033-4
  11. J. S. Downie, "Evaluating a simple approach to music information retrieval: conceiving melodic n-grams as text," Ph.D. dissertation, University of Western Ontario, London, Canada, 1999.
  12. A. Ghias, J. Logan, D. Chamberlin, and B. C. Smith, "Query by humming: musical information retrieval in an audio database," in Proceedings of the 3rd ACM International Conference on Multimedia, San Francisco, CA, 1995, pp. 231-236.
  13. A. Freed, "Music metadata quality: a multiyear case study using the music of Skip James," in Audio Engineering Society Convention 121. New York, NY: Audio Engineering Society, 2006, pp. 1314-1325.
  14. M. C. Jones, J. S. Downie, and A. F. Ehmann, "Human similarity judgments: implications for the design of formal evaluations," in Proceedings of 8th International Conference on Music Information Retrieval (ISMIR 2007), Vienna, Austria, 2007, pp. 539-542.
  15. M. Schedl, A. Flexer, and J. Urbano, "The neglected user in music information retrieval research," Journal of Intelligent Information Systems, vol. 41, no. 3, pp. 523-539, 2013. https://doi.org/10.1007/s10844-013-0247-6
  16. J. Urbano, J. Morato, M. Marrero, and D. Martin, "Crowdsourcing preference judgments for evaluation of music similarity tasks," in Proceedings of the 1st ACM SIGIR Workshop on Crowdsourcing for Search Evaluation, Geneva, Switzerland, 2010, pp. 9-16.
  17. U. M. Julian, "Evaluation in audio music similarity," Ph.D. dissertation, Charles III University of Madrid, Madrid, Spain, 2013.
  18. J. S. Downie, "The scientific evaluation of music information retrieval systems: foundations and future," Computer Music Journal, vol. 28, no. 2, pp. 12-23, 2004. https://doi.org/10.1162/014892604323112211
  19. R. J. Demopoulos and M. J. Katchabaw, "Music information retrieval: a survey of issues and approaches," Department of Computer Science, University of Western Ontario, London, Canada, Technical Report #677, 2007.
  20. M. Casey and T. Crawford, "Automatic location and measurement of ornaments in audio recordings," in Proceedings of 5th International Conference on Music Information Retrieval (ISMIR 2004), Barcelona, Spain, 2004, pp. 311-317.
  21. T. Lambrou, P. Kudumakis, R. Speller, M. Sandler, and A. Linney, "Classification of audio signals using statistical features on time and wavelet transform domains," in Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, Seattle, WA, 1998, pp. 3621-3624.
  22. T. Zhang and C. C. J. Kuo, "Audio content analysis for online audiovisual data segmentation and classification," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 4, pp. 441-457, 2001. https://doi.org/10.1109/89.917689
  23. G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Transactions on Speech and Audio Processing, vol. 10, no. 5, pp. 293-302, 2002. https://doi.org/10.1109/TSA.2002.800560
  24. R. Typke, M. den Hoed, J. de Nooijer, F. Wiering, and R. C. Veltkamp, "A ground truth for half a million musical incipits," in Proceedings of the 5th Dutch-Belgian Information Retrieval Workshop, Utrecht, The Netherlands, 2005, pp. 63-70.
  25. F. Alias, J. C. Socoro, and X. Sevillano, "A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds," Applied Sciences, vol. 6, no. 5, article no. 143, 2016.
  26. D. Mitrovic, M. Zeppezauer and C. Breiteneder, "Feature of content-based retrieval systems" in Advances in Computers: Improving the Web. London, UK: Academic, 2010, pp. 71-150.
  27. D. Gusfield, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge, UK: Cambridge University Press, 1997, pp. 91-95.
  28. S. Needleman and C. D. Wunsch, "A general method applicable to the search for similarities in the amino acid sequence of two proteins," Journal of Molecular Biology, vol. 48, no. 3, pp. 443-453, 1970. https://doi.org/10.1016/0022-2836(70)90057-4
  29. D. Sankoff and J. B. Kruskal, Time Warps, Strings Edits, and Macromolecules: The Theory and Practice of Sequence Comparison. Reading, MA: Addison-Wesley, 1983.
  30. M. Robine, P. Hanna, and P. Ferraro, "Music similarity: improvements of edit-based algorithms by considering music theory," in Proceedings of the International Workshop on Multimedia Information Retrieval, Bavaria, Germany, 2007, pp. 135-142.
  31. K. Lemstrom, P. Laine, and S. Perttu, "Using relative interval slope in music information retrieval," in Proceedings of the International Conference on Computer Music, Beijing, China, 1999, pp. 317-320.
  32. R. J. McNab, L. A. Smith, I. H. Witten, C. I. Henderson, and S. J. Cunningham, "Towards the digital music library: tune retrieval from acoustic input," in Proceedings of the first ACM International Conference on Digital Libraries, Bethesda, MD, 1996, pp. 11-18.
  33. V. I. Levenshtein, "Binary codes capable of correcting deletions, insertions, and reversals," Soviet Physics Doklady, vol. 10, no. 8, pp. 707-710, 1966.
  34. R. A. Wagner and M. J. Fischer, "The string-to-string correction problem," Journal of the ACM, vol. 21, no. 1, pp. 168-173, 1974. https://doi.org/10.1145/321796.321811
  35. T. F. Smith and M. S. Waterman, "Identification of common molecular subsequences," Journal of Molecular Biology, vol. 147, no.1, pp. 195-197, 1981. https://doi.org/10.1016/0022-2836(81)90087-5
  36. Y. Zhu and D. Shasha, "Warping indexes with envelope transforms for query by humming," in Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, CA, 2003, pp. 181-192.
  37. M. J. Dovey, "An algorithm for locating polyphonic phrases within a polyphonic musical piece," in Proceedings of the Artificial Intelligence and Simulation of Behavior (AISB)'99 Symposium on Musical Creativity, Edinburgh, UK, 1999, pp. 48-53.
  38. E. Ukkonen, K. Lemstrom, and V. Makinen, "Geometric algorithms for transposition invariant contentbased music retrieval," in Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR 2003), Baltimore, MD, 2003, pp. 193-199.
  39. R. Typke, R. C. Veltkamp, and F. Wiering, "Searching notated polyphonic music using transportation distances," in Proceedings of the 12th Annual ACM International Conference on Multimedia, New York, NY, 2004, pp. 128-135.
  40. Y. Rubner, C. Tomasi, and L. J. Guibas, "The earth mover's distance as a metric for image retrieval," International Journal of Computer Vision, vol. 40, no. 2, pp. 99-121, 2001. https://doi.org/10.1023/A:1026543900054
  41. N. J. Bryan, G. J. Mysore, and G. Wang, "Source separation of polyphonic music with interactive userfeedback on a piano roll display," in Proceedings of the 14th International Conference on Music Information Retrieval (ISMIR 2013), Curitiba, Brazil, 2013, pp. 119-124.
  42. S. Doraisamy and S. Ruger, "Robust polyphonic music retrieval with n-grams," Journal of Intelligent Information systems, vol. 21, no. 1, pp. 53-70, 2003. https://doi.org/10.1023/A:1023553801115
  43. S. Downie and M. Nelson "Evaluation of a simple and effective music information retrieval method," in Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, 2000, pp. 73-80.
  44. R. Typke, P. Giannopoulos, R. C. Veltkamp, F. Wiering, and R. van Oostrum, "Using transportation distances for measuring melodic similarity," in Proceedings of the 4th International Conference on Music Information Retrieval (ISMIR 2003), Baltimore, MD, 2003, pp. 193-199.
  45. R. Typke, R. C. Veltkamp, and F. Wiering, "A measure for evaluating retrieval techniques based on partially ordered ground truth lists," in Proceedings of 2006 IEEE International Conference on Multimedia and Expo, Toronto, Canada, 2006, pp. 1793-1796.
  46. C. Dodge and T. Jerse, Computer Music: Synthesis, Composition, and Performance. New York, NY: Schirmer Books, 1997, pp. 361-368.
  47. G. R. Grimmett and D. R. Stirzaker, Probability and Random Processes. Oxford, UK: Oxford University Press, 2001, pp. 245-288.
  48. H. H. Hoos, K. Renz, and M. Gorg, "GUIDO/MIR-an experimental musical information retrieval system based on GUIDO music notation," in Proceedings of the 2nd International Conference on Music Information Retrieval (ISMIR 2001), Bloomington, IN, 2001, pp. 41-50.
  49. R. C. Veltkamp, F. Wiering, and R. Typke, "Content based music retrieval," in Encyclopedia of Multimedia, Boston, MA: Springer, 2008, pp. 97-98.
  50. D. Mullensiefen and K. Frieler, "Melodic similarity: approaches and applications," in Proceedings of the 8th International Conference on Music Perception and Cognition, Evanston, IL, 2004.
  51. P. Foster, M. Mauch, and S. Dixon, "Sequential complexity as a descriptor for musical similarity," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 12, pp. 1965-1977, 2014. https://doi.org/10.1109/TASLP.2014.2357676
  52. M. W. Park and E. C. Lee, "Similarity measurement method between two songs by using the conditional euclidean distance," WSEAS Transactions on Information Science and Applications, vol. 10, no. 12, pp. 381- 388, 2013.
  53. D. Mullensiefen and K. Frieler, "Cognitive adequacy in the measurement of melodic similarity: algorithmic vs. human judgments," Computing in Musicology, vol. 13, no. 2003, pp. 147-176, 2004.
  54. P. Ferraro and P. Hanna, "Optimizations of local edition for evaluating similarity between monophonic musical sequences," in Proceeding RIAO 2007 Large Scale Semantic Access to Content (Text, Image, Video, and Sound), Pittsburgh, PA, 2007, pp. 64-69.
  55. Y. Tang, U. Leong Hou, Y. Cai, N. Mamoulis, and R. Cheng, "Earth mover's distance based similarity search at scale," Proceedings of the VLDB Endowment, vol. 7, no. 4, pp. 313-324, 2013.
  56. J. Lijffijt, P. Papapetrou, J. Hollmen, and V. Athitsos, "Benchmarking dynamic time warping for music retrieval," in Proceedings of the 3rd International Conference on PErvasive Technologies Related to Assistive Environments, Samos, Greece, 2010, pp. 1-7.
  57. S. Doraisamy, "Polyphonic music retrieval: the N-gram approach," Ph.D. dissertation, University of London, London, UK, 2004.
  58. A. Uitdenbogered and J. Zobel, "Melodic matching techniques for large music databases," in Proceedings of the 7th ACM international conference on Multimedia, Orlando, FL, 1999, pp. 56-66.
  59. G. Tzanetakis, "Manipulation, analysis and retrieval systems for audio signals," Ph.D. dissertation, Princeton University, Princeton, NJ, 2002.
  60. G. Peeters, "A large set of audio set for sound description (similarity and classification) in the CUIDADO project," 2004 [Online]. Available: http://www.citeulike.org/group/1854/article/1562527.
  61. P. Knees, T. Pohle, M. Schedl, and G. Widmer, "A music search engine built upon audio-based and webbased similarity measures," in Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Amsterdam, The Netherlands, 2007, pp. 447-554.
  62. D. Turnbull, L. Barrington, M. Yazdani, and G. Lanckriet, "Combining audio content and social context for semantic music discovery," in Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, MA, 2009, pp. 387-394.
  63. E. J. Isaacson, "Music IR for music theory," in Proceedings of the Workshop on the Creation of Standardized Test Collections, Tasks, and Metrics for MIR and MDL Evaluation, Portland, OR, 2002, pp. 23-26.
  64. C. Laurier, O. Meyers, J. Serra, M. Blech, P. Herrera, and X. Serra, "Indexing music by mood: design and integration of an automatic content-based annotator," Multimedia Tools and Applications, vol. 48, no. 1, pp. 161-184, 2010. https://doi.org/10.1007/s11042-009-0360-2
  65. Y. E. Kim, E. M. Schmidt, R. Migneco, B. G. Morton, P. Richardson, J. J. Scott, J. A. Speck, and D. Turnbull, "Music emotion recognition: a state of the art review," in Proceedings of the 11th International Conference on Music Information Retrieval (ISMIR 2010), Utrecht, The Netherlands, 2010, pp. 255-266.
  66. A. Wang, "An industrial strength audio search algorithm," in Proceedings of 4th International Conference on Music Information Retrieval (ISMIR 2013), Baltimore, MD, 2003, pp. 7-13.
  67. C. S. Dhir, J. Lee, and S. Y. Lee, "Extraction of independent discriminant features for data with asymmetric distribution," Knowledge and Information Systems, vol. 30, no. 2, pp. 359-375, 2012. https://doi.org/10.1007/s10115-011-0381-9
  68. R. B. Dannenberg, W. P. Birmingham, B. Pardo, N. Hu, C. Meek, and G. Tzanetakis, "A comparative evaluation of search techniques for query-by-humming using the MUSART testbed," Journal of the American Association for Information Science and Technology, vol. 58, no. 5, pp. 687-701, 2007. https://doi.org/10.1002/asi.20532
  69. J. Salamon, J. Serra, and E. Gomez, "Tonal representations for music retrieval: from version identification to query-by-humming," International Journal of Multimedia Information Retrieval, vol. 2, no. 1, pp. 45-58, 2013. https://doi.org/10.1007/s13735-012-0026-0
  70. J. Serra, E. Gomez, and P. Herrera, "Audio cover song identification and similarity: background, approaches, evaluation, and beyond," in Advances in Music Information Retrieval, Berlin, Germany: Springer, 2010, pp. 307-332.
  71. H. Kirchhoff and A. Lerch, "Evaluation of features for audio-to-audio alignment," Journal of New Music Research, vol. 40, no. 1, pp. 27-41, 2011. https://doi.org/10.1080/09298215.2010.529917
  72. S. Dixon and G. Widmer, "Match: a music alignment tool chest," in Proceedings of 6th International Conference on Music Information Retrieval (ISMIR 2005), London, UK, 2005, pp. 492-497.
  73. M. Muller, H. Mattes, and F. Kurth, "An efficient multiscale approach to audio synchronization," in Proceedings of 7th International Conference on Music Information Retrieval (ISMIR 2006), Victoria, Canada, 2006, pp.192-197.
  74. J. Lee, S. Park, S. Jo, and C. D. Yoo, "Music plagiarism detection system," in Proceedings of the 26th International Technical Conference on Circuits/Systems, Computers and Communications, Gyeongju, Korea, 2011, pp. 828-830.
  75. C. Dittmar, K. F. Hildebrand, D. Gaertner, M. Winges, F. Muller, and P. Aichroth, "Audio forensics meets music information retrieval: a toolbox for inspection of music plagiarism," in Proceedings of the 20th European Signal Processing Conference (EUSIPCO), Bucharest, Romania, 2012, pp. 1249-1253.
  76. J. I. Park, S. W. Kim, M. Shin, "Music plagiarism detection using melody databases," in Proceedings of the 9th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, Melbourne, Australia, 2005, pp. 684-693.
  77. A. Saracoglu, E. Esen, T. K. Ates, B. O. Acar, U. Zubari, E. C. Ozan, E. Ozalp, A. A. Alatan, and T. Ciloglu, "Content based copy detection with coarse audio-visual fingerprints," in Proceedings of the 7th International Workshop on Content-Based Multimedia Indexing, Chania, Greece, 2009, pp. 213-218.
  78. M. Schedl, E. Gomez, and J. Urbano, "Music information retrieval: recent developments and applications," Foundations and Trends in Information Retrieval, vol. 8, no. 2-3, pp. 127-261, 2014. https://doi.org/10.1561/1500000042
  79. R. Burke, "Hybrid recommender systems: survey and experiments," User Modeling and User-Adapted Interaction, vol. 12, no. 4, pp. 331-370, 2002. https://doi.org/10.1023/A:1021240730564
  80. Y. Song, S. Dixon, and M. Pearce, "A survey of music recommendation systems and future perspectives," in Proceedings of the 9th International Symposium on Computer Music Modelling and Retrieval (CMMR 2012), London, UK, 2012, pp. 395-410.
  81. L. Baltrunas, M. Kaminskas, B. Ludwig, O. Moling, F. Ricci, A. Aydin, K. H. Luke, and R. Schwaiger, "InCarMusic: context-aware music recommendations in a car," in Proceedings of the International Conference on Electronic Commerce and Web Technologies, Toulouse, France, 2011, pp. 89-100.
  82. Y. C. Zhang, D. O. Seaghdha, D. Quercia, and T. Jambor, "Auralist: introducing serendipity into music recommendation," in Proceedings of the 5th ACM International Conference on Web Search and Data Mining, Seattle, WA, 2012, pp. 13-22.
  83. M. Schedl, D. Hauger, and D. Schnitzer, "A model for serendipitous music retrieval," in Proceedings of the 2nd Workshop on Context-Awareness in Retrieval and Recommendation, Lisbon, Portugal, 2012, pp. 10-13.
  84. M. Schedl and D. Schnitzer, "Hybrid retrieval approaches to geospatial music recommendation," in Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, 2013, pp. 793-796.
  85. D. Perrot and R. O. Gjerdigen, "Scanning the dial: an exploration of factors in the identification of musical style," in Proceedings of the Society for Music Perception and Cognition, Evanston, IL, 1999.
  86. F. Pachet and D. Cazaly, "A taxonomy of musical genres," in Proceedings of RIAO 2000: Content-Based Multimedia Information Access, Paris, France, 2000, pp. 1238-1245.
  87. G. Tzanetakis and P. Cook, "Marsyas: a framework for audio analysis," Organised Sound, vol. 4, no. 3, pp. 169-175, 2000. https://doi.org/10.1017/S1355771800003071
  88. C. N. Silla, A. L. Koerich, and C. A. A. Kaestner, "A machine learning approach to automatic music genre classification," Journal of the Brazilian Computer Society, vol. 14, no. 3, pp. 7-18, 2008. https://doi.org/10.1007/BF03192568
  89. T. Pohle, P. Knees, M. Schedl, E. Pampalk, and G. Widmer, "Reinventing the wheel: a novel approach to music player interfaces," IEEE Transactions on Multimedia, vol. 9, no. 3, pp. 567-575, 2007. https://doi.org/10.1109/TMM.2006.887991
  90. D. Schnitzer, T. Pohle, P. Knees, and G. Widmer, "One-touch access to music on mobile devices," in Proceedings of the 6th International Conference on Mobile and Ubiquitous Multimedia, Oulu, Finland, 2007, pp. 103-109.
  91. N. Koenigstein and Y. Shavitt, "Song ranking based on piracy in peer-to-peer networks," in Proceedings of 10th International Conference on Music Information Retrieval (ISMIR 2009), Kobe, Japan, 2009, pp. 633-638.
  92. M. Schedl, T. Pohle, N. Koenigstein, and P. Knees, "What's hot? Estimating country-specific artist popularity," in Proceedings of 11th International Conference on Music Information Retrieval (ISMIR 2010), Utrecht, The Netherlands, 2010, pp. 117-122.
  93. R. Dhanaraj and B. Logan, "Automatic prediction of hit songs," in Proceedings of 6th International Conference on Music Information Retrieval (ISMIR 2005), London, UK, 2005, pp. 488-491.
  94. I. Karydis, A. Gkiokas, and V. Katsouros, "Musical track popularity mining dataset," in Proceedings of the 12th International Conference on Artificial Intelligence Applications and Innovations, Thessaloniki, Greece, 2016, pp. 562-572.
  95. X. Serra, M. Magas, E. Benetos, M. Chudy, S. Dixon, A. Flexer, E., et al, Roadmap for Music Information ReSearch. London, UK: MIRES Consortium, 2013.
  96. Y. H. Yang and H. H. Chen, Music Emotion Recognition. Boca Raton, FL: CRC Press, 2011.
  97. P. Cano, E. Batlle, E. Gomez, L. de C. T. Gomes, and M. Bonnet, "Audio fingerprinting: concepts and applications," in Computational Intelligence for Modelling and Prediction. Berlin, Germany: Springer Science and Business Media, 2005, pp. 233-245.
  98. T. Kageyama, K. Mochizuki, and Y. Takashima, "Melody retrieval with humming," in Proceedings of the International Computer Music Conference, Tokyo, Japan, 1993, pp. 349-351.
  99. N. Kosugi, Y. Nishihara, T. Sakata, M. Yamamuro, and K. Kushima, "A practical query-by-humming system for a large music database," in Proceedings of the 8th ACM international conference on Multimedi, Los Angeles, CA, 2000, pp. 333-342.
  100. T. B. Mahieux and D. P. W. Ellis, "Large-scale cover song recognition using the 2D Fourier transform magnitude," in Proceedings of 13th International Conference on Music Information Retrieval (ISMIR 2012), Porto, Portugal, 2012.
  101. M. P. Ryynanen and A. P. Klapuri, "Automatic transcription of melody, bass line, and chords in polyphonic music," Computer Music Journal, vol. 32, no. 3, pp. 72-86, 2008. https://doi.org/10.1162/comj.2008.32.3.72
  102. K. Kashino, K. Nakadai, T. Kinoshita, and H. Tanaka, "Application of Bayesian probability network to music scene analysis," Computational Auditory Scene Analysis, vol. 1, no. 998, pp. 1-15, 1995.
  103. F. Pachet and P. Roy, "Hit song science is not yet a science," in Proceedings of 9th International Conference on Music Information Retrieval (ISMIR 2008), Philadelphia, PA, 2008, pp. 355-360.
  104. J. S. Downie, K. West, A. Ehmann, and E. Vincent, "The 2005 Music Information Retrieval Evaluation eXchange (MIREX 2005): Preliminary Overview," in Proceedings of 6th International Conference on Music Information Retrieval, London, UK, 2005, pp. 320-323.
  105. M. D. Ferreira, D. C. Correa, M. A. Grivet, G. T. dos Santos, R. F. de Mello, and L. G. Nonato, "On accuracy and time processing evaluation of cover song identification systems," Journal of New Music Research, vol. 45, no. 4, pp. 333-342, 2016. https://doi.org/10.1080/09298215.2016.1249490
  106. R. Scholz, G. Ramalho, and G. Cabral, "Cross task study on MIREX recent results: an index for evolution measurement and some stagnation hypothesis," in Proceedings of 17th International Conference on Music Information Retrieval (ISMIR 2016), New York, NY, 2016, pp. 372-378.
  107. K. Lemstrom and J. Tarhio, "Searching monophonic patterns within polyphonic sources," in Proceedings of RIAO 2000: Content-Based Multimedia Information Access, Paris, France, 2000, pp. 1261-1279.
  108. M. Grachten, J. L. Arcos, and R. L. de Mantaras, "Melodic similarity: looking for a good abstraction level," in Proceedings of 5th International Conference on Music Information Retrieval (ISMIR 2004), Barcelona, Spain, 2004, pp. 210-215.
  109. N. Orio, "Music retrieval: a tutorial and review," Foundations and Trends in Information Retrieval, vol. 1, no. 1, pp. 1-90, 2006. https://doi.org/10.1561/1500000002
  110. P. Herrera-Boyer, X. Amatriain, E. Batlle, and X. Serra, "Towards instrument segmentation for music content description: a critical review of instrument classification techniques," in Proceedings of 1st International Conference on Music Information Retrieval (ISMIR 2000), Plymouth, MA, 2000, pp. 115-119.
  111. P. Lamere, "Social tagging and music information retrieval," Journal of New Music Research, vol. 37, no. 2, pp. 101-114, 2008. https://doi.org/10.1080/09298210802479284
  112. A. Jha, "Music machine to predict tomorrow's hits," The Guardian, 2006 [Online]. Available: https://www.theguardian.com/technology/2006/jan/17/news.science.
  113. M. Goto, "Grand challenges in music information research," in Multimodal Music Processing, Dagstuhl, Germany: Dagstuhl Publishing, 2012, pp. 217-225.