http://dx.doi.org/10.7776/ASK.2010.29.2.102

Music Transcription Using Non-Negative Matrix Factorization  

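Park, Sang-Ha (Acoustic Engineering Laboratory, Institute of New Media and Communications, School of Electrical Engineering and Computer Science, Seoul National University)
Lee, Seok-Jin (Acoustic Engineering Laboratory, Institute of New Media and Communications, School of Electrical Engineering and Computer Science, Seoul National University)
Sung, Koeng-Mo (Acoustic Engineering Laboratory, Institute of New Media and Communications, School of Electrical Engineering and Computer Science, Seoul National University)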
Abstract
Music transcription is the task of extracting pitch (the height of a musical note) and rhythm (the duration of a musical note) information from an audio file and producing a musical score. In this paper, we decompose a waveform into frequency and rhythm components using Non-Negative Matrix Factorization (NMF) and Non-Negative Sparse Coding (NNSC), which are often used for source separation and data clustering. The fundamental frequency is then calculated from each decomposed frequency component using the subharmonic summation method, so that the pitch of each note can be estimated accurately. The proposed method successfully performed music transcription, with results superior to those of conventional methods that use either NMF or NNSC.
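As a rough illustration of the pipeline the abstract describes, the Python sketch below factors a magnitude spectrogram with NMF and then applies subharmonic summation to each spectral basis vector. It is not the authors' implementation. It assumes the standard Lee-Seung multiplicative updates for the Euclidean cost, leaves out NNSC entirely, and uses a simplified linear-frequency form of subharmonic summation. The function names, the synthetic two-note spectrogram, the number of harmonics, and the 0.84 decay factor are all assumptions made for the example.

```python
import numpy as np


def nmf(V, rank, n_iter=500, seed=0, eps=1e-9):
    """Approximate V (freq x time, non-negative) as W @ H.

    Uses multiplicative updates for the squared Euclidean error.
    Columns of W are spectral templates; rows of H are their activations
    over time (the "rhythm" part).
    """
    rng = np.random.default_rng(seed)
    n_freq, n_time = V.shape
    W = rng.random((n_freq, rank)) + eps
    H = rng.random((rank, n_time)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def subharmonic_summation(spectrum, freqs, f0_candidates, n_harmonics=8, decay=0.84):
    """Return the candidate f0 that maximizes sum_n decay**(n-1) * S(n * f0)."""
    scores = np.zeros(len(f0_candidates))
    for n in range(1, n_harmonics + 1):
        # Magnitude at the n-th harmonic, linearly interpolated between bins;
        # harmonics above the highest bin contribute zero.
        harmonic_mag = np.interp(n * f0_candidates, freqs, spectrum, right=0.0)
        scores += decay ** (n - 1) * harmonic_mag
    return f0_candidates[np.argmax(scores)]


# Synthetic demo data (assumed for illustration, not taken from the paper).
freqs = np.arange(0.0, 4000.0, 10.0)  # 400 frequency bins, 10 Hz apart


def note_spectrum(f0, n_partials=6):
    """Idealized harmonic spectrum: partial h has amplitude 1/h."""
    spec = np.zeros_like(freqs)
    for h in range(1, n_partials + 1):
        spec[int(round(h * f0 / 10.0))] = 1.0 / h
    return spec


n_frames = 30
act_a = np.zeros(n_frames)
act_b = np.zeros(n_frames)
act_a[:20] = 1.0   # 220 Hz note sounds in frames 0-19
act_b[10:] = 1.0   # 330 Hz note sounds in frames 10-29 (overlap in 10-19)
V = np.outer(note_spectrum(220.0), act_a) + np.outer(note_spectrum(330.0), act_b)

W, H = nmf(V, rank=2)
candidates = np.arange(100.0, 500.0, 1.0)
estimated = sorted(
    float(subharmonic_summation(W[:, k], freqs, candidates)) for k in range(W.shape[1])
)
print("estimated fundamentals (Hz):", estimated)
```

In this sketch the note onsets and durations would be read from the rows of `H`, for example by thresholding each row, while the pitch labels come from the columns of `W`. Summing over harmonics makes the pitch estimate depend on the whole partial series rather than on the strongest single peak in a template.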
Keywords
Non-Negative Factorization; Music Transcription; Subharmonic Summation