Browse > Article
http://dx.doi.org/10.7776/ASK.2009.28.8.697

Blind Rhythmic Source Separation  

Kim, Min-Je (한국전자통신연구원 방통융합미디어연구부)
Yoo, Ji-Ho (포항공과대학교 컴퓨터공학과)
Kang, Kyeong-Ok (한국전자통신연구원 방통융합미디어연구부)
Choi, Seung-Jin (포항공과대학교 컴퓨터공학과)
Abstract
An unsupervised (blind) method is proposed aiming at extracting rhythmic sources from commercial polyphonic music whose number of channels is limited to one. Commercial music signals are not usually provided with more than two channels while they often contain multiple instruments including singing voice. Therefore, instead of using conventional modeling of mixing environments or statistical characteristics, we should introduce other source-specific characteristics for separating or extracting sources in the under determined environments. In this paper, we concentrate on extracting rhythmic sources from the mixture with the other harmonic sources. An extension of nonnegative matrix factorization (NMF), which is called nonnegative matrix partial co-factorization (NMPCF), is used to analyze multiple relationships between spectral and temporal properties in the given input matrices. Moreover, temporal repeatability of the rhythmic sound sources is implicated as a common rhythmic property among segments of an input mixture signal. The proposed method shows acceptable, but not superior separation quality to referred prior knowledge-based drum source separation systems, but it has better applicability due to its blind manner in separation, for example, when there is no prior information or the target rhythmic source is irregular.
Keywords
Blind Source Separation; Nonnegative Matrix Factorization; Nonnegative Matrix Partial Co-Factorization; Rhythmic Source Separation; Musical Information REtrieval;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D. FitzGerald, M. Cranitch, and E. Coyle, “Sound source separation using shifted non-negative tensor factorisation,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. V653-656, May, 2006   DOI
2 D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, vol. 401, no. 6755, pp. 788 - 791, 1999   DOI   ScienceOn
3 J. Yoo and S. Choi, "Weighted nonnegative matrix co-tri factorization for collaborative prediction," in Proc. of 1st Asian Conference on Machine Learning, pp. 396-411, Nov., 2009   DOI   ScienceOn
4 M. Kim, J. Yoo, K. Kang, and S. Choi, "Blind Rhythmic Source Separation: Nonnegativity and Repeatibility," in Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010 (submitted for publication)
5 D. D. Lee and H. S. Seung, “Algorithms for non-negative matrix factorization,” in Advances in Neural Information Processing Systems (NIPS), vol. 13. MIT Press, pp. 556-562, 2001
6 M. Kim and S. Choi, “On spectral basis selection for single channel polyphonic music separation,” in Proc. International Conference on Artificial Neural Networks (ICANN), vol. 2, pp. 157 - 162, Sept., 2005
7 M. Helen and T. Virtanen, “Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine,” in Proc. European Signal Processing Conference, Sept., 2005
8 D. FitzGerald, M. Cranitch, and E. Coyle, "Shifted nonnegative matrix factorisation for sound source separation," in Proc. IEEE Workshop on Statistical Signal Processing, pp.1132-1136, July, 2005   DOI
9 T. O. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 3, pp. 1066 - 1074, 2007   DOI   ScienceOn
10 J. Yoo, M. Kim, K. Kang, and S. Choi, “Nonnegative matrix partial co-factorization for drum source separation,” in Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2010 (submitted for publication)
11 H. Lee and S. Choi, “Group nonnegative matrix factorization for EEG classification,” in Proc. International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 320-327, April, 2009
12 P. Smaragdis and J. C. Brown, "Non-negative matrix factorization for polyphonic music transcription," in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 177 -180, Oct., 2003
13 Y. C. Cho and S. Choi, “Nonnegative features of spectro-temporal sounds for classification,” Pattern Recognition Letters, vol. 26, no. 9, pp. 1327 - 1336, 2005   DOI   ScienceOn