Browse > Article
http://dx.doi.org/10.13067/JKIECS.2015.10.1.1

Vocal Separation in Music Using SVM and Selective Frequency Subtraction  

Kim, Hyun-Tae (동의대학교 멀티미디어공학과)
Publication Information
The Journal of the Korea institute of electronic communication sciences / v.10, no.1, 2015 , pp. 1-6 More about this Journal
Abstract
Recently, According to increasing interest to original sound Karaoke instrument, MIDI type karaoke manufacturer attempt to make more cheap method instead of original recoding method. The specific method is to make the original sound accompaniment to remove only the voice of the singer in the singer music album. In this paper, a system to separate vocal components from music accompaniment for stereo recordings were proposed. Proposed system consists of two stages. The first stage is a vocal detection. This stage classifies an input into vocal and non vocal portions by using SVM with MFCC. In the second stage, selective frequency subtractions were performed at each frequency bin in vocal portions. Listening test with removed vocal music from proposed system show relatively high satisfactory level.
Keywords
MFCC; SVM; Vocal Remover; Selective Frequency Subtraction;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 W. Tsai and H. Wang, "Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals," IEEE Trans. Audio, Speech, and Language Processing, vol. 14, issue 1, 2006, pp. 330-341.   DOI
2 S. Vembu and S. Baumann, "Separation of vocals from polyphonic audio recordings," In Proc. Int. Society for Music Information Retrieval, London, UK, Sept., 2005.
3 H. Kim, G. Lee, J. park, and Y. Yu, "Vehicle Detection in Tunnel using Gaussian Mixture Model and Mathematical Morphological Processing," J. of the Korea Institute of Electronic Communication Science, vol. 7, no. 5, 2012, pp. 967-974.   과학기술학회마을   DOI
4 K. Park and H. Kim, "A Study for Video-based Vehicle Surveillance on Outdoor Road," J. of the Korea Institute of Electronic Communication Science, vol. 8, no. 11, 2013, pp. 1647-1653.   DOI   ScienceOn
5 H. Kim and J. Park, "Smoke Detection in Outdoor Using Its Statistical Characteristics," J. of the Korea Institute of Electronic Communication Science, vol. 9, no. 2, 2014, pp. 149-154.   과학기술학회마을   DOI
6 T. Leung, C. Ngo, and R. W. H. Lau, "Ica-fx features for classification of singing voice and instrumental sound," In Proc. Int. Conf. on Pattern Recognition, Cambridge, UK, vol. 2, Aug. 2004.
7 A. Berenzweig and D. P. W. Ellis, "Locating singing voice segments within music signals," In Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA' 2001), New York, NY, Oct. 2001.
8 T. Virtanen, A. Mesaros, and M. Ryynanen, "Combining Pitch-Based Inference and Non-Negative Spectrogram Factorization in Separating Vocals from Polyphonic Music," In Proc. Statistical and Perceptual Audition, Brisbane, Australia, Sept. 2008.
9 J.-L. Durrieu, A. Ozerov, C. Fevotte, G. Richard, and B. David, "Main instrument separation from stereophonic audio signals using a source/filter model," 17th European Signal Processing Conf. (EUSIPCO 2009) Glasgow, Scotland, Aug. 2009.
10 H. Park and K. Lee, "Pattern and Machine Learning from Fundamental to Applications, Goyang, Korea : Ihan Press, 2011.