
Extracting Predominant Melody from Polyphonic Music using Harmonic Structure  

Yoon, Jea-Yul (Kwangwoon University)
Lee, Seok-Pil (Korea Electronics Technology Institute)
Seo, Kyeung-Hak (Korea Electronics Technology Institute)
Park, Ho-Chong (Kwangwoon University)
Abstract
In this paper, we propose a method for extracting the predominant melody from polyphonic music based on harmonic structure. Because polyphonic music contains multiple sound sources, melody detection consists of two stages: extracting multiple fundamental frequencies, and determining the predominant melody from those fundamental frequencies. Harmonic structure is an important feature of a monophonic signal: its spectrum has peaks at integer multiples of its fundamental frequency. We extract all fundamental frequency candidates contained in the polyphonic signal by checking whether each one satisfies the required condition of harmonic structure. We then group the harmonic peaks belonging to each extracted fundamental frequency, calculate the average harmonic energy of each group, and rank the fundamental frequencies accordingly. Finally, we run pitch tracking based on the rank and the continuity of the extracted fundamental frequencies to determine the predominant melody. We evaluate the proposed method on the ADC 2004 DB and 100 Korean pop songs using the MIREX 2005 evaluation metrics, and obtain a pitch accuracy of 90.42%.
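The three stages summarized in the abstract (harmonic-structure check, ranking by average harmonic energy, and rank- and continuity-based pitch tracking) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no thresholds or tracking rule, so the function names and every parameter below (`rel_threshold`, `n_harmonics`, `min_matched`, `tol`, `f0_range`, `max_jump_semitones`, `top_k`) are assumptions chosen only for illustration, and the greedy tracker is a stand-in for whatever tracking scheme the paper actually uses.

```python
import numpy as np


def spectral_peaks(mag, freqs, rel_threshold=0.05):
    """Return (frequencies, magnitudes) of local maxima in a magnitude spectrum.

    rel_threshold is an assumed cutoff relative to the largest magnitude.
    """
    floor = rel_threshold * mag.max()
    idx = [k for k in range(1, len(mag) - 1)
           if mag[k] > mag[k - 1] and mag[k] >= mag[k + 1] and mag[k] >= floor]
    return freqs[idx], mag[idx]


def rank_f0_candidates(peak_f, peak_m, n_harmonics=5, min_matched=4,
                       tol=0.03, f0_range=(80.0, 1000.0)):
    """Keep peaks that show harmonic structure and rank them by average harmonic energy.

    A peak is accepted as an F0 candidate if at least min_matched of its first
    n_harmonics integer multiples coincide with a spectral peak (within a
    relative tolerance tol). All of these values are assumptions.
    """
    ranked = []
    for f0 in peak_f:
        if not (f0_range[0] <= f0 <= f0_range[1]):
            continue
        energies = []
        for h in range(1, n_harmonics + 1):
            target = h * f0
            j = int(np.argmin(np.abs(peak_f - target)))  # nearest peak to h * f0
            if abs(peak_f[j] - target) <= tol * target:
                energies.append(peak_m[j] ** 2)
        if len(energies) >= min_matched:
            ranked.append((float(f0), float(np.mean(energies))))
    # Highest average harmonic energy first (rank 1).
    ranked.sort(key=lambda c: c[1], reverse=True)
    return ranked


def track_melody(frames, max_jump_semitones=2.0, top_k=3):
    """Greedy tracker over per-frame ranked candidate lists.

    For each frame, take the best-ranked candidate among the top_k that lies
    within max_jump_semitones of the previous pitch; if none does, fall back
    to the top-ranked candidate. Frames without candidates yield None.
    """
    melody, prev = [], None
    for cands in frames:
        cands = cands[:top_k]
        if not cands:
            melody.append(None)
            continue
        choice = cands[0][0]
        if prev is not None:
            for f0, _ in cands:
                if abs(12.0 * np.log2(f0 / prev)) <= max_jump_semitones:
                    choice = f0
                    break
        melody.append(choice)
        prev = choice
    return melody
```

In use, one would compute a magnitude spectrum per analysis frame (for example with `np.abs(np.fft.rfft(frame))` and `np.fft.rfftfreq`), pass it through `spectral_peaks` and `rank_f0_candidates`, and feed the resulting per-frame lists to `track_melody`. Requiring several matched harmonics is what rejects peaks that are merely overtones of a lower note, since their own multiples are mostly absent from the spectrum.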
Keywords
polyphonic music; predominant melody extraction; multi-pitch extraction; harmonic structure