Vocal Enhancement for Improving the Performance of Vocal Pitch Detection

Lee, Se-Won;Song, Chai-Jong;Lee, Seok-Pil;Park, Ho-Chong;

doi:10.7776/ASK.2011.30.6.353

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 30 Issue 6
/
Pages.353-359
/
2011
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

DOI QR Code

Vocal Enhancement for Improving the Performance of Vocal Pitch Detection

보컬 피치 검출의 성능 향상을 위한 보컬 강화 기술

이세원 (광운대학교 전자공학과) ;
송재종 (전자부품연구원) ;
이석필 (전자부품연구원) ;
박호종 (광운대학교 전자공학과)

Received : 2011.04.29
Accepted : 2011.07.29
Published : 2011.08.31

https://doi.org/10.7776/ASK.2011.30.6.353 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

This paper proposes a vocal enhancement technique for improving the performance of vocal pitch detection in polyphonic music signal. The proposed vocal enhancement technique predicts an accompaniment signal from the input signal and generates an accompaniment replica signal according to the vocal power. Then, it removes the accompaniment replica signal from the input signal, resulting in a vocal-enhanced signal. The performance of the proposed method was measured by applying the same vocal pitch extraction method to the original and the vocal-enhanced signal, and the vocal pitch detection accuracy was increased by 7.1 % point in average.

본 논문에서는 다성 음악 신호의 보컬 피치 검출 성능을 향상시키기 위해 음악 신호의 보컬 신호를 강화시키는 전처리 기술을 제안한다. 제안한 보컬 강화 기술은 입력된 다성 음악 신호로부터 반주 신호를 예측하고, 예측된 반주 신호를 입력된 보컬 신호의 크기에 맞춰 가공하여 반주 복사본 신호를 생성한다. 마지막으로 주파수 영역에서 반주 복사본 신호를 원래 다성 음악 신호에서 제거하여 보컬이 강화된 출력 신호를 생성한다. 원 음악 신호와 제안한 방법으로 보컬이 강화된 신호에 동일한 보컬 피치 검출 방법을 각각 적용하여 피치 검출의 정확도를 측정하였고, 제안한 기술에 의하여 피치 검출 정확도가 평균 7.1 % 포인트 향상된 것을 확인하였다.

Keywords

References

Yipeng Li and DeLiang Wang, "Detecting pitch of singing voice in polyphonic audio," IEEE Conf.Acoustics, Speech, and Signal Processing, vol.3, pp.17-20, 2005.
Jean-Louis Durrieu, Gael Richard and Bertrand David, "Singer melody extraction in polyphonic signals using source separation methods," IEEE Conf.Acoustics, Speech, and Signal Processing, vol.43, no.4, pp.169-172, 2008.
Masataka Goto, Takeshi Saitou, Tomoyasu Nakano and Hiromasa Fujihara, "Singing Information Processing based on singing voice modeling," IEEE Conf.Acoustics, Speech, and Signal Processing, pp.5506-5509, 2010.
Vishweshwara Rao and Preeti Rao, "Vocal melody extraction in the presence of pitched accompaniment in polyphonic music," IEEE Trans.Audio, Speech, and Language Processing, vol.18, pp.2145-2154, 2010. https://doi.org/10.1109/TASL.2010.2042124
Anssi Klapuri, "Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model." IEEE Trans.Audio, Speech, and Language Processing, vol.16, pp.255-266, 2008. https://doi.org/10.1109/TASL.2007.908129
N.Ono, K.Miyamoto, J.Le Roux, H.Kameoka and S. Sagayama "Separation of a monaural audio signals into harmonic/percussive components by complementary diffusion on spectrogram," Processings of EUSIPCO, 2008.
TIA/EIA/IS-127, Enhanced Variable Rate Codec, Speech Service Option 3 for Wideband Spread Spectrum Digital Systems, Jan.1997.
S.F.Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans.Acoustics, Speech, Signal Processing, vol.27, pp.113-120, 1979. https://doi.org/10.1109/TASSP.1979.1163209
http://labrosa.ee.columbia.edu/projects/melody
Yipeng Li and DeLiang Wang, "Separation of singing voice from music accompaniment for monaural recording," IEEE Trans. Audio, Speech, and Language Processing, vol.15, pp. 1475-1487, 2007. https://doi.org/10.1109/TASL.2006.889789
Sen Zhang, "An energy-based adaptive voice detection approach," Proc.8th International Conf.Signal Processing, vol.1, pp.1109-1113, 2006.

The Journal of the Acoustical Society of Korea (한국음향학회지)

Vocal Enhancement for Improving the Performance of Vocal Pitch Detection

보컬 피치 검출의 성능 향상을 위한 보컬 강화 기술

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)