A Study on Speech Separation using Sinusoidal Model and Psychoacoustics Model

정현파 모델과 사이코어쿠스틱스 모델을 이용한 음성 분리에 관한 연구

  • Hwang, Sun-Il (Dept. of Electrical and Electronic Eng, Yonsei University) ;
  • Han, Doo-Jin (Dept. of Electrical and Electronic Eng, Yonsei University) ;
  • Kwon, Chul-Hyun (Dept. of Electrical and Electronic Eng, Yonsei University) ;
  • Shin, Dae-Kyu (Dept. of Electrical and Electronic Eng, Yonsei University) ;
  • Park, Sang-Hui (Dept. of Electrical and Electronic Eng, Yonsei University)
  • 황선일 (연세대학교 전기 전자 공학과) ;
  • 한두진 (연세대학교 전기 전자 공학과) ;
  • 권철현 (연세대학교 전기 전자 공학과) ;
  • 신대규 (연세대학교 전기 전자 공학과) ;
  • 박상희 (연세대학교 전기 전자 공학과)
  • Published : 2001.07.18

Abstract

Speaker separation is needed when speech from two talkers has been summed into one signal and it is desirable to recover one or both of the speech signals from the composite signal. This paper proposes a method for separating the summed speech and verifies the result by computing the cross-correlation between each original signal and the corresponding separated signal. A frequency sampling method based on the sinusoidal model is used to separate a composite of two vocalic speech signals, and a noise masking method based on the psychoacoustic model is used to separate a composite of vocalic and nonvocalic speech.
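
The cross-correlation check mentioned above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `normalized_xcorr_peak`, the `max_lag` parameter, and the toy sinusoidal "talkers" are assumptions made for illustration, and the abstract does not say whether the correlation was normalized or over which lags it was evaluated.

```python
import math


def normalized_xcorr_peak(reference, estimate, max_lag=0):
    """Peak of the normalized cross-correlation for lags in [-max_lag, max_lag].

    Hypothetical helper: values near 1 mean the separated signal closely
    matches the original; values near 0 mean little similarity.
    """
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    n = len(reference)

    # Remove the mean so a constant offset does not inflate the score.
    ref_mean = sum(reference) / n
    est_mean = sum(estimate) / n
    ref = [x - ref_mean for x in reference]
    est = [y - est_mean for y in estimate]

    # Normalize by the signal energies so the score is scale invariant.
    norm = math.sqrt(sum(x * x for x in ref) * sum(y * y for y in est))
    if norm == 0.0:
        return 0.0

    best = -1.0
    for lag in range(-max_lag, max_lag + 1):
        acc = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                acc += ref[i] * est[j]
        best = max(best, acc / norm)
    return best


# Toy stand-ins for two talkers (assumption: pure tones, not real speech).
N = 200
talker_a = [math.sin(2 * math.pi * 5 * i / N) for i in range(N)]
talker_b = [math.sin(2 * math.pi * 12 * i / N) for i in range(N)]

score_same = normalized_xcorr_peak(talker_a, talker_a)
score_other = normalized_xcorr_peak(talker_a, talker_b)
```

In this toy setting `score_same` is close to 1 and `score_other` is close to 0. Under this measure, a separated signal that scores high against its own source and low against the other talker would count as a good separation.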

Keywords