Optimization of State-Based Real-Time Speech Endpoint Detection Algorithm

상태변수 기반의 실시간 음성검출 알고리즘의 최적화

  • 김수환 (경상대학교 전자공학과) ;
  • 이영재 (경상대학교 전자공학과) ;
  • 김영일 (경상대학교 전자공학과) ;
  • 정상배 (경상대학교 전자공학과)
  • Received : 2010.10.29
  • Accepted : 2010.12.16
  • Published : 2010.12.31

Abstract

In this paper, a speech endpoint detection algorithm is proposed. The proposed algorithm is a kind of state transition-based ones for speech detection. To reject short-duration acoustic pulses which can be considered noises, it utilizes duration information of all detected pulses. For the optimization of parameters related with pulse lengths and energy threshold to detect speech intervals, an exhaustive search scheme is adopted while speech recognition rates are used as its performance index. Experimental results show that the proposed algorithm outperforms the baseline state-based endpoint detection algorithm. At 5 dB input SNR for the beamforming input, the word recognition accuracies of its outputs were 78.5% for human voice noises and 81.1% for music noises.

Keywords