A Study on the Efficient Speech Recognition System using Database Grouping

어휘 그룹화를 이용한 음성인식시스템의 성능향상에 관한 연구

  • 우상욱 (한양대학교 전자통신전파공학과) ;
  • 권승호 (한양대학교 전자통신전파공학과) ;
  • 한수양 (한양대학교 전자통신전파공학과) ;
  • 이동규 (한양대학교 전자통신전파공학과) ;
  • 이두수 (한양대학교 전자통신전파공학과)
  • Published : 2003.07.01

Abstract

In this paper, the Classification of Energy Labeling has been Proposed. Energy Parameters of input signal which is extracted from each phoneme is labelled. And groups of labelling according to detected energies of input signals are detected. Next, DTW processes in a selected group of labeling. This leads to DTW processing faster than a previous algorithm. In this Method, because an accurate detection of parameters is necessary on the assumption in steps of a detection of speeching duration and a detection of energy parameters, variable windows which are decided by pitch period is used. Extract algorithms don't search for exact frame energy, because 256 frame window-sizes is fixed. For this reason, a new energy extraction method has been proposed. A pitch period is detected firstly; next window scale is decided between 200 frames and 300 frames. The proposed method make it possible to cancel an influence of windows.

Keywords