Korean Speech Segmentation and Recognition by Frame Classification via GMM

GMM을 이용한 프레임 단위 분류에 의한 우리말 음성의 분할과 인식

  • 권호민 (동아대학교 전자공학과) ;
  • 한학용 (동아대학교 전자공학과) ;
  • 고시영 (경일대학교 전자정보공학과) ;
  • 허강인 (동아대학교 전자공학과)
  • Published : 2003.06.01

Abstract

In general it has been considered to be the difficult problem that we divide continuous speech into short interval with having identical phoneme quality. In this paper we used Gaussian Mixture Model (GMM) related to probability density to divide speech into phonemes, an initial, medial, and final sound. From them we peformed continuous speech recognition. Decision boundary of phonemes is determined by algorithm with maximum frequency in a short interval. Recognition process is performed by Continuous Hidden Markov Model(CHMM), and we compared it with another phoneme divided by eye-measurement. For the experiments result we confirmed that the method we presented is relatively superior in auto-segmentation in korean speech.

Keywords