모음길이 비율에 따른 발화속도 보상을 이용한 한국어 음성인식 성능향상

An Improvement of Korean Speech Recognition Using a Compensation of the Speaking Rate by the Ratio of a Vowel length

  • 박준배 (인하대학교 전자계산공학과) ;
  • 김태준 (인하대학교 전자계산공학과) ;
  • 최성용 (인하대학교 전자계산공학과) ;
  • 이정현 (인하대학교 컴퓨터 공학부)
  • 발행 : 2003.11.01

초록

The accuracy of automatic speech recognition system depends on the presence of background noise and speaker variability such as sex, intonation of speech, and speaking rate. Specially, the speaking rate of both inter-speaker and intra-speaker is a serious cause of mis-recognition. In this paper, we propose the compensation method of the speaking rate by the ratio of each vowel's length in a phrase. First the number of feature vectors in a phrase is estimated by the information of speaking rate. Second, the estimated number of feature vectors is assigned to each syllable of the phrase according to the ratio of its vowel length. Finally, the process of feature vector extraction is operated by the number that assigned to each syllable in the phrase. As a result the accuracy of automatic speech recognition was improved using the proposed compensation method of the speaking rate.

키워드