Multi-stage Speech Recognition Using Confidence Vector

신뢰도 벡터 기반의 다단계 음성인식

  • 전형배 (한국전자통신연구원(ETRI)음성/언어정보연구센터 음성처리연구팀) ;
  • 황규웅 (한국전자통신연구원(ETRI)음성/언어정보연구센터 자동통역연구팀) ;
  • 정훈 (한국전자통신연구원(ETRI)음성/언어정보연구센터 음성처리연구팀) ;
  • 김승희 (한국전자통신연구원(ETRI)음성/언어정보연구센터 자동통역연구팀) ;
  • 박준 (한국전자통신연구원(ETRI)음성/언어정보연구센터 자동통역연구팀) ;
  • 이윤근 (한국전자통신연구원(ETRI)음성/언어정보연구센터 음성처리연구팀)
  • Published : 2007.09.30

Abstract

In this paper, we propose a use of confidence vector as an intermediate input feature for multi-stage based speech recognition architecture to improve recognition accuracy. A multi-stage speech recognition structure is introduced as a method to reduce the computational complexity of the decoding procedure and then accomplish faster speech recognition. Conventional multi-stage speech recognition is usually composed of three stages, acoustic search, lexical search, and acoustic re-scoring. In this paper, we focus on improving the accuracy of the lexical decoding by introducing a confidence vector as an input feature instead of phoneme which was used typically. We take experimental results on 220K Korean Point-of-Interest (POI) domain and the experimental results show that the proposed method contributes on improving accuracy.

Keywords