MALSORI (대한음성학회지:말소리)
- Issue 63 / Pages 113-124 / 2007 / 1226-1173 (pISSN)
Multi-stage Speech Recognition Using Confidence Vector
신뢰도 벡터 기반의 다단계 음성인식
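- Jeon, Hyung-Bae (전형배): Speech Processing Research Team, Speech/Language Information Research Center, Electronics and Telecommunications Research Institute (ETRI)
- Hwang, Kyu-Woong (황규웅): Automatic Interpretation Research Team, Speech/Language Information Research Center, ETRI
- Chung, Hoon (정훈): Speech Processing Research Team, Speech/Language Information Research Center, ETRI
- Kim, Seung-Hi (김승희): Automatic Interpretation Research Team, Speech/Language Information Research Center, ETRI
- Park, Jun (박준): Automatic Interpretation Research Team, Speech/Language Information Research Center, ETRI
- Lee, Yun-Keun (이윤근): Speech Processing Research Team, Speech/Language Information Research Center, ETRI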
- Published : 2007.09.30
Abstract
In this paper, we propose using a confidence vector as an intermediate input feature in a multi-stage speech recognition architecture to improve recognition accuracy. Multi-stage speech recognition was introduced to reduce the computational complexity of decoding and thereby speed up recognition. A conventional multi-stage recognizer usually consists of three stages: acoustic search, lexical search, and acoustic re-scoring. We focus on improving the accuracy of lexical decoding by using a confidence vector as its input feature instead of the phonemes typically used. Experiments on a 220K Korean Point-of-Interest (POI) domain show that the proposed method improves recognition accuracy.
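The contrast between phoneme input and confidence-vector input to the lexical stage can be illustrated with a toy sketch. The abstract does not define the confidence vector or the search procedure, so everything here is an assumption for illustration: each segment is taken to carry a vector of per-phoneme confidences, lexicon entries are assumed to be pre-aligned one phoneme per segment, and the names `hard_score`, `confidence_score`, and `lexical_search` are hypothetical, not the authors' method.

```python
import math

# Toy lexicon: word -> phoneme sequence (hypothetical entries).
LEXICON = {
    "kan": ["k", "a", "n"],
    "snk": ["s", "n", "k"],
}

# Assumed output of the acoustic stage: one confidence vector per segment,
# mapping each phoneme to a confidence value (here summing to 1).
CONF_VECTORS = [
    {"a": 0.005, "k": 0.49, "n": 0.005, "s": 0.50},
    {"a": 0.49, "k": 0.005, "n": 0.50, "s": 0.005},
    {"a": 0.005, "k": 0.01, "n": 0.98, "s": 0.005},
]


def one_best(conf_vectors):
    """Collapse each confidence vector to its single most confident phoneme."""
    return [max(vec, key=vec.get) for vec in conf_vectors]


def hard_score(conf_vectors, phonemes):
    """Phoneme-input scoring: count positions where the 1-best phoneme matches."""
    decoded = one_best(conf_vectors)
    return sum(1 for d, p in zip(decoded, phonemes) if d == p)


def confidence_score(conf_vectors, phonemes):
    """Confidence-vector scoring: sum the log-confidence of each expected phoneme."""
    return sum(math.log(vec[p]) for vec, p in zip(conf_vectors, phonemes))


def lexical_search(lexicon, conf_vectors, score_fn):
    """Return the lexicon entry with the highest score under score_fn."""
    return max(lexicon, key=lambda word: score_fn(conf_vectors, lexicon[word]))


hard_result = lexical_search(LEXICON, CONF_VECTORS, hard_score)
soft_result = lexical_search(LEXICON, CONF_VECTORS, confidence_score)
print(hard_result, soft_result)
```

In this toy case the first two segments are near-ties, so collapsing them to a single phoneme discards the information that "k" and "a" were almost as likely as "s" and "n". Scoring against the full vectors keeps that information, which is one plausible reading of why a confidence vector could help lexical decoding.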