Multi-stage Speech Recognition Using Confidence Vector

Jeon, Hyung-Bae;Hwang, Kyu-Woong;Chung, Hoon;Kim, Seung-Hi;Park, Jun;Lee, Yun-Keun;

MALSORI (대한음성학회지:말소리)

Issue 63
/
Pages.113-124
/
2007
/
1226-1173(pISSN)

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Multi-stage Speech Recognition Using Confidence Vector

신뢰도 벡터 기반의 다단계 음성인식

Jeon, Hyung-Bae (ETRI) ;
Hwang, Kyu-Woong (ETRI) ;
Chung, Hoon (ETRI) ;
Kim, Seung-Hi (ETRI) ;
Park, Jun (ETRI) ;
Lee, Yun-Keun (ETRI)

전형배 (한국전자통신연구원(ETRI)음성/언어정보연구센터 음성처리연구팀) ;
황규웅 (한국전자통신연구원(ETRI)음성/언어정보연구센터 자동통역연구팀) ;
정훈 (한국전자통신연구원(ETRI)음성/언어정보연구센터 음성처리연구팀) ;
김승희 (한국전자통신연구원(ETRI)음성/언어정보연구센터 자동통역연구팀) ;
박준 (한국전자통신연구원(ETRI)음성/언어정보연구센터 자동통역연구팀) ;
이윤근 (한국전자통신연구원(ETRI)음성/언어정보연구센터 음성처리연구팀)

Published : 2007.09.30

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we propose a use of confidence vector as an intermediate input feature for multi-stage based speech recognition architecture to improve recognition accuracy. A multi-stage speech recognition structure is introduced as a method to reduce the computational complexity of the decoding procedure and then accomplish faster speech recognition. Conventional multi-stage speech recognition is usually composed of three stages, acoustic search, lexical search, and acoustic re-scoring. In this paper, we focus on improving the accuracy of the lexical decoding by introducing a confidence vector as an input feature instead of phoneme which was used typically. We take experimental results on 220K Korean Point-of-Interest (POI) domain and the experimental results show that the proposed method contributes on improving accuracy.

MALSORI (대한음성학회지:말소리)

Multi-stage Speech Recognition Using Confidence Vector

신뢰도 벡터 기반의 다단계 음성인식

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)