Introduction of ETRI Broadcast News Speech Recognition System

ETRI 방송뉴스음성인식시스템 소개

  • Park Jun (Voice Interface Research Team, Speech/Language Information Research Center Electronics and Telecommunications Research Institute)
  • 박준 (ETRI 음성/언어정보연구센터 음성인터페이스연구팀)
  • Published : 2006.05.01

Abstract

This paper presents ETRI broadcast news speech recognition system. There are two major issues on the broadcast news speech recognition: 1) real-time processing and 2) out-of-vocabulary handling. For real-time processing, we devised the dual decoder architecture. The input speech signal is segmented based on the long-pause between utterances, and each decoder processes the speech segment alternatively. One decoder can start to recognize the current speech segment without waiting for the other decoder to recognize the previous speech segment completely. Thus, the processing delay is not accumulated. For out-of-vocabulary handling, we updated both the vocabulary and the language model, based on the recent news articles on the internet. By updating the language model as well as the vocabulary, we can improve the performance up to 17.2% ERR.

Keywords