Korean LVCSR for Broadcast News Speech

Lee, Gang-Seong

The Journal of the Acoustical Society of Korea

Volume 20, Issue 2E / Pages 3-8 / 2001 / 1225-4428 (pISSN)

The Acoustical Society of Korea (한국음향학회)

Lee, Gang-Seong (Computer Engineering Dept., KwangWoon Univ.)

Published: 2001.06.01

Abstract

This paper examines a Korean large vocabulary continuous speech recognition (LVCSR) system for broadcast news speech. A combined vowel-and-implosive unit is included in the phone set alongside the other short phone units in order to obtain a longer-unit acoustic model, and its effect is compared with that of conventional phone units. The dictionary units for language processing are extracted automatically from the eojeols (space-delimited Korean word phrases) that appear in the transcriptions. Triphone models are used for acoustic modeling and a trigram model is used for language modeling. Of the three major speaker groups in news broadcasts (anchors, journalists, and other people, i.e. interviewees who are neither anchors nor journalists), the speech of anchors and journalists, which has a lot of noise, was used for testing and recognition.
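
As an illustration of the trigram language modeling mentioned in the abstract, the following is a minimal sketch of an unsmoothed maximum-likelihood trigram estimate over sequences of dictionary units. The function names, the padding symbols `<s>` and `</s>`, and the placeholder tokens are assumptions made for this example; the abstract does not state which toolkit, smoothing, or back-off scheme the system uses, and a practical recognizer would normally apply some form of smoothing.

```python
from collections import Counter


def train_trigram(sentences):
    """Count trigrams and their two-unit histories over tokenized sentences."""
    tri, bi = Counter(), Counter()
    for tokens in sentences:
        # Pad so that the first real unit also has a two-unit history.
        padded = ["<s>", "<s>"] + list(tokens) + ["</s>"]
        for i in range(2, len(padded)):
            history = (padded[i - 2], padded[i - 1])
            tri[history + (padded[i],)] += 1
            bi[history] += 1
    return tri, bi


def trigram_prob(tri, bi, w1, w2, w3):
    """Unsmoothed maximum-likelihood estimate of P(w3 | w1, w2)."""
    history_count = bi[(w1, w2)]
    if history_count == 0:
        return 0.0  # unseen history; a real system would back off instead
    return tri[(w1, w2, w3)] / history_count


# Placeholder unit sequences standing in for segmented transcriptions.
corpus = [["u1", "u2", "u3"], ["u1", "u2", "u4"]]
tri_counts, bi_counts = train_trigram(corpus)
p = trigram_prob(tri_counts, bi_counts, "u1", "u2", "u3")
```

Here `p` is the relative frequency of `u3` following the history (`u1`, `u2`) in the toy corpus. In a recognizer, such probabilities would be combined with the triphone acoustic scores during decoding.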
