Spontaneous Speech Language Modeling using N-gram based Similarity

Park Young-Hee;Chung Minhwa;

MALSORI (대한음성학회지:말소리)

Issue 46
/
Pages.117-126
/
2003
/
1226-1173(pISSN)

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Spontaneous Speech Language Modeling using N-gram based Similarity

N-gram 기반의 유사도를 이용한 대화체 연속 음성 언어 모델링

박영희 (서강대) ;
정민화 (서강대)

Published : 2003.06.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This paper presents our language model adaptation for Korean spontaneous speech recognition. Korean spontaneous speech is observed various characteristics of content and style such as filled pauses, word omission, and contraction as compared with the written text corpus. Our approaches focus on improving the estimation of domain-dependent n-gram models by relevance weighting out-of-domain text data, where style is represented by n-gram based tf/sup */idf similarity. In addition to relevance weighting, we use disfluencies as Predictor to the neighboring words. The best result reduces 9.7% word error rate relatively and shows that n-gram based relevance weighting reflects style difference greatly and disfluencies are good predictor also.

MALSORI (대한음성학회지:말소리)

Spontaneous Speech Language Modeling using N-gram based Similarity

N-gram 기반의 유사도를 이용한 대화체 연속 음성 언어 모델링

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)