Acoustic Modeling and Energy-Based Postprocessing for Automatic Speech Segmentation

Park Hyeyoung;Kim Hyungsoon;

대한음성학회지:말소리 (MALSORI)

제43호
/
Pages.137-150
/
2002
/
1226-1173(pISSN)

대한음성학회 (The Korean Society Of Phonetic Sciences And Speech Technology)

자동 음성 분할을 위한 음향 모델링 및 에너지 기반 후처리

Acoustic Modeling and Energy-Based Postprocessing for Automatic Speech Segmentation

박혜영 (부산대) ;
김형순 (부산대)

발행 : 2002.06.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Speech segmentation at phoneme level is important for corpus-based text-to-speech synthesis. In this paper, we examine acoustic modeling methods to improve the performance of automatic speech segmentation system based on Hidden Markov Model (HMM). We compare monophone and triphone models, and evaluate several model training approaches. In addition, we employ an energy-based postprocessing scheme to make correction of frequent boundary location errors between silence and speech sounds. Experimental results show that our system provides 71.3% and 84.2% correct boundary locations given tolerance of 10 ms and 20 ms, respectively.

대한음성학회지:말소리 (MALSORI)

자동 음성 분할을 위한 음향 모델링 및 에너지 기반 후처리

Acoustic Modeling and Energy-Based Postprocessing for Automatic Speech Segmentation

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)