Implementation of Text-to-Audio Visual Speech Synthesis Using Key Frames of Face Images

Kim MyoungGon; Kim JinYoung; Baek SeongJoon

MALSORI (대한음성학회지:말소리)

Issue 43 / Pages 73-88 / 2002 / pISSN 1226-1173

The Korean Society Of Phonetic Sciences And Speech Technology (대한음성학회)

Original Korean title: 키프레임 얼굴영상을 이용한 시청각음성합성 시스템 구현 (Implementation of an Audio-Visual Speech Synthesis System Using Key-Frame Face Images)

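Kim MyoungGon (Chonnam National University);
Kim JinYoung (Chonnam National University);
Baek SeongJoon (Chonnam National University)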

Published : 2002.06.01


Abstract

This paper presents a lip-sync algorithm for natural facial synthesis, based on a key-frame method that uses radial basis functions (RBF). To synthesize the lips, we derive viseme range parameters from the phonemes and their durations as output by the text-to-speech (TTS) system, and we extract the viseme information that corresponds to each phoneme from an audio-visual database (AV DB). We apply a dominance function to model coarticulation and use bilinear interpolation to reduce computation time. Lip-sync is then performed by playing the synthesized images, obtained by interpolating between successive phonemes, together with the speech output of the TTS system.
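The abstract does not state the form of the dominance function, so the following is only a minimal sketch of how dominance-weighted blending of viseme targets can model coarticulation. It assumes a negative-exponential dominance function of the kind used in the Cohen-Massaro coarticulation model; the function names, the `rate` and `magnitude` parameters, the single "mouth opening" parameter, and the phoneme timings are all hypothetical and are not taken from the paper.

```python
import math

def dominance(t, center, magnitude=1.0, rate=10.0):
    """Assumed negative-exponential dominance of one viseme at time t (seconds)."""
    return magnitude * math.exp(-rate * abs(t - center))

def blend_visemes(t, segments):
    """Dominance-weighted average of viseme targets at time t.

    segments: list of (start, end, target) tuples, where target is a
    hypothetical lip parameter such as mouth opening in [0, 1].
    """
    # Each viseme is most dominant at the midpoint of its phoneme segment.
    weights = [dominance(t, (start + end) / 2.0) for start, end, _ in segments]
    total = sum(weights)  # always positive, since exp() > 0
    return sum(w * target for w, (_, _, target) in zip(weights, segments)) / total

# Hypothetical phoneme timing from a TTS front end: (start, end, mouth opening).
segments = [(0.00, 0.10, 0.1), (0.10, 0.25, 0.9), (0.25, 0.35, 0.3)]

# Sample the blended lip parameter at 30 frames per second for 11 frames.
trajectory = [blend_visemes(frame / 30.0, segments) for frame in range(11)]
```

Because each frame is a weighted average of the neighbouring viseme targets, the lip parameter moves smoothly between phonemes instead of jumping at segment boundaries. In a full system, a vector of such parameters would drive the key-frame image interpolation for each frame.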


