Subword-based Lip Reading Using State-tied HMM

상태공유 HMM을 이용한 서브워드 단위 기반 립리딩

  • 김진영 (전남대학교 공과대학 정보통신공학부) ;
  • 신도성 (전남대학교 공과대학 대학원 전자공학과)
  • Published : 2001.09.01

Abstract

In recent years research on HCI technology has been very active and speech recognition is being used as its typical method. Its recognition, however, is deteriorated with the increase of surrounding noise. To solve this problem, studies concerning the multimodal HCI are being briskly made. This paper describes automated lipreading for bimodal speech recognition on the basis of image- and speech information. It employs audio-visual DB containing 1,074 words from 70 voice and tri-viseme as a recognition unit, and state tied HMM as a recognition model. Performance of automated recognition of 22 to 1,000 words are evaluated to achieve word recognition of 60.5% in terms of 22word recognizer.

Keywords