Improvement of Rejection Performance using the Lip Image and the PSO-NCM Optimization in Noisy Environment

잡음 환경 하에서의 입술 정보와 PSO-NCM 최적화를 통한 거절 기능 성능 향상

  • 김병돈 (동신대학교 공과대학 컴퓨터학과) ;
  • 최승호 (동신대학교 공과대학 컴퓨터학과)
  • Received : 2011.05.25
  • Accepted : 2011.06.23
  • Published : 2011.06.30


Recently, audio-visual speech recognition (AVSR) has been studied to cope with noise problems in speech recognition. In this paper we propose a novel method of deciding weighting factors for audio-visual information fusion. We adopt the particle swarm optimization (PSO) to weighting factor determination. The AVSR experiments show that PSO-based normalized confidence measures (NCM) improve the rejection performance of mis-recognized words by 33%.