Phonetics and Speech Sciences (말소리와 음성과학)
- Volume 3 Issue 2
- /
- Pages.65-70
- /
- 2011
- /
- 2005-8063(pISSN)
- /
- 2586-5854(eISSN)
Improvement of Rejection Performance using the Lip Image and the PSO-NCM Optimization in Noisy Environment
잡음 환경 하에서의 입술 정보와 PSO-NCM 최적화를 통한 거절 기능 성능 향상
- Received : 2011.05.25
- Accepted : 2011.06.23
- Published : 2011.06.30
Abstract
Recently, audio-visual speech recognition (AVSR) has been studied to cope with noise problems in speech recognition. In this paper we propose a novel method of deciding weighting factors for audio-visual information fusion. We adopt the particle swarm optimization (PSO) to weighting factor determination. The AVSR experiments show that PSO-based normalized confidence measures (NCM) improve the rejection performance of mis-recognized words by 33%.
Keywords
- audio-visual speech recognition;
- particle swarm optimization;
- normalized confidence measure;
- rejection performance