Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2008.15-B.1.53

Improved Automatic Lipreading by Multiobjective Optimization of Hidden Markov Models  

Lee, Jong-Seok (한국과학기술원 전자전산학부)
Park, Cheol-Hoon (한국과학기술원 전자전산학부)
Abstract
This paper proposes a new multiobjective optimization method for discriminative training of hidden Markov models (HMMs) used as the recognizer for automatic lipreading. While the conventional Baum-Welch algorithm for training HMMs aims at maximizing the probability of the data of a class from the corresponding HMM, we define a new training criterion composed of two minimization objectives and develop a global optimization method of the criterion based on simulated annealing. The result of a speaker-dependent recognition experiment shows that the proposed method improves performance by the relative error reduction rate of about 8% in comparison to the Baum-Welch algorithm.
Keywords
Automatic Lipreading; Multiobjective Optimization; Discriminative Training; Simulated Annealing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 L. A. Ross, D. Saint-Amour, V. M. Leavitt, D. C. Javitt, and J. J. Foxe, 'Do you see what I am saying? Exploring visual enhancement of speech comprehension in noisy environments,' Cerebral Cortex, Vol. 17, No. 5, pp. 1147-1153, 2007   DOI   ScienceOn
2 L. Rabiner and B.-H. Juang, 'Fundamentals of Speech Recognition,' Prentice-Hall, 1993
3 W. Chou, 'Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition,' Proc. IEEE, Vol. 88, No. 8, pp. 1201-1223, 2000   DOI   ScienceOn
4 B.-H. Juang, W. Chou, and C.-H. Lee, 'Minimum classification error rate methods for speech recognition,' IEEE Trans. Speech and Audio Processing, Vol. 5, No. 3, pp. 257-265, 1997   DOI   ScienceOn
5 A. Ben-Yishai and D. Burshtein, 'A discriminative training algorithm for hidden Markov models,' IEEE Trans. Speech and Audio Processing, Vol. 12, No. 3, pp. 204-216, 2004   DOI   ScienceOn
6 S. Kirkpatrick, C. D. Gerlatt, and M. P. Vecchi, 'Optimization by simulated annealing,' Science, Vol. 220, pp. 671-680, 1983   DOI   ScienceOn
7 이종석, 심선희, 김소영, 박철훈, '제어되지 않은 조명 조건하에서 입술움직임의 강인한 특징추출을 이용한 바이모달 음성인식,' Telecommunications Review, 14권 1호, pp. 123-134, 2004
8 R. L. Yang, 'Convergence of the simulated annealing algorithm for continuous global optimization,' J. Optimization Theory and Applications, Vol. 104, No. 3, pp. 691-716, 2004
9 H. H. Szu and R. L. Hartley, 'Fast simulated annealing,' Phys. Lett. A, Vol. 122, No. 3-4, pp. 157-162, June 1987   DOI   ScienceOn
10 D. Nam, J.-S. Lee, and C. H. Park, 'n-dimensional Cauchy neighbor generation for the fast simulated annealing,' IEICE Trans. Information and Systems, Vol. E87-D, No. 11, pp. 2499-2502, 2004
11 D. Nam and C. H. Park, 'Pareto-based costsimulated annealing for multiobjective optimization,' Proc. Asia-Pacific Conf. Simulated Evolution and Learning, Vol. 2, pp. 522-526, Singapore, 2002
12 K. Deb, A. Pratap, and T. Meyarivan, 'A fast and elitist multiobjective genetic algorithm: NSGA-II,' IEEE Trans. Evolutionary Computation, Vol. 6, No. 2, pp. 182-197, Apr. 2002   DOI   ScienceOn
13 E. Zitzler, M. Laumanns, and S. Bleuler, 'A tutorial on evolutionary multiobjective optimization,' Metaheuristics for Multiobjective Optimisation, Lecture Notes in Economics and Mathematical Systems, X. Gandibleux, M. Sevaux, K. Sorensen, and V. T'kindt, Eds. Springer-Verlag, Vol. 535, pp. 3-37, 2004
14 C. C. Chibelushi, F. Deravi, and J. S. D. Mason, 'A review of speech-based bimodal recognition,' IEEE Trans. Multimedia, Vol. 4, No. 1, pp. 23-37, 2002   DOI   ScienceOn
15 N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller and E. Teller, 'Equation of state calculations by fast computing machines,' J. Chem Phys., Vol. 21, No. 6, pp. 1087-1092, 1953   DOI