Browse > Article

Confusion Model Selection Criterion for On-Line Handwritten Numeral Recognition  

Park, Mi-Na (강원대학교 컴퓨터정보통신공학과)
Ha, Jin-Young (강원대학교 컴퓨터학부)
Abstract
HMM tends to output high probability for not only the proper class data but confusable class data, since the modeling power increases as the number of parameters increases. Thus it may not be helpful for discrimination to simply increase the number of parameters of HMM. We proposed two methods in this paper. One is a CMC(Confusion Likelihood Model Selection Criterion) using confusion class data probability, the other is a new recognition method, RCM(Recognition Using Confusion Models). In the proposed recognition method, confusion models are constructed using confusable class data, then confusion models are used to depress misrecognition by confusion likelihood is subtracted from the corresponding standard model probability. We found that CMC showed better results using fewer number of parameters compared with ML, ALC2, and BIC. RCM recorded 93.08% recognition rate, which is 1.5% higher result by reducing 17.4% of errors than using standard model only.
Keywords
HMM; BIC; CMC; ALC; Confustion; Topology optimization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 하진영, A. Biem, J. Subrahmonia, 박미나, '모델의 사전 확률 추정을 통한 HMM 구조의 최적화', 한국정보과학회 추계 학술발표논문집, 제28권 제2호, pp. 325-327, 2001년 10월
2 Andress Stolcke and stephen Omohundro, 'Hidden Markov Model induction by Bayesian model merging,' in Advances in NIPS, Vol.5. pp. 11-18, 1993
3 S. Chen and P. S. Gopalakrishnan. 'Clustering via the Bayesian Information criterion with applications in speech recognition,' ICASSP, 2, pp. 645-649, 1998
4 Schwarz, 'Estimating the dimension of a model,' Ann, Statist, Vol.6, No.2, pp. 461-464, 1978   DOI   ScienceOn
5 Mi-Na Park, Jin-Young Ha, 'On-Line Handwritten Numeral Recognition using Topology Optimized HMMs,' The 5th Asia Pacific International Symposium on Information Technology, ISSN 1738-9062, pp. 87-290, 2006
6 http://clopinet.com/isabelle/Projects/modelselect/
7 A. Barron, J. Rissanen, and B. Yu, 'The minimum description length principle in coding and modeling,' IEEE Trans. Inform. Theory, Vol.44, pp. 2743-2760, 1998   DOI   ScienceOn
8 A. Biem, J.-Y. Ha and J. Subrahmonia, 'A Bayesian Model Selection Criterion for HMM Topology Optimization,' ICASSP 2002. Orlando, Florida, U.S.A., pp. I-989 - I-992, May, 2002
9 Jin-Young Ha, Alain Biem and Jayasheree Subrahmonia 'Use of Model Prior for HMM Topology Optimization,' The 4th Korea-China Joint Symposium on Information Technology for Oriental Language Processing and Pattern Recognition, 2001
10 Raymond C.Vasko, Jr., Amro EL-Jaroudi J.Robert, Boston 'Application of Hidden Markov Model Topology Estimation To Repetitive Lifting Data,' IEEE, pp. 4073-4076, 1997
11 Nopsuwanchai Roongroj and D. Povey, 'Discriminative training for HMM-based offline handwritten character recognition,' Proc. of 7th International Conference on Document Analysis and Recognition, Vol.1, pp. 114-118, 2003
12 Mi-Na Park, Jin-young Ha, 'HMM Topology Optimization for Handwriting and Speech Recognition,' The 20th Commemorative International Technical Conference on Circutes, Systems, Computer and Communications, Proceedings Volume1, pp. 261-262, July 2005
13 Raymond C.Vasko, Jr., Amro EL-Jaroudi J.Robert, Boston 'An Algorithm To Determine Hidden Markov Model Topology,' IEEE, pp. 3577-3579, 1996
14 Belen Martin Barragan, 'Binarized Support Vector Machines: detecting relevant variables and interactions,' International Workshop on Feature Selection for Data Mining:Interfacing Machine Learning and Statistics(FSDM), 2006
15 D. Li, A. Biem and J. Subrahmonia, 'HMM Topology Optimization for Handwriting Recognition,' ICASSP 2001
16 L.R. Bahl, P. F. Brown, P. V. de Souza, and R. L. Mercer, 'Maximum mutual information estimation of hidden Markov model parameters for speech recognition,' in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Tokyo, Japan, 1986, pp. 49-52
17 박미나, 하진영, 'HMM 모델링을 위한 HMM의 State수와 Mixture 수 분석', 한국정보과학회 춘계 학술발표논문집, 제29권 제1호, pp. 658-660, 2002년 4월
18 H.Singer and M, ostendorf, 'Maximum likelihood successive state splitting,' ICASSP, pp. 601-604, 1996
19 MacKay, 'Bayesian interpolation,' Neural computation, Vol.4, No.3, pp. 415-447, 1992   DOI
20 L. R. Bahl, P. F. Brown, P. V. de Souza, and R. L. Mercer, 'Maximum mutual information estimation of hidden Markov model parameters for speech recognition,' Proc. ICASSP-86, pp. 49-52, 1986
21 Steve Young, Dan KerShaw, Julian Odell, Dave Ollason, Valtcho Valthchev, Phil Woodland, The HTK Book(for HTK version 3.0), Microsoft Corporation, 1999
22 E. Kass and A. Raftery. 'Bayes factors,' Technical Report 254, University of Washington, Department of Statistics. 1994
23 Walter Zucchini, 'An Introduction to Model Selection,' Journal of Mathematical Psychology 44, 41-61, 2000   DOI   ScienceOn
24 R. A. Fisher, 'The use of multiple measure in taxonomic problems,' Cont. Math. Stat., pp. 32. 179-32. 188, 1950
25 하진영, 신봉기, '온라인 한글 인식을 위한 HMM 상태 수의 최적화', 한국정보과학회 추계 학술발표논문집, pp. 372-374, 1998