Browse > Article

Utilizing Experiences of Supervisor in Sequential Learning for Multilayer Perceptron  

Lee, Jae-Young (경북대학교 컴퓨터과학과)
Kim, Hwang-Soo (경북대학교 컴퓨터학부)
Abstract
Evaluating the level of achievement and providing the knowledge which is appropriate at the evaluated level have great influence in studying of the human beings. This shows the importance of the order of training and the training order should be considered in machine learning. In this research, to assess the influence of the order of training, we propose a method of controlling the order of training samples utilizing the experience of supervisor in the training of MLP. The supervisor finds out the current state of MLP using teaching experience and student evaluation, and then selects the most instructive sample for MLP in that state. We use CRF to represent and utilize the experience of supervisor. While the proposed method is similar to active learning in selecting samples, it is basically different in that selection is not to reduce the number of samples to be used but to assist the learning progress. The result from classification problem shows that the method is usually effective in terms of time taken in training in contrast to random selection.
Keywords
MLP; Training Sequence; Experience; CRF; Classification;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Rahul Gupta, "Conditional random Fields," Technical report, IIT Bombay, 2006.
2 Fei Sha and Fernando Pereira, "Shallow parsing with Conditional Random Fields," Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (NAACL '03), pp.134-141, 2003.
3 Mingkun Li and Ishwar K.Sethi, "Confidence-Based Active Learning," IEEE Transaction on Pattern Analysis and Machine Intelligence, vol.28, no.8, pp.1251-1261, Aug. 2006.
4 Burr Settles, "Active Learning Literature Survey," Computer Sciences Technical Report 1648, University of Wisconsin-Madison. 2009.
5 Fredrik Olsson, "A literature survey of active machine learning in the context of natural language processing," SICS Technical Report T2009:06. Swedish Institute of Computer Science, Kista, Sweden. 2009.
6 J.-Y. Lee, H.-S. Kim, "Effect of Training Sequence Control in On-line Learning for Multilayer Perceptron," Journal of KIISE : Software and Applications, vol.37, no.7, pp.491-502, Jul. 2010. (in Korean)   과학기술학회마을
7 L.R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings of the IEEE, vol.77, no.2, pp.257-286, 1989.   DOI   ScienceOn
8 Andrew McCallum, "MALLET: A Machine Learning for Language Toolkit," http://mallet.cs.umass.edu, 2002.
9 Simon Haykin, Neural Networks:A Comprehensive Foundation, 2nd Ed., Prentice Hall, 1999.
10 David MacKay, Information Theory, Inference and Learning Algorithms, Cambridge University Press. 2003.
11 Corinna Cortes and Vladimir Vapnik, "Support- Vector Networks," Machine Leaming, vol.20, pp.273-297, 1995.
12 R. Klinger and K. Tomanek, "Classical probabilistic models and conditional random fields," Technische University at Dortmund, Electronic Publication, 2007.
13 J. Lafferty, A. Mccallum and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," Proc. 18th International Conf. on Machine Learning, Morgan Kaufmann, San Francisco, CA, pp.282-289, 2001.
14 Burr Settles, "Biomedical Named Entity Recognition using Conditional Random Fields and Rich Feature Sets," International Joint workshop on Natural Language Processing in Biomedicine and its Applications (NLPBA/BioNLP), pp.107-110, 2004.
15 Charles Sutton and Andrew McCallum, "An Introduction to Conditional Random Fields for Relational Learning : Introduction to Statistical Relational Learning," MIT Press, Chap. 4, pp.93-127, November 2007.
16 UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/