Browse > Article

A Performance Study of Gaussian Radial Basis Function Model for the Monk's Problems  

Shin, Mi-Young (School of Electrical Engineering and Computer Science, Kyungpook National University)
Park, Joon-Goo (School of Electrical Engineering and Computer Science, Kyungpook National University)
Publication Information
Abstract
As art analytic method to uncover interesting patterns hidden under a large volume of data, data mining research has been actively done so far in various fields. However, current state-of-the-arts in data mining research have several challenging problems such as being too ad-hoc. The existing techniques are mostly the ones designed for individual problems, so there is no unifying theory applicable for more general data mining problems. In this paper, we address the problem of classification, which is one of significant data mining tasks. Specifically, our objective is to evaluate radial basis function (RBF) model for classification tasks and investigate its usefulness. For evaluation, we analyze the popular Monk's problems which are well-known datasets in data mining research. First, we develop RBF models by using the representational capacity based learning algorithm, and then perform a comparative assessment of the results with other models generated by the existing techniques. Through a variety of experiments, it is empirically shown that the RBF model has not only the superior performance on the Monk's problems but also its modeling process can be controlled in a systematic way, so the RBF model with RC-based algorithm might be a good candidate to handle the current ad-hoc problem.
Keywords
data mining; classification model; radial basis functions; Monk's problems; performance study;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A. L. Goel and Miyoung Shin, 'Radial basis functions: an algebraic approach (with data mining applications),' Tutorial notes in European conference on Machine Learning, Pisa, Italy, September 2004
2 S. Saxon and Alwyn Barry, 'XCS and the Monk's problems in learning classifier systems: from foundations to applications,' P.L. Lanzi et al, Ed., Lecture Notes in Computer Science, vol. 1813, pp. 440-448, 2000
3 M. Casey and K. Ahmad, 'In-situ learning in multi-net systems,' Lecture Notes in Computer Science, vol. 3177, pp. 752-757, 2004   DOI
4 K. Toh, Q-L Tran and O. Srinivasan, 'Benchmarking a reduced multivariate polynomial pattern classifier,' IEEE Trans. on Pattern Anal. and Machine Intelligence, vol.16, no.2, pp. 460-474, 2005
5 S. H. Huang, 'Dimensionality reduction in automatic knowledge acquisition: a simple greedy search approach,' IEEE Transactions on Knowledge and Data Engineering. vol. 16, no. 6, pp. 1364-1373, 2003   DOI   ScienceOn
6 Q.Yang and X. Wu, '10 Challenging problems in data mining research,' in presentation slides of IEEE conference on Data Mining, 15. Dec, 2005
7 S.B. Thrun et al, 'The Monk's problems: a performance comparison of different learning algorithms,' Technical Report CMU-CS-91-197, Carnegie Mellon University. 1991
8 H. Xiong, M.N. S. Swamy, and M. Omair Ahmad, 'Optimizing the kernel in the empirical feature space,' IEEE Transactions on Neural Networks. March 2005
9 M. W. Mitchell, 'An architecture for situated learning agents,' Ph.D. Dissertation, Monash University, Australia, 2003
10 G.V. Kass, 'An Exploratory technique for investigating large quantities of categorical data,' Applied Statistics, pp.119-127, 1980
11 D. Michie, D.J. Spiegelhalter and C.C. Taylor (eds), Machine learning, Neural and Statistical Classification, Ellis Horwood, 1994
12 L. Breiman, J.H. Friedman, R.A. Olshen and C.J. Stone, Classification and Regression Trees, Wadsworth, 1984