DOI QR코드

DOI QR Code

A Comparison Study of Classification Algorithms in Data Mining

  • Lee, Seung-Joo (Department of Bioinformatics & Statistics, Cheongju University) ;
  • Jun, Sung-Rae (Department of Bioinformatics & Statistics, Cheongju University)
  • Published : 2008.03.01

Abstract

Generally the analytical tools of data mining have two learning types which are supervised and unsupervised learning algorithms. Classification and prediction are main analysis tools for supervised learning. In this paper, we perform a comparison study of classification algorithms in data mining. We make comparative studies between popular classification algorithms which are LDA, QDA, kernel method, K-nearest neighbor, naive Bayesian, SVM, and CART. Also, we use almost all classification data sets of UCI machine learning repository for our experiments. According to our results, we are able to select proper algorithms for given classification data sets.

Keywords

References

  1. L. Breiman, J. H. Friedman, R. A. Olshen, C. J. Stone, Classification and Regression Trees, Wadsworth Inc., 1984
  2. V. Cherkassky, F. Mulier, Learning From Data Concepts, Theory, and Methods, John Wiley & Sons, 1998
  3. T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning: Data mining, Inference, and Prediction, Springer, 2001
  4. R. A. Johnson, D. W. Wichern, Applied Multivariate Statistical Analysis, Prentice Hall, 1992
  5. P. Giudici, Applied Data Mining, Wiley, 2003
  6. J. Han, M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann, 2001
  7. S. Haykin, Neural Networks, Prentice Hall, 1999
  8. K. B. Korb, A. E. Nicholson, Bayesian Artificial Intelligence, Chapman & Hall, 2004
  9. T. M. Mitchell, Machine Learning, McGraw-Hill, 1997
  10. A. S. Pandya, R. B. Macy, Pattern Recognition with Neural Networks in C++, IEEE Press, 1995
  11. UCI Machine Learning Repository, http://www1.ics.uci.edu/~mlearn
  12. V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, 1995
  13. V. Vapnik, Statistical Learning Theory, John Wiley & Sons, Inc. 1998
  14. V. N. Vapnik, "An Overview of Statistical Learning Theory", IEEE Transactions on Neural Networks, vol. 10, no. 5, pp. 988-999, 1999 https://doi.org/10.1109/72.788640

Cited by

  1. A neuro-genetic controller for nonminimum phase systems vol.6, pp.5, 1995, https://doi.org/10.1109/72.410379
  2. SnO/sub 2/ gas sensing array for combustible and explosive gas leakage recognition vol.2, pp.3, 2002, https://doi.org/10.1109/JSEN.2002.800685