Browse > Article

Improving Classification Performance for Data with Numeric and Categorical Attributes Using Feature Wrapping  

Lee, Jae-Sung (중앙대학교 컴퓨터공학과)
Kim, Dae-Won (중앙대학교 컴퓨터공학과)
Abstract
In this letter, we evaluate the classification performance of mixed numeric and categorical data for comparing the efficiency of feature filtering and feature wrapping. Because the mixed data is composed of numeric and categorical features, the feature selection method was applied to data set after discretizing the numeric features in the given data set. In this study, we choose the feature subset for improving the classification performance of the data set after preprocessing. The experimental result of comparing the classification performance show that the feature wrapping method is more reliable than feature filtering method in the aspect of classification accuracy.
Keywords
Classification; Mixed-type data; Feature wrapping;
Citations & Related Records
연도 인용수 순위
  • Reference
1 K. Cios and G. W. Moore, 'Uniqueness of Medical Data Mining,' Artificial Intelligence in Medicine journal, vol.26, no.1, pp.1-24, Sep, 2002   DOI   ScienceOn
2 E. Tuv, A. Borisov and K. Torkkola, 'Unsupervised learning with mixed numeric and nominal data,' IEEE Transactions on Knowledge and Data Engineering, vol.14, no.4, pp.673-690 2002   DOI   ScienceOn
3 A. Asuncion, and D. J. Newman, 'UCI Machine Learning Repository [http://www.ics.uci.edu/mlearn/MLRepository .html],' irvine, CA: University of California, School of Information and Computer Science, 2007
4 Y. Su, T. M. Murali, V. Pavlovic, M. Schaffer, and Simon Kasif, 'Rankgene: a program to rank genes from expression data,' Bioinformatics, vol. 19, no.12, pp.1578-1579, Jan, 2003   DOI   ScienceOn
5 D. R. Wilson, and T. R. Martinez, 'Improved Heterogeneous Distance Functions,' Journal of Artificial Intelligence Research, vol.6, no.1, pp.1-34, Jun, 1997
6 Z. Sun, G. Bebis, and R. Miller, 'Object detection using feature subset selection,' Pattern recognition, vol.37, no.11, pp.2165-2176, Nov, 2004   DOI   ScienceOn