New Splitting Criteria for Classification Trees

  • Published : 2001.12.01

Abstract

Decision tree methods is the one of data mining techniques. Classification trees are used to predict a class label. When a tree grows, the conventional splitting criteria use the weighted average of the left and the right child nodes for measuring the node impurity. In this paper, new splitting criteria for classification trees are proposed which improve the interpretablity of trees comparing to the conventional methods. The criteria search only for interesting subsets of the data, as opposed to modeling all of the data equally well. As a result, the tree is very unbalanced but extremely interpretable.

Keywords

References

  1. Machine Learning v.24 Technical Note: Some Properties of Splitting Criteria Breiman,L.
  2. Classification and Regression Tress Breiman,L.;Frieman,J.H.;Olshen,R.A.;Stone,C.J.
  3. The Korean Communications in Statistics v.7 interpretation of Data Mining Prediction Model Using Decision Tree Kang,Hyuncheol;Han,Sang Tae;Choi Jong Hoo
  4. C4.5: Programs for Machine Learning Quinlan,J.R.
  5. S-PLUS Guide to Statistical and Mathematical Analysis (Version 3.3) StatSci