Browse > Article
http://dx.doi.org/10.7465/jkdi.2012.23.2.299

A study on decision tree creation using marginally conditional variables  

Cho, Kwang-Hyun (Department of Early Childhood Education, Changwon National University)
Park, Hee-Chang (Department of Statistics, Changwon National University)
Publication Information
Journal of the Korean Data and Information Science Society / v.23, no.2, 2012 , pp. 299-307 More about this Journal
Abstract
Data mining is a method of searching for an interesting relationship among items in a given database. The decision tree is a typical algorithm of data mining. The decision tree is the method that classifies or predicts a group as some subgroups. In general, when researchers create a decision tree model, the generated model can be complicated by the standard of model creation and the number of input variables. In particular, if the decision trees have a large number of input variables in a model, the generated models can be complex and difficult to analyze model. When creating the decision tree model, if there are marginally conditional variables (intervening variables, external variables) in the input variables, it is not directly relevant. In this study, we suggest the method of creating a decision tree using marginally conditional variables and apply to actual data to search for efficiency.
Keywords
Data mining; decision tree; external variable; intervening variable; marginally conditional variables;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Breiman, L., Friedman, J. H., Olshen, R. A. and Stone, C. J. (1984). Classification and regression trees, Wadsworth and books, California.
2 Cho, K. H. and Park, H. C. (2011a). A study on insignificant rules discovery in association rule mining. Journal of the Korean Data & Information Science Society, 22, 81-88.
3 Cho, K. H. and Park, H. C. (2011b). A study on decision tree creation using intervening variable. Journal of the Korean Data & Information Science Society, 22, 671-678.
4 Cho, K. H. and Park, H. C. (2011c). A study on removal of unnecessary input variables using multiple external association rule. Journal of the Korean Data & Information Science Society, 22, 877-884.
5 Cho, K. H. and Park, H. C. (2011d). Discovery of insignificant association rules using external variable. Journal of the Korean Data Analysis Society, 13, 1343-1352.
6 Hartigan, J. A. (1975). Clustering algorithms, John Wiley & Sons, New York.
7 Park, H. C. (2010). Association rule ranking function by decreased lift influence. Journal of the Korean Data & Information Science Society, 21, 397-405.
8 Quinlan, J. R. (1993). C4.5 programs for machine learning, Morgan Kaufmann Publishers, San Francisco.