[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7465/jkdi.2012.23.2.299

A study on decision tree creation using marginally conditional variables

Cho, Kwang-Hyun (Department of Early Childhood Education, Changwon National University)
Park, Hee-Chang (Department of Statistics, Changwon National University)

Publication Information

Journal of the Korean Data and Information Science Society / v.23, no.2, 2012 , pp. 299-307 More about this Journal

Abstract

Data mining is a method of searching for an interesting relationship among items in a given database. The decision tree is a typical algorithm of data mining. The decision tree is the method that classifies or predicts a group as some subgroups. In general, when researchers create a decision tree model, the generated model can be complicated by the standard of model creation and the number of input variables. In particular, if the decision trees have a large number of input variables in a model, the generated models can be complex and difficult to analyze model. When creating the decision tree model, if there are marginally conditional variables (intervening variables, external variables) in the input variables, it is not directly relevant. In this study, we suggest the method of creating a decision tree using marginally conditional variables and apply to actual data to search for efficiency.

Keywords

Data mining; decision tree; external variable; intervening variable; marginally conditional variables;

Citations & Related Records

Times Cited By KSCI : 4 (Citation Analysis)

Reference
Cited By KSCI

1	Breiman, L., Friedman, J. H., Olshen, R. A. and Stone, C. J. (1984). Classification and regression trees, Wadsworth and books, California.
2	Cho, K. H. and Park, H. C. (2011a). A study on insignificant rules discovery in association rule mining. Journal of the Korean Data & Information Science Society, 22, 81-88.
3	Cho, K. H. and Park, H. C. (2011b). A study on decision tree creation using intervening variable. Journal of the Korean Data & Information Science Society, 22, 671-678.
4	Cho, K. H. and Park, H. C. (2011c). A study on removal of unnecessary input variables using multiple external association rule. Journal of the Korean Data & Information Science Society, 22, 877-884.
5	Cho, K. H. and Park, H. C. (2011d). Discovery of insignificant association rules using external variable. Journal of the Korean Data Analysis Society, 13, 1343-1352.
6	Hartigan, J. A. (1975). Clustering algorithms, John Wiley & Sons, New York.
7	Park, H. C. (2010). Association rule ranking function by decreased lift influence. Journal of the Korean Data & Information Science Society, 21, 397-405.
8	Quinlan, J. R. (1993). C4.5 programs for machine learning, Morgan Kaufmann Publishers, San Francisco.

6	Jang Sik Cho. (2013) Journal of the Korean Data and Information Science Society Determinants of student course evaluation using hierarchical linear model / 24 (6) , 1285
3	Hyeonah Park. (2013) Journal of the Korean Data and Information Science Society Usage of auxiliary variable and neural network in doubly robust estimation / 24 (3) , 659
4	Jang Sik Cho. (2014) Journal of the Korean Data and Information Science Society Analysis of employee's characteristic using data visualization / 25 (4) , 727
5	Kwang-Hyun Cho. (2012) Journal of the Korean Data and Information Science Society A study on 3-step complex data mining in society indicator survey / 23 (5) , 983
2	Sungik Park. (2015) Journal of the Korean Data and Information Science Society The study on the determinants of the number of job changes / 26 (2) , 387
1	Jea-Young Lee. (2013) Journal of the Korean Data and Information Science Society Major gene interactions effect identification on the quality of Hanwoo by radial graph / 24 (1) , 151

1	A study on 3-step complex data mining in society indicator survey / [Cho, Kwang-Hyun;Park, Hee-Chang;] / Journal of the Korean Data and Information Science Society
2	Major gene interactions effect identification on the quality of Hanwoo by radial graph / [Lee, Jea-Young;Bae, Jae-Young;Lee, Jin-Mok;Oh, Dong-Yep;Lee, Seong-Won;] / Journal of the Korean Data and Information Science Society
3	Usage of auxiliary variable and neural network in doubly robust estimation / [Park, Hyeonah;Park, Wonjun;] / Journal of the Korean Data and Information Science Society
4	Determinants of student course evaluation using hierarchical linear model / [Cho, Jang Sik;] / Journal of the Korean Data and Information Science Society
5	Analysis of employee's characteristic using data visualization / [Cho, Jang Sik;] / Journal of the Korean Data and Information Science Society

KSCI

A study on decision tree creation using marginally conditional variables 주변조건부 변수를 이용한 의사결정나무모형 생성에 관한 연구

A study on decision tree creation using marginally conditional variables