Browse > Article
http://dx.doi.org/10.5351/CKSS.2003.10.1.239

Input Variable Importance in Supervised Learning Models  

Huh, Myung-Hoe (Dept. of Statistics, Korea University)
Lee, Yong Goo (Dept. of Applied Statistics, Chung-Ang University)
Publication Information
Communications for Statistical Applications and Methods / v.10, no.1, 2003 , pp. 239-246 More about this Journal
Abstract
Statisticians, or data miners, are often requested to assess the importances of input variables in the given supervised learning model. For the purpose, one may rely on separate ad hoc measures depending on modeling types, such as linear regressions, the neural networks or trees. Consequently, the conceptual consistency in input variable importance measures is lacking, so that the measures cannot be directly used in comparing different types of models, which is often done in data mining processes, In this short communication, we propose a unified approach to the importance measurement of input variables. Our method uses sensitivity analysis which begins by perturbing the values of input variables and monitors the output change. Research scope is limited to the models for continuous output, although it is not difficult to extend the method to supervised learning models for categorical outcomes.
Keywords
Supervised Learning; Input Variable Importance; Linear Regression; Neural Network; Regression Tree; Sensitivity Analysis; Data Mining;
Citations & Related Records
Times Cited By KSCI : 9  (Citation Analysis)
연도 인용수 순위
1 /
[ Breiman, L.;Friedman, J.H.;Olshen, R.A.;Stone, C.J. ] / Classification and Regression Trees
2 Clementine's neural networks technical overview /
[ Watkins, D. ] / Unpublished White Paper
3 /
[ Sarle, W.S. ] / How to measure importance of inputs? Unpublished White Paper
4 A study on unbiased methods in constructing classification trees /
[ Lee, Y.M.;Song, M.S. ] / Korean Communications in Statistics   과학기술학회마을   DOI   ScienceOn
5 /
[ Hastie, T.;Tibshirani, R.;Friedman, J. ] / The Elements of Statistical Learning
6 A study on variable selection bias in data mining softwares /
[ Song, M.S.;Yoon, Y.J. ] / Korean Journal of Applied Statistics   과학기술학회마을
7 Model selection for tree-structured regression /
[ Kim, S.H. ] / Journal of Korean Statistical Society   과학기술학회마을
8 /
[ SPSS Inc. ] / Clementine 7.0 User's Guide
9 Bayesian analysis for neural network models /
[ Chung, Y.S.;Jung, J.Y.;Kim, C.S. ] / Korean Communications in Statistics   과학기술학회마을   DOI   ScienceOn
10 A combined multiple regression trees predictor for screening large chemical databases /
[ Lim, Y.B.;Lee, S.Y.;Chung, J.H. ] / Korean Journal of Applied Statistics   과학기술학회마을
11 Interpretation of data mining prediction model using decision tree /
[ Kang, H.C.;Han, S.T.;Choi, J.H. ] / Korean Communications in Statistics   과학기술학회마을
12 Tree-structured classification for high risk dental caries /
[ Lee, T.R.;Moon, H.S. ] / Journal of Data Science and Classification (Korean Classification Society)
13 /
[ Ripley, R.D. ] / Pattern Recognition and Neural Network
14 A comparison on the efficiency of data mining softwares /
[ Han, S.T.;Kang, H.C.;Lee, S.K.;Lee, D.K. ] / Korean Journal of Applied Statistics   DOI   ScienceOn
15 Bootstrap model selection criterion for determining the number of hidden units in neural network model /
[ Hwang, C.H.;Kim, D.H. ] / Korean Communications in Statistics   과학기술학회마을