Browse > Article

VRIFA: A Prediction and Nonlinear SVM Visualization Tool using LRBF kernel and Nomogram  

Kim, Sung-Chul (POSTECH 컴퓨터공학과)
Yu, Hwan-Jo (POSTECH 컴퓨터공학과)
Publication Information
Abstract
Prediction problems are widely used in medical domains. For example, computer aided diagnosis or prognosis is a key component in a CDSS (Clinical Decision Support System). SVMs with nonlinear kernels like RBF kernels, have shown superior accuracy in prediction problems. However, they are not preferred by physicians for medical prediction problems because nonlinear SVMs are difficult to visualize, thus it is hard to provide intuitive interpretation of prediction results to physicians. Nomogram was proposed to visualize SVM classification models. However, it cannot visualize nonlinear SVM models. Localized Radial Basis Function (LRBF) was proposed which shows comparable accuracy as the RBF kernel while the LRBF kernel is easier to interpret since it can be linearly decomposed. This paper presents a new tool named VRIFA, which integrates the nomogram and LRBF kernel to provide users with an interactive visualization of nonlinear SVM models, VRIFA visualizes the internal structure of nonlinear SVM models showing the effect of each feature, the magnitude of the effect, and the change at the prediction output. VRIFA also performs nomogram-based feature selection while training a model in order to remove noise or redundant features and improve the prediction accuracy. The area under the ROC curve (AUC) can be used to evaluate the prediction result when the data set is highly imbalanced. The tool can be used by biomedical researchers for computer-aided diagnosis and risk factor analysis for diseases.
Keywords
Decision support systems; feature selection; localized radial basis function(LRBF) kernel; nomograms; support vector machines (SVMs); visualization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 G. Cohen, M. Hilario, H. Sax, S. Hugonnet, and A. Geissbuhler, "Learning from imbalanced data in surveillance of nosocomial infection," Artificial Intelligence in Medicine, Vol.37, pp. 7-18, 2006.   DOI
2 T. S. Furey, N. Cristianini, N. Duffy, D. W. Bednarski, M. Schummer, and D. Haussler, "Support vector machine classification and validation of cancer tissue samples using microarray expression data," Bioirformatics, Vol.16, pp. 906-914, 2000.   DOI   ScienceOn
3 C. Cortes and V. Vapnik, "Support vector networks," Machine Learning, Vol.20, pp. 273-297, 1995.
4 V. Vapnik, The Nature of Statistical Learning Theory, New York: Springer-Verlag, 1995.
5 J. H. OH, J. Gao, A. nandi. P. Gurnani, L. Knowles, J. Schorge, and K. P. Rosenblatt. "Multicategory classification using extended SVM-RFE and markov blanket on SELDI-TOF mass spectrometry data," the Institute of Electrical and Electronics Engineers Symposium. Computational Intelligence in Bioinformatics and Computational Biology, 2005.
6 M. E. Mavroforakix, H. V. Georgiou, N. Dimitropuoulox, D. Cavoura, and S. Theodoridis, "Mammographic masses characterization based on localized texture and dataset fractal analysis using linear, neural and support vector machine classifiers," Artificial Intelligence in Medicine, Vol.37, pp. 145-162, 2006.   DOI
7 L. Ramirez, N. G. Durdle, V. J. Raso, and D. L. Hill, "A support vector machines classifier to assess the severity of idiopathic scoliosis from surface topology," the Institute of Electrical and Electronics Engineers Transactions on Information Technology in Biomedicine, Vol.10, No.1, pp. 84-91, 2006.
8 K. Takeuchi and N. Collier, "Bio-medical entity extraction using support vector machines," Artificial Intelligence in Medicine, Vol.33, pp. 125-137, 2005.   DOI
9 T.Arodz, M. Kurdziel, E. O. D. Sevre, and D. A. Yuen, "Pattern recognition techniques for automatic detection of suspicious-looking anomalies in mammograms," Computer Methods and Programs in Biomedicine, Vol.79, pp. 135-149, 2005.   DOI   ScienceOn
10 B. Cho, H. Yu, J. Lee, Y. Chee, and I. Kim, "Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels," the Institute of Electrical and Electronics Engineers Transactions on Information Technology in Biomedicine, 2005.
11 A. Jakulin, M. Mozina, J. Demsar, I. Bratko, and B. Zupan, "Nomograms for visualizing support vector machines," Knowledge Discouery and Data Mining, 2005.
12 C. J. C. Burges, "A tutorial on support vector machines for pattern recognition," Data Mining Knowledge Discovery, Vol.2, pp. 121-167, 1998.   DOI   ScienceOn
13 C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines [Online]," Available: http://www.csie.ntu.edu.tw/-cjlin/libsvm, 2001.
14 I. Guyon, J. Weston, S. Barnhill, and V. Vapnik, "Gene selection for cancer classification using support vector machines," Machine Learning, Vol.46, pp. 389-422, 2002.   DOI
15 I. Guyon and A. Elisseeff, "An introduction to variable and feature selection," the Journal of Machine Learning Research, Vol.3, pp. 1157-1182, 2003.