Application of Random Forest Algorithm for the Decision Support System of Medical Diagnosis with the Selection of Significant Clinical Test

Yun, Tae-Gyun;Yi, Gwan-Su;

The Transactions of The Korean Institute of Electrical Engineers (전기학회논문지)

Volume 57, Issue 6 / Pages 1058-1062 / 2008 / 1975-8359 (pISSN) / 2287-4364 (eISSN)

The Korean Institute of Electrical Engineers (대한전기학회)

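Korean title: 의료진단 및 중요 검사 항목 결정 지원 시스템을 위한 랜덤 포레스트 알고리즘 적용 (Application of the Random Forest Algorithm to a Support System for Medical Diagnosis and the Selection of Important Test Items)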

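Yun, Tae-Gyun (School of Engineering, Information and Communications University);
Yi, Gwan-Su (School of Engineering, Information and Communications University)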

Published : 2008.06.01

Abstract

In a clinical decision support system (CDSS), an appropriate data-driven machine learning method, unlike a rule-based expert method, can readily provide information about the contribution of each individual feature (clinical test) to disease classification. However, currently developed methods focus mainly on improving the classification accuracy of the diagnosis. By analyzing feature importance in classification, one may infer novel sets of clinical tests that strongly differentiate specific diseases or disease states. Against this background, we introduce a novel CDSS that integrates a classifier and a feature selection module. The random forest algorithm serves as both the classifier and the feature importance measure. The system selects the significant clinical tests that discriminate the diseases by examining the classification error during backward elimination of the features. The performance of the random forest algorithm in clinical classification was compared with that of artificial neural network and decision tree algorithms on the breast cancer, diabetes, and heart disease data sets from the UCI Machine Learning Repository, and was found to be superior. Tests on the same data sets show that the proposed system can successfully select the significant set of clinical tests for each disease.
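The selection procedure described in the abstract (rank features with a random forest, remove the weakest one, and track the classification error) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes Python with scikit-learn, uses the Wisconsin diagnostic breast cancer data bundled with scikit-learn as a stand-in for the UCI data sets, takes the out-of-bag error as the classification error, and uses the impurity-based `feature_importances_` as the importance measure. The function name `backward_elimination` and the parameters `n_trees` and `seed` are hypothetical.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier


def backward_elimination(X, y, names, n_trees=50, seed=0):
    """Drop the least important feature each round and record the OOB error."""
    active = list(range(X.shape[1]))  # column indices still in use
    history = []                      # (feature names in use, out-of-bag error)
    while active:
        forest = RandomForestClassifier(
            n_estimators=n_trees, oob_score=True, random_state=seed
        )
        forest.fit(X[:, active], y)
        history.append(([names[i] for i in active], 1.0 - forest.oob_score_))
        # Assumption: impurity-based importance stands in for the paper's measure.
        weakest = int(np.argmin(forest.feature_importances_))
        active.pop(weakest)
    return history


data = load_breast_cancer()
history = backward_elimination(data.data, data.target, list(data.feature_names))

# Keep the smallest feature subset that reaches the lowest observed error.
best_error = min(err for _, err in history)
selected = min((feats for feats, err in history if err == best_error), key=len)
print(f"lowest OOB error: {best_error:.3f} with {len(selected)} features")
```

Each round refits the forest on the surviving features, so the importance ranking is recomputed after every removal rather than fixed once at the start. Which of the two strategies the paper follows is not stated in the abstract.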
