Browse > Article
http://dx.doi.org/10.9708/jksci.2022.27.05.021

Heart Disease Prediction Using Decision Tree With Kaggle Dataset  

Noh, Young-Dan (Dept. of Computer Science, Inha Technical College)
Cho, Kyu-Cheol (Dept. of Computer Science, Inha Technical College)
Abstract
All health problems that occur in the circulatory system are refer to cardiovascular illness, such as heart and vascular diseases. Deaths from cardiovascular disorders are recorded one third of in total deaths in 2019 worldwide, and the number of deaths continues to rise. Therefore, if it is possible to predict diseases that has high mortality rate with patient's data and AI system, they would enable them to be detected and be treated in advance. In this study, models are produced to predict heart disease, which is one of the cardiovascular diseases, and compare the performance of models with Accuracy, Precision, and Recall, with description of the way of improving the performance of the Decision Tree(Decision Tree, KNN (K-Nearest Neighbor), SVM (Support Vector Machine), and DNN (Deep Neural Network) are used in this study.). Experiments were conducted using scikit-learn, Keras, and TensorFlow libraries using Python as Jupyter Notebook in macOS Big Sur. As a result of comparing the performance of the models, the Decision Tree demonstrates the highest performance, thus, it is recommended to use the Decision Tree in this study.
Keywords
heart disease; diseases that has high morality rate; AI; detected; Decision Tree;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Taewoo Lee, Jungwoo Kim, Deokyoo Kim, Jaeho Lee, "Performance Evaluations and Feature Vector by Implementing Hybrid Filter in BLE-Based Fingerprinting", The Journal of Korean Institute of Communications and Information Sciences, Vol. 44, No. 8, pp.1556-1565, Aug 2019. DOI: 10.7840/kics.2019.44.8.1556   DOI
2 Won-Bin Oh, Ill-Soo Kim, Tae-Jong Yun, Bo-Ram Lee, Chung-Woo Lee, Ki-Young Park, Byeong-Ju Jin and Yu-Cheol Lee, "A Study on the Prediction of Real-Time Bead Width Using a DNN Algorithm in GTA Welding", Journal of Welding and Joining, Vol.38, No.6, pp.593-601, Dec 2020. DOI: 10.5781/JWJ.2020.38.6.10   DOI
3 The importance of cardiovascular disease checkup, http://iheartwell.com/htm/community_info_read.php?id=1746&mode=&cate=&page=7&key=&keyword=
4 Altman, N. S., "An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression", The American Statistician, Vol. 46, No. 3, pp.175-185, 1992, DOI: 10.1080/00031305.1992.10475879   DOI
5 Patient data from Kaggle, https://www.kaggle.com/ronitf/heartdisease-uci
6 Gregory A. Roth et al., "Global Burden of Cardiovascular Diseases and Risk Factors, 1990-2019: Update From the GBD 2019 Study", JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, Vol. 76, No. 25, pp.2982-3021, Dec 2020. DOI : 10.1016/j.jacc.2020.11.010   DOI
7 Yongjung Son, Hyunduk Kim, "Forecasting Export & Import Container Cargoes using a Decision Tree Analysis", Journal of Korea Port Economic Association, Vol. 28, No. 4, pp.193-207, Dec 2012.
8 Youtaek Jeon, HyungJun Cho, "Model based hybrid decision tree", Journal of the Korean Data And Information Science Society, Vol. 30, No.3, pp.515-524, May 2019. DOI: 10.7465/jkdi.2019.30.3.515   DOI
9 Kyong-Rok Lee, "A Study on SVM-Based Speaker Classification Using GMM-supervector", Journal of IKEEE, Vol. 24, No.4, pp.102-107, Dec 2002. DOI: 10.7471/ikeee.2020.24.4.1022   DOI
10 Yang Seok Kim, Do Hwan Lee, Seong Kook Kim, "Fault Classification for Rotating Machinery Using Support Vector Machines with Optimal Features Corresponding to Each Fault Type", The Korean Society of Mechanical Engineers, Vol.34, No.11, pp.1681-1689, Nov 2010, DOI:10.3795/KSME-A.2010.34.11.1681   DOI
11 N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, "Dropout: A simple way to prevent neural networks from overfitting", The Journal of Machine Learning Research, Vol. 15, No. 1, pp. 1929-1958, Jan 2014.
12 Yunseok Rhee, "Malicious Code Detection Method Using LSTM Learning on the File Access Behavior", The Journal of Korean Institute of Information Technology, pp.25-32, Vol. 18, No. 2, Feb 2020. DOI : 10.14801/jkiit.2020.18.2.25   DOI
13 Jeong-Il Go, Eui-Young Lee, Min-Jae Lee, Seong-Dae Choi, Jang-Wook Hur, "Corrosion Failure Diagnosis of Rolling Bearing with SVM", Journal of the Korean Society of Manufacturing Process Engineers, Vol. 20, No. 9, pp.35-41, Sep 2021. DOI: 10.14775/ksmpe.2021.20.09.035   DOI
14 Gyeonggon Kim, Chansoo Park, Wooyeong Kim, Jeeyeon Jeon, Miyeon Jeon, Choongsik Bae, "The Effect of Natural Gas Substitution Ratio and Diesel Injection Timing on Accuracy of In-cylinder Pressure Prediction DNN Model from Vibration Signal in a CNG-Diesel Dual-Fuel Engine", Transaction of the Korean Society of Automotive Engineers, Vol. 29, No. 10, pp.909-919, Oct 2021. DOI: 10.7467/KSAE.2021.29.10.909   DOI