DOI QR코드

DOI QR Code

A Study on the Comparison of Predictive Models of Cardiovascular Disease Incidence Based on Machine Learning

  • Received : 2023.01.20
  • Accepted : 2023.03.04
  • Published : 2023.03.30

Abstract

In this paper, a study was conducted to compare the prediction model of cardiovascular disease occurrence. It is the No.1 disease that accounts for 1/3 of the world's causes of death, and it is also the No. 2 cause of death in Korea. Primary prevention is the most important factor in preventing cardiovascular diseases before they occur. Early diagnosis and treatment are also more important, as they play a role in reducing mortality and morbidity. The Results of an experiment using Azure ML, Logistic Regression showed 88.6% accuracy, Decision Tree showed 86.4% accuracy, and Support Vector Machine (SVM) showed 83.7% accuracy. In addition to the accuracy of the ROC curve, AUC is 94.5%, 93%, and 92.4%, indicating that the performance of the machine learning algorithm model is suitable, and among them, the results of applying the logistic regression algorithm model are the most accurate. Through this paper, visualization by comparing the algorithms can serve as an objective assistant for diagnosis and guide the direction of diagnosis made by doctors in the actual medical field.

Keywords

Acknowledgement

This work was supported by the research grant of the KODISA Scholarship Foundation in 2023.

References

  1. Choi. J.S. (2000). Modern statistical analysis using SPSS Ver 10. Bogdoo Publish. pp.514 (in Korean).
  2. Gi-Hun J. (2020). Cardiovascular Disease Prediction using the NHIS Big Data and Machine Learning. Department of Computer Science. Graduate School. Kangwon National University. Korea.
  3. Kaggle. Heart Failure Prediction Dataset. (2021). From https://www.kaggle.com/datasets/fedesoriano/heart-failure-prediction
  4. Lee D. C. Sui X. Church T. S. Lavie C. J. Jackson A. S. & Blair S. N. (2012). Changes in fitness and fatness on the development of cardiovascular disease risk factors: hypertension, metabolic syndrome, and hypercholesterolemia. Journal of the American College of Cardiology. 59(7). pp.665-672. https://doi.org/10.1016/j.jacc.2011.11.013
  5. Li X. and D. Lord and Y. Zhang and Y. Xie. (2008). "Predicting motor vehicle crashes using Support Vector Machine models". Accident Analysis & Prevention. 40(4). pp.1611-1618. https://doi.org/10.1016/j.aap.2008.04.010
  6. Microsoft Azure Machine Learning. (2022). From https://docs.microsoft.com/ko-kr/azure/machine-learning/studio/what-is-ml-studio
  7. Min Soo K. Eun Soo C. (2018). Getting started Machine Learning with MicroSoft AZURE ML. Hanti Media. pp.63-71
  8. Microsoft Azure Machine Learning. (2022). From https://learn.microsoft.com/ko-kr/azure/architecture/data-science-process/prepare-data
  9. National Statistical Office. (2022). 2021 Causes of Death Statistics Results.
  10. Pyoung-Woo P. Min-Koo K. Hong Seok L. Duk-Yong Y. SeokWon L. (2018). A Comparative Study of Machine Learning Algorithms for Diagnosis of Ischemic Heart Disease. Journal of KIISE, 45(4). pp. 376-389. https://doi.org/10.5626/JOK.2018.45.4.376
  11. Pyoung-Woo P. Seok-Won L. (2017). "Classification of Heart Disease Using K-Nearest Neighbor Imputation". Journal of the Korean Society for Information Processing. 24(2). pp.742-745.
  12. Park S. H. and B. Hansen. (2012). "Prediction of Protein-Protein Interaction Sites Based on 3D Surface Patches Using SVM". Korea Information Processing Society. 19(1). pp.21-28. https://doi.org/10.9708/jksci.2014.19.9.021
  13. Sikandar A. (2021). Deep neural network-based clinical decision support and diagnosis system for cardiovascular disease in patients with acute myocardial infarction. Graduate School of Chungbuk National University Cheongju. Korea
  14. Seungseok, W. (2022). Relationship between Cardiovascular Disease Risk Factors & Rate and Physical Fitness. The 6th to 8th Korea National Health and Nutrition Examination Survey (2014-2019) subjects.
  15. Tae-Won K. Yu-Hoe K. (2007). Comparative Analysis Comparative Analysis Analysis of Prediction Prediction Prediction Taekwondo Taekwondo Taekwondo Trainee's Trainee's Defection using Defection using Decision Decision Decision Tree and Logistic Logistic Logistic Regression. The Korea Journal of Sports Science. 17(2). pp.71-83.
  16. Rapportian. (2022). The prevalence of cardiovascular disease increases after COVID-19 infection. From https://www.rapportian.com/.
  17. Wikipedia. (2022). From https://en.wikipedia.org/wiki/Cardiovascular_disease
  18. WHO. (2019). From https://www.who.int/news-room/factsheets/detail/cardiovascular-diseases-(cvds)
  19. Yu Jun, J. (2018). Development of mortality prediction model during admission and 1-year after discharge in acutemyocardial infarction patients using machine learning techniques. Graduate School of Chungbuk National University Cheongju. Korea.
  20. Yun Seon R. Heung Sik C. Sun Woong K. (2016). Prediction of VKOSPI Changes and Application to Actual Option Trading Using SVM. Korea Intelligent Information System Society. 22(4). pp.177-192. https://doi.org/10.13088/jiis.2016.22.4.177