DOI QR코드

DOI QR Code

Feature Selection and Hyper-Parameter Tuning for Optimizing Decision Tree Algorithm on Heart Disease Classification

  • Tsehay Admassu Assegie (Aksum University, Department of Computer Science) ;
  • Sushma S.J (Department of Electronics and Communication Engineering, GSSS Institute of Engineering and Technology for Women) ;
  • Bhavya B.G (Computer Science and Engineering, Vidyavardhaka College of Engineering) ;
  • Padmashree S (Department of Electronics and Communication Engineering, GSSS Institute of Engineering and Technology for Women)
  • Received : 2024.02.05
  • Published : 2024.02.29

Abstract

In recent years, there are extensive researches on the applications of machine learning to the automation and decision support for medical experts during disease detection. However, the performance of machine learning still needs improvement so that machine learning model produces result that is more accurate and reliable for disease detection. Selecting the hyper-parameter that could produce the possible maximum classification accuracy on medical dataset is the most challenging task in developing decision support systems with machine learning algorithms for medical dataset classification. Moreover, selecting the features that best characterizes a disease is another challenge in developing machine-learning model with better classification accuracy. In this study, we have proposed an optimized decision tree model for heart disease classification by using heart disease dataset collected from kaggle data repository. The proposed model is evaluated and experimental test reveals that the performance of decision tree improves when an optimal number of features are used for training. Overall, the accuracy of the proposed decision tree model is 98.2% for heart disease classification.

Keywords

References

  1. Wan Hajarul Asikin Wan Zunaidi, RD Rohmat Saedudin, Zuraini Ali Shah, Shahreen Kasim, Choon Sen Seah, Maman Abdurohman, Performances Analysis of Heart Disease Dataset using Different Data Mining Classifications, International Journal on Advanced Science Engineering and Information Technology, 2018.
  2. Assegie Tsehay Admasssu, A support vector machine based heart disease prediction, Journal of Software Engineering & Intelligent Systems, 2019.
  3. Senthilkumar Mohan, Chandrasegar Thirumalai, Gautam Srivastava, Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques, IEEE, 2019.
  4. Moloud Abdar, Elham Nasarian, Vivi Nur Wijayaningrum, Xujuan Zhou, Performance Improvement of Decision Trees for Diagnosis of Coronary Artery Disease Using Multi Filtering Approach, IEEE,2019.
  5. Divya Krishnani, Anjali Kumari, Akash Dewangan, Aditya Singh, Nenavath Srinivas Naik, Prediction of Coronary Heart Disease using Supervised Machine Learning Algorithms, IEEE, 2019.
  6. Rahul Katarya, Polipireddy Srinivas, Predicting Heart Disease at Early Stages using Machine Learning: A Survey, IEEE, 2020.
  7. Isra'a Ahmed Zriqat, Ahmad Mousa Altamimi, Mohammad Azze, A Comparative Study for Predicting Heart Diseases Using Data Mining Classification Methods, International Journal of Computer Science and Information Security (IJCSIS), Vol. 14, No. 12, December 2016.
  8. Noor Basha, Gopal Krishna C, Ashok Kumar P S, Venkatesh P, Early Detection of Heart Syndrome Using Machine Learning Technique, International Conference on Electrical, Electronics, Communication, Computer Technologies and Optimization Techniques, IEEE, 2019.
  9. Nikhil Gawande, Alka Barhatte, Heart Diseases Classification using Convolutional Neural Network, Proceedings of the 2nd International Conference on Communication and Electronics Systems, IEEE, 2017.
  10. Kathleen H. Miaoa, Julia H. Miao, Coronary Heart Disease Diagnosis using Deep Neural Networks, International Journal of Advanced Computer Science and Applications, Vol. 9, No. 10, 2018.
  11. Divyansh Khanna, Rohan Sahu, Veeky Baths, and Bharat Deshpande, Comparative Study of Classification Techniques (SVM, Logistic Regression and Neural Networks) to Predict the Prevalence of Heart Disease, International Journal of Machine Learning and Computing, Vol. 5, No. 5, October 2015.
  12. Amin Ul Haq, Jian Ping Li, Muhammad Hammad Memon, Shah Nazir, Ruinan Sun, A Hybrid Intelligent System Framework for the Prediction of Heart Disease Using Machine Learning Algorithms, Hindawi Mobile Information Systems Volume 2018.
  13. Aufzalina Mohd Yusof, Nor Azura Md. Ghani, Khairul Asri Mohd Ghani, Khairul Izan Mohd Ghani, A predictive model for prediction of heart surgery procedure, Indonesian Journal of Electrical Engineering and Computer Science Vol. 15, No. 3, September 2019.
  14. R. Chitra and Dr.V. Seenivasagam, Heart Disease Prediction System Using Supervised Learning Classifier, Bonfring International Journal of Software Engineering and Soft Computing, Vol. 3, No. 1, March 2013.
  15. V. Krishnaiah, G. Narsimha, N. Subhash Chandra, Heart Disease Prediction System using Data Mining Techniques and Intelligent Fuzzy Approach: A Review, International Journal of Computer Applications (0975 - 8887) Volume 136 - No.2, February 2016.
  16. R. Subha, K. Anandakumar, A. Bharathi, Study on Cardiovascular Disease Classification Using Machine Learning Approaches, International Journal of Applied Engineering Research ISSN 0973-4562 Volume 11, Number 6, 2016.
  17. Assegie, T.A, An optimized K-Nearest Neighbor based breast cancer detection, Journal of Robotics and Control (JRC) Volume 2, Issue 3, May 2020.
  18. Assegie, T.A, Sushma S.J, A Support Vector Machine and Decision Tree Based Breast Cancer Prediction, International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249 - 8958, Volume-9 Issue-3, February, 2020.
  19. Assegie, T.A, Nair, P.S, The Performance of Different Machine Learning Models On Diabetes Prediction, International Journal Of Scientific & Technology Research Volume 9, Issue 01, January 2020.