DOI QR코드

DOI QR Code

An Outlier Detection Algorithm and Data Integration Technique for Prediction of Hypertension

고혈압 예측을 위한 이상치 탐지 알고리즘 및 데이터 통합 기법

  • Khongorzul Dashdondov (Department of Computer Engineering, College of IT convergence, Gachon University) ;
  • Mi-Hye Kim (Department of Computer Engineering, Chungbuk National University) ;
  • Mi-Hwa Song (School of Smart IT, Semyung University)
  • Published : 2023.05.18

Abstract

Hypertension is one of the leading causes of mortality worldwide. In recent years, the incidence of hypertension has increased dramatically, not only among the elderly but also among young people. In this regard, the use of machine-learning methods to diagnose the causes of hypertension has increased in recent years. In this study, we improved the prediction of hypertension detection using Mahalanobis distance-based multivariate outlier removal using the KNHANES database from the Korean national health data and the COVID-19 dataset from Kaggle. This study was divided into two modules. Initially, the data preprocessing step used merged datasets and decision-tree classifier-based feature selection. The next module applies a predictive analysis step to remove multivariate outliers using the Mahalanobis distance from the experimental dataset and makes a prediction of hypertension. In this study, we compared the accuracy of each classification model. The best results showed that the proposed MAH_RF algorithm had an accuracy of 82.66%. The proposed method can be used not only for hypertension but also for the detection of various diseases such as stroke and cardiovascular disease.

Keywords