• Title/Summary/Keyword: Disease prediction

Search Result 541, Processing Time 0.027 seconds

DNA methylation-based age prediction from various tissues and body fluids

  • Jung, Sang-Eun;Shin, Kyoung-Jin;Lee, Hwan Young
    • BMB Reports
    • /
    • v.50 no.11
    • /
    • pp.546-553
    • /
    • 2017
  • Aging is a natural and gradual process in human life. It is influenced by heredity, environment, lifestyle, and disease. DNA methylation varies with age, and the ability to predict the age of donor using DNA from evidence materials at a crime scene is of considerable value in forensic investigations. Recently, many studies have reported age prediction models based on DNA methylation from various tissues and body fluids. Those models seem to be very promising because of their high prediction accuracies. In this review, the changes of age-associated DNA methylation and the age prediction models for various tissues and body fluids were examined, and then the applicability of the DNA methylation-based age prediction method to the forensic investigations was discussed. This will improve the understandings about DNA methylation markers and their potential to be used as biomarkers in the forensic field, as well as the clinical field.

A Study on the Comparison of Predictive Models of Cardiovascular Disease Incidence Based on Machine Learning

  • Ji Woo SEOK;Won ro LEE;Min Soo KANG
    • Korean Journal of Artificial Intelligence
    • /
    • v.11 no.1
    • /
    • pp.1-7
    • /
    • 2023
  • In this paper, a study was conducted to compare the prediction model of cardiovascular disease occurrence. It is the No.1 disease that accounts for 1/3 of the world's causes of death, and it is also the No. 2 cause of death in Korea. Primary prevention is the most important factor in preventing cardiovascular diseases before they occur. Early diagnosis and treatment are also more important, as they play a role in reducing mortality and morbidity. The Results of an experiment using Azure ML, Logistic Regression showed 88.6% accuracy, Decision Tree showed 86.4% accuracy, and Support Vector Machine (SVM) showed 83.7% accuracy. In addition to the accuracy of the ROC curve, AUC is 94.5%, 93%, and 92.4%, indicating that the performance of the machine learning algorithm model is suitable, and among them, the results of applying the logistic regression algorithm model are the most accurate. Through this paper, visualization by comparing the algorithms can serve as an objective assistant for diagnosis and guide the direction of diagnosis made by doctors in the actual medical field.

Predicting idiopathic pulmonary fibrosis (IPF) disease in patients using machine approaches

  • Ali, Sikandar;Hussain, Ali;Kim, Hee-Cheol
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.144-146
    • /
    • 2021
  • Idiopathic pulmonary fibrosis (IPF) is one of the most dreadful lung diseases which effects the performance of the lung unpredictably. There is no any authentic natural history discovered yet pertaining to this disease and it has been very difficult for the physicians to diagnosis this disease. With the advent of Artificial intelligent and its related technologies this task has become a little bit easier. The aim of this paper is to develop and to explore the machine learning models for the prediction and diagnosis of this mysterious disease. For our study, we got IPF dataset from Haeundae Paik hospital consisting of 2425 patients. This dataset consists of 502 features. We applied different data preprocessing techniques for data cleaning while making the data fit for the machine learning implementation. After the preprocessing of the data, 18 features were selected for the experiment. In our experiment, we used different machine learning classifiers i.e., Multilayer perceptron (MLP), Support vector machine (SVM), and Random forest (RF). we compared the performance of each classifier. The experimental results showed that MLP outperformed all other compared models with 91.24% accuracy.

  • PDF

Air Pollution Risk Prediction System Utilizing Deep Learning Focused on Cardiovascular Disease

  • Lee, Jisu;Moon, Yoo-Jin
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.12
    • /
    • pp.267-275
    • /
    • 2022
  • This paper proposed a Deep Neural Network Model system utilizing Keras for predicting air pollution risk of the cardiovascular disease through the effect of each component of air on the harmful virus using past air information, with analyzing 18,000 data sets of the Seoul Open Data Plaza. By experiments, the model performed tasks with higher accuracy when using methods of sigmoid, binary_crossentropy, adam, and accuracy through 3 hidden layers with each 8 nodes, resulting in 88.92% accuracy. It is meaningful in that any respiratory disease can utilize the risk prediction system if there are data on the effects of each component of air pollution and fine dust on oil-borne diseases. It can be further developed to provide useful information to companies that produce masks and air purification products.

Predictors and management of intravenous immunoglobulin-resistant Kawasaki disease

  • Song, Min Seob
    • Clinical and Experimental Pediatrics
    • /
    • v.62 no.4
    • /
    • pp.119-123
    • /
    • 2019
  • Kawasaki disease (KD) is a systemic vasculitis that mainly affects younger children. Intravenous immunoglobulin (IVIG) resistant cases are at increasing risk for coronary artery complications. The strategy on prediction of potential nonresponders and treatment of IVIG-resistant patients is now controversial. In this review the definition and predictors of IVIG-resistant KD and current evidence to guide management are discussed.

Remote Health Monitoring of Parkinson's Disease Severity Using Signomial Regression Model (파킨슨병 원격 진단을 위한 Signomial 회귀 모형)

  • Jeong, Young-Seon;Lee, Chung-Mok;Kim, Nor-Man;Lee, Kyung-Sik
    • IE interfaces
    • /
    • v.23 no.4
    • /
    • pp.365-371
    • /
    • 2010
  • In this study, we propose a novel remote health monitoring system to accurately predict Parkinson's disease severity using a signomial regression method. In order to characterize the Parkinson's disease severity, sixteen biomedical voice measurements associated with symptoms of the Parkinson's disease, are used to develop the telemonitoring model for early detection of the Parkinson's disease. The proposed approach could be utilized for not only prediction purposes, but also interpretation purposes in practice, providing an explicit description of the resulting function in the original input space. Compared to the accuracy performance with the existing methods, the proposed algorithm produces less error rate for predicting Parkinson's disease severity.

Prediction of Quantitative Traits Using Common Genetic Variants: Application to Body Mass Index

  • Bae, Sunghwan;Choi, Sungkyoung;Kim, Sung Min;Park, Taesung
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.149-159
    • /
    • 2016
  • With the success of the genome-wide association studies (GWASs), many candidate loci for complex human diseases have been reported in the GWAS catalog. Recently, many disease prediction models based on penalized regression or statistical learning methods were proposed using candidate causal variants from significant single-nucleotide polymorphisms of GWASs. However, there have been only a few systematic studies comparing existing methods. In this study, we first constructed risk prediction models, such as stepwise linear regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN), using a GWAS chip and GWAS catalog. We then compared the prediction accuracy by calculating the mean square error (MSE) value on data from the Korea Association Resource (KARE) with body mass index. Our results show that SLR provides a smaller MSE value than the other methods, while the numbers of selected variables in each model were similar.

Prediction of Intravenous Immunoglobulin Nonresponse Kawasaki Disease in Korea (한국인에서 면역글로불린-저항성 가와사키병 환자의 예측)

  • Choi, Myung Hyun;Park, Chung Soo;Kim, Dong Soo;Kim, Ki Hwan
    • Pediatric Infection and Vaccine
    • /
    • v.21 no.1
    • /
    • pp.29-36
    • /
    • 2014
  • Purpose: The objective of this study was to find the predictors and generate a prediction scoring model of nonresponse to intravenous immunoglobulin in patients with Kawasaki disease. Methods: We examined 573 children diagnosed with KD at the Severance Children's Hospital between January 2009 and december 2012. We retrospectively reviewed their medical records. These patients were divided into 2 groups; the experimental group (N=433) and the validation group (N=140). Each group were divided into 2 groups the intravenous immunoglobulin nonresponders and the responders. Multivariate logistic regression analysis identified predictive factors of intravenous immunoglobulin nonresponders which make predictive scoring model. We practice internal validation and external validation. Results: Multivariate logistic regression analysis identified male, cervical lymphadenopathy, changes of the extremities, platelet, total bilirubin, alkaline phophatase, lactate dehydrogenase, C-reactive protein as significant predictors for nonresponse to intravenous immunoglobulin. We generated prediction score assigning 1 point for (1) male, (2) cervical lymphadenopathy, (3) changes of the extremities, (4) platelet (${\leq}368,000/mm^3$), (5) total bilirubin (${\geq}0.4mg/dL$), (6) alkaline phophatase (${\geq}227IU/L$), (7) lactate dehydrogenase (${\geq}268IU/L$), (8) C-reactive protein (>77.1 mg/dL). Using a cut-off point of 4 and more with this prediction score, we could identify the intravenous immunoglobulin nonresponder group. Sensitivity and specificity were 52.5% and 82.4% in experimental group and 37.8% and 81.8% in validation group, respectively. Conclusion: Our predictive scoring models had high specificity and low sensitivity in Korean patients. Therefore it is useful in predicting nonresponse to intravenous immunoglobulin with Kawasaki disease.

Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks

  • Kim, Joo-Chang;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.4
    • /
    • pp.2060-2077
    • /
    • 2019
  • Recently, mobile healthcare services have attracted significant attention because of the emerging development and supply of diverse wearable devices. Smartwatches and health bands are the most common type of mobile-based wearable devices and their market size is increasing considerably. However, simple value comparisons based on accumulated data have revealed certain problems, such as the standardized nature of health management and the lack of personalized health management service models. The convergence of information technology (IT) and biotechnology (BT) has shifted the medical paradigm from continuous health management and disease prevention to the development of a system that can be used to provide ground-based medical services regardless of the user's location. Moreover, the IT-BT convergence has necessitated the development of lifestyle improvement models and services that utilize big data analysis and machine learning to provide mobile healthcare-based personal health management and disease prevention information. Users' health data, which are specific as they change over time, are collected by different means according to the users' lifestyle and surrounding circumstances. In this paper, we propose a prediction model of user physical activity that uses data characteristics-based long short-term memory (DC-LSTM) recurrent neural networks (RNNs). To provide personalized services, the characteristics and surrounding circumstances of data collectable from mobile host devices were considered in the selection of variables for the model. The data characteristics considered were ease of collection, which represents whether or not variables are collectable, and frequency of occurrence, which represents whether or not changes made to input values constitute significant variables in terms of activity. The variables selected for providing personalized services were activity, weather, temperature, mean daily temperature, humidity, UV, fine dust, asthma and lung disease probability index, skin disease probability index, cadence, travel distance, mean heart rate, and sleep hours. The selected variables were classified according to the data characteristics. To predict activity, an LSTM RNN was built that uses the classified variables as input data and learns the dynamic characteristics of time series data. LSTM RNNs resolve the vanishing gradient problem that occurs in existing RNNs. They are classified into three different types according to data characteristics and constructed through connections among the LSTMs. The constructed neural network learns training data and predicts user activity. To evaluate the proposed model, the root mean square error (RMSE) was used in the performance evaluation of the user physical activity prediction method for which an autoregressive integrated moving average (ARIMA) model, a convolutional neural network (CNN), and an RNN were used. The results show that the proposed DC-LSTM RNN method yields an excellent mean RMSE value of 0.616. The proposed method is used for predicting significant activity considering the surrounding circumstances and user status utilizing the existing standardized activity prediction services. It can also be used to predict user physical activity and provide personalized healthcare based on the data collectable from mobile host devices.