• 제목/요약/키워드: Disease Prediction

검색결과 540건 처리시간 0.03초

DNA methylation-based age prediction from various tissues and body fluids

  • Jung, Sang-Eun;Shin, Kyoung-Jin;Lee, Hwan Young
    • BMB Reports
    • /
    • 제50권11호
    • /
    • pp.546-553
    • /
    • 2017
  • Aging is a natural and gradual process in human life. It is influenced by heredity, environment, lifestyle, and disease. DNA methylation varies with age, and the ability to predict the age of donor using DNA from evidence materials at a crime scene is of considerable value in forensic investigations. Recently, many studies have reported age prediction models based on DNA methylation from various tissues and body fluids. Those models seem to be very promising because of their high prediction accuracies. In this review, the changes of age-associated DNA methylation and the age prediction models for various tissues and body fluids were examined, and then the applicability of the DNA methylation-based age prediction method to the forensic investigations was discussed. This will improve the understandings about DNA methylation markers and their potential to be used as biomarkers in the forensic field, as well as the clinical field.

A Study on the Comparison of Predictive Models of Cardiovascular Disease Incidence Based on Machine Learning

  • Ji Woo SEOK;Won ro LEE;Min Soo KANG
    • 한국인공지능학회지
    • /
    • 제11권1호
    • /
    • pp.1-7
    • /
    • 2023
  • In this paper, a study was conducted to compare the prediction model of cardiovascular disease occurrence. It is the No.1 disease that accounts for 1/3 of the world's causes of death, and it is also the No. 2 cause of death in Korea. Primary prevention is the most important factor in preventing cardiovascular diseases before they occur. Early diagnosis and treatment are also more important, as they play a role in reducing mortality and morbidity. The Results of an experiment using Azure ML, Logistic Regression showed 88.6% accuracy, Decision Tree showed 86.4% accuracy, and Support Vector Machine (SVM) showed 83.7% accuracy. In addition to the accuracy of the ROC curve, AUC is 94.5%, 93%, and 92.4%, indicating that the performance of the machine learning algorithm model is suitable, and among them, the results of applying the logistic regression algorithm model are the most accurate. Through this paper, visualization by comparing the algorithms can serve as an objective assistant for diagnosis and guide the direction of diagnosis made by doctors in the actual medical field.

Predicting idiopathic pulmonary fibrosis (IPF) disease in patients using machine approaches

  • Ali, Sikandar;Hussain, Ali;Kim, Hee-Cheol
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2021년도 춘계학술대회
    • /
    • pp.144-146
    • /
    • 2021
  • Idiopathic pulmonary fibrosis (IPF) is one of the most dreadful lung diseases which effects the performance of the lung unpredictably. There is no any authentic natural history discovered yet pertaining to this disease and it has been very difficult for the physicians to diagnosis this disease. With the advent of Artificial intelligent and its related technologies this task has become a little bit easier. The aim of this paper is to develop and to explore the machine learning models for the prediction and diagnosis of this mysterious disease. For our study, we got IPF dataset from Haeundae Paik hospital consisting of 2425 patients. This dataset consists of 502 features. We applied different data preprocessing techniques for data cleaning while making the data fit for the machine learning implementation. After the preprocessing of the data, 18 features were selected for the experiment. In our experiment, we used different machine learning classifiers i.e., Multilayer perceptron (MLP), Support vector machine (SVM), and Random forest (RF). we compared the performance of each classifier. The experimental results showed that MLP outperformed all other compared models with 91.24% accuracy.

  • PDF

Air Pollution Risk Prediction System Utilizing Deep Learning Focused on Cardiovascular Disease

  • Lee, Jisu;Moon, Yoo-Jin
    • 한국컴퓨터정보학회논문지
    • /
    • 제27권12호
    • /
    • pp.267-275
    • /
    • 2022
  • 이 논문은 대기오염의 심장병에 대한 위험도를 예측하기 위하여 Keras를 활용한 Deep Neural Network Model 시스템을 제안하였다. 연구 데이터로 서울열린데이터광장의 서울시 기간별 시간평균 대기환경 데이터 18,000개의 데이터 셋을 분석하여, 심장병 질병에 미치는 영향에 대한 정보를 얻을 수 있었다. 이 모델은 각각 8개의 노드를 가진 3개의 은닉층, Sigmoid, Binary_crossentropy, Adam과 Accuracy를 사용했을 때 88.92%의 높은 정확도를 얻을 수 있었다. 이 시스템은 각 지역별 대기오염에 따른 심장병 질병 위험도를 예측하여 유용한 질병 예방의 지표로 활용 가능하다고 사료되고, 대기오염과 미세먼지의 각 성분이 유해질환에 미치는 영향에 대한 데이터만 존재한다면 어떠한 호흡기 질환이든 위험도 예측 결과를 알 수 있다는 것에 의미가 있다. 이 시스템을 더욱 발전시킨다면, 마스크 및 공기정화제품 생산기업에게 유용한 정보를 제공하여 기업의 기술개발에 도움이 될 수 있다고 사료된다.

Predictors and management of intravenous immunoglobulin-resistant Kawasaki disease

  • Song, Min Seob
    • Clinical and Experimental Pediatrics
    • /
    • 제62권4호
    • /
    • pp.119-123
    • /
    • 2019
  • Kawasaki disease (KD) is a systemic vasculitis that mainly affects younger children. Intravenous immunoglobulin (IVIG) resistant cases are at increasing risk for coronary artery complications. The strategy on prediction of potential nonresponders and treatment of IVIG-resistant patients is now controversial. In this review the definition and predictors of IVIG-resistant KD and current evidence to guide management are discussed.

파킨슨병 원격 진단을 위한 Signomial 회귀 모형 (Remote Health Monitoring of Parkinson's Disease Severity Using Signomial Regression Model)

  • 정영선;이충목;;이경식
    • 산업공학
    • /
    • 제23권4호
    • /
    • pp.365-371
    • /
    • 2010
  • In this study, we propose a novel remote health monitoring system to accurately predict Parkinson's disease severity using a signomial regression method. In order to characterize the Parkinson's disease severity, sixteen biomedical voice measurements associated with symptoms of the Parkinson's disease, are used to develop the telemonitoring model for early detection of the Parkinson's disease. The proposed approach could be utilized for not only prediction purposes, but also interpretation purposes in practice, providing an explicit description of the resulting function in the original input space. Compared to the accuracy performance with the existing methods, the proposed algorithm produces less error rate for predicting Parkinson's disease severity.

Prediction of Quantitative Traits Using Common Genetic Variants: Application to Body Mass Index

  • Bae, Sunghwan;Choi, Sungkyoung;Kim, Sung Min;Park, Taesung
    • Genomics & Informatics
    • /
    • 제14권4호
    • /
    • pp.149-159
    • /
    • 2016
  • With the success of the genome-wide association studies (GWASs), many candidate loci for complex human diseases have been reported in the GWAS catalog. Recently, many disease prediction models based on penalized regression or statistical learning methods were proposed using candidate causal variants from significant single-nucleotide polymorphisms of GWASs. However, there have been only a few systematic studies comparing existing methods. In this study, we first constructed risk prediction models, such as stepwise linear regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN), using a GWAS chip and GWAS catalog. We then compared the prediction accuracy by calculating the mean square error (MSE) value on data from the Korea Association Resource (KARE) with body mass index. Our results show that SLR provides a smaller MSE value than the other methods, while the numbers of selected variables in each model were similar.

한국인에서 면역글로불린-저항성 가와사키병 환자의 예측 (Prediction of Intravenous Immunoglobulin Nonresponse Kawasaki Disease in Korea)

  • 최명현;박청수;김동수;김기환
    • Pediatric Infection and Vaccine
    • /
    • 제21권1호
    • /
    • pp.29-36
    • /
    • 2014
  • 목적: 이 연구의 목적은 면역글로불린-저항성 가와사키병의 예측 인자를 찾고 점수화된 예측 모델을 만들고자 하는 것이다. 방법: 2009년 1월부터 2012년 12월까지 세브란스 어린이 병원에서 가와사키병으로 진단된 573명의 환자를 대상으로 하였다. 실험군과 검증군으로 나누었고, 각 군들은 면역글로불린-반응성과 저항성으로 나누었다. 실험군에서 면역글로불린의 예측 인자를 찾았고, 점수화된 예측모델을 만들었다. 그리고 외적, 내적 타당성 검증을 시행하였다. 결과: 남성, 경부림프절종대, 손과 발의 변화, 혈소판, 총빌리루빈, 젖산탈수효소, CRP가 면역글로불린-저항성 가와사키병의 예측 인자로 나타났다. 점수화된 예측 모델을 만들었고, 민감도와 특이도가 실험군에서는 52.5%와 82.4%, 검증군에서는 37.8%와 81.8%로 나타났다. 결론: 우리의 점수화된 예측 모델은 한국 환자에 적용하였을 때 높은 특이도와 낮은 민감도를 갖는다.

Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks

  • Kim, Joo-Chang;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권4호
    • /
    • pp.2060-2077
    • /
    • 2019
  • Recently, mobile healthcare services have attracted significant attention because of the emerging development and supply of diverse wearable devices. Smartwatches and health bands are the most common type of mobile-based wearable devices and their market size is increasing considerably. However, simple value comparisons based on accumulated data have revealed certain problems, such as the standardized nature of health management and the lack of personalized health management service models. The convergence of information technology (IT) and biotechnology (BT) has shifted the medical paradigm from continuous health management and disease prevention to the development of a system that can be used to provide ground-based medical services regardless of the user's location. Moreover, the IT-BT convergence has necessitated the development of lifestyle improvement models and services that utilize big data analysis and machine learning to provide mobile healthcare-based personal health management and disease prevention information. Users' health data, which are specific as they change over time, are collected by different means according to the users' lifestyle and surrounding circumstances. In this paper, we propose a prediction model of user physical activity that uses data characteristics-based long short-term memory (DC-LSTM) recurrent neural networks (RNNs). To provide personalized services, the characteristics and surrounding circumstances of data collectable from mobile host devices were considered in the selection of variables for the model. The data characteristics considered were ease of collection, which represents whether or not variables are collectable, and frequency of occurrence, which represents whether or not changes made to input values constitute significant variables in terms of activity. The variables selected for providing personalized services were activity, weather, temperature, mean daily temperature, humidity, UV, fine dust, asthma and lung disease probability index, skin disease probability index, cadence, travel distance, mean heart rate, and sleep hours. The selected variables were classified according to the data characteristics. To predict activity, an LSTM RNN was built that uses the classified variables as input data and learns the dynamic characteristics of time series data. LSTM RNNs resolve the vanishing gradient problem that occurs in existing RNNs. They are classified into three different types according to data characteristics and constructed through connections among the LSTMs. The constructed neural network learns training data and predicts user activity. To evaluate the proposed model, the root mean square error (RMSE) was used in the performance evaluation of the user physical activity prediction method for which an autoregressive integrated moving average (ARIMA) model, a convolutional neural network (CNN), and an RNN were used. The results show that the proposed DC-LSTM RNN method yields an excellent mean RMSE value of 0.616. The proposed method is used for predicting significant activity considering the surrounding circumstances and user status utilizing the existing standardized activity prediction services. It can also be used to predict user physical activity and provide personalized healthcare based on the data collectable from mobile host devices.