• 제목/요약/키워드: health performance tree

검색결과 62건 처리시간 0.026초

고혈압 발생 예측 모형 개발 (Development of Hypertension Predictive Model)

  • 용왕식;박일수;강성홍;김원중;김공현;김광기;박노례
    • 보건교육건강증진학회지
    • /
    • 제23권4호
    • /
    • pp.13-28
    • /
    • 2006
  • Objectives: This study used the characteristics of the knowledge discovery and data mining algorithms to develop hypertension predictive model for hypertension management using the Korea National Health Insurance Corporation database(the insureds' screening and health care benefit data). Methods: This study validated the predictive power of data mining algorithms by comparing the performance of logistic regression, decision tree, and ensemble technique. On the basis of internal and external validation, it was found that the model performance of logistic regression method was the best among the above three techniques. Results: Major results of logistic regression analysis suggested that the probability of hypertension was: - lower for the female(compared with the male)(OR=0.834) - higher for the persons whose ages were 60 or above(compared with below 40)(OR=4.628) - higher for obese persons(compared with normal persons)(OR= 2.103) - higher for the persons with high level of glucose(compared with normal persons)(OR=1.086) - higher for the persons who had family history of hypertension(compared with the persons who had not)(OR=1.512) - higher for the persons who periodically drank alcohol(compared with the persons who did not)$(OR=1.037{\sim}1.291)$ Conclusions: This study produced several factors affecting the outbreak of hypertension using screening. It is considered to be a contributing factor towards the nation's building of a Hypertension Management System in the near future by bringing forth representative results on the rise and care of hypertension.

지역사회 주민의 심폐소생술 수행 자신감 예측요인 (Predicting Factors on Performance Confidence of Cardiopulmonary Resuscitation in Community Members)

  • 이수진
    • 디지털콘텐츠학회 논문지
    • /
    • 제19권9호
    • /
    • pp.1699-1705
    • /
    • 2018
  • 본 연구는 우리나라 지역주민의 심폐소생술 수행능력 자신감의 특성을 파악하기 위하여 지역사회건강조사 자료를 이차분석한 서술적 조사연구 이다. 연구 대상은 2014년, 2016년에 지역사회건강조사 전체 조사대상자 중 심폐소생술 인지군 357,176명이며 수집된 자료는 SPSS WIN 25.0 프로그램을 이용하여 복합표본 빈도분석과 의사결정나무 분석을 수행하였다. 본 연구의 결과 우리나라 지역주민의 심폐소생술 수행능력 자신감은 심폐소생술 교육 경험이 있을 경우, 최근 2년 동안 마네킹 실습 경험이 있을 경우, 심폐소생술 교육경험이 2년 이내일 경우, 남성, 41.5세 이하일 경우에 높은 것으로 나타났다.

중학교 여학생의 스마트폰 장시간 사용 관련요인 및 고위험군 특성 (The Factors related to Long Hours of Smartphone Usage and the Characteristics of High-risk Group in Female Middle School Students)

  • 박성희;이지선
    • 한국학교보건학회지
    • /
    • 제31권3호
    • /
    • pp.135-145
    • /
    • 2018
  • Purpose: The study aimed to investigate the factors associated with long hours of smartphone usage and to identify the characteristics of the high-risk group among female middle school students in South Korea. Methods: The study analyzed the data of 13,648 female middle school students using their own smartphone extracted from the 13th Youth Health Behavior Online Survey (2017). The factors related to using smartphones for a long time was analyzed by binomial logistic regression. The characteristics of the high-risk group was defined by a decision tree analysis. Results: The average hours spent on smartphone usage was 269.54 minutes per day. The significant factors associated with the long hours of smartphone usage were grade, living with parents, perceived household economic status, perceived academic achievement, stress, sadness and hopelessness, the main purpose of smartphone usage, drinking, body mass index, breakfast, and satisfaction with sleep quality. The subjects showing low academic performance and having breakfast four times a week or less were more likely to use their smartphone for a long time. Conclusion: Based on the results of the research, we need to establish intervention strategies focusing on the factors influencing long-time usage of smartphone. Particularly, the subjects who show poor academic performance and skip breakfast frequently should be considered as the high-risk group for spending long hours on smartphone usage.

딥러닝을 이용한 소외계층 아동의 스포츠 재활치료를 통한 정신 건강에 대한 변화 (Variation for Mental Health of Children of Marginalized Classes through Exercise Therapy using Deep Learning)

  • 김명미
    • 한국전자통신학회논문지
    • /
    • 제15권4호
    • /
    • pp.725-732
    • /
    • 2020
  • 본 논문은 소외계층 아동의 운동학습프로그램에서 체력 활동 중 나를 잘 따른다(0-9), 마음의 결정을 내리는데 많은 시간이 걸린다(0-9), 맥빠진(0-9) 등을 변수로 사용하여 '성별', '체육교실', 나이의 '상중하'를 분류하고 스포츠 재활치료를 통한 자아 탄력(ego-resiliency)과 자아 통제(self-control)의 변화를 관찰하여 정신 건강 변화를 알아본다. 이를 위해 취득한 데이터를 병합하고 Label encoder와 One-hot encoding을 사용하여 숫자의 크고 작음의 특성을 제거한 후 MLP, SVM, Dicesion tree, RNN, LSTM의 각각의 알고리즘을 적용하여 성능을 평가하기 위해 Train, Test 데이터를 75%, 25% 스플릿 한 뒤 Train 데이터로 알고리즘을 학습하고 Test 데이터로 알고리즘의 정확성을 측정한다. 측정 결과 성별에서는 LSTM, 체육 교실은 MLP와 LSTM, 나이는 SVM이 가장 우수한 결과를 보임을 확인하였다.

Word2vec을 이용한 오피니언 마이닝 성과분석 연구 (Performance Analysis of Opinion Mining using Word2vec)

  • 어균선;이건창
    • 한국콘텐츠학회:학술대회논문집
    • /
    • 한국콘텐츠학회 2018년도 춘계 종합학술대회 논문집
    • /
    • pp.7-8
    • /
    • 2018
  • 본 연구에서는 Word2vec을 머신러닝 분류기를 이용해 효율적인 오피니언 마이닝 방법을 제안한다. 본 연구의 목적을 위해 BOW(Bag-of-Words) 방법과 Word2vec방법을 이용해 속성 셋을 구성했다. 구성된 속성 셋은 Decision tree, Logistic regression, Support vector machine, Random forest를 이용해 오피니언 마이닝을 수행했다. 연구 결과, Word2vec 방법과 RF분류기가 가장 높은 정확도를 나타냈다. 그리고 Word2vec 방법이 BOW방법 보다 각 분류기에서 높은 성능을 나타냈다.

  • PDF

한국형 재활환자분류체계 버전 1.0 개발 (The Development of Korean Rehabilitation Patient Group Version 1.0)

  • 황수진;김애련;문선혜;김지희;김진휘;하영혜;양옥영
    • 보건행정학회지
    • /
    • 제26권4호
    • /
    • pp.289-304
    • /
    • 2016
  • Background: Rehabilitations in subacute phase are different from acute treatments regarding the characteristics and required resource consumption of the treatments. Lack of accuracy and validity of the Korean Diagnosis Related Group and Korean Out-Patient Group for the acute patients as the case-mix and payment tool for rehabilitation inpatients have been problematic issues. The objective of the study was to develop the Korean Rehabilitation Patient Group (KRPG) reflecting the characteristics of rehabilitation inpatients. Methods: As a retrospective medical record survey regarding rehabilitation inpatients, 4,207 episodes were collected through 42 hospitals. Considering the opinions of clinical experts and the decision-tree analysis, the variables for the KRPG system demonstrating the characteristics of rehabilitation inpatients were derived, and the splitting standards of the relevant variables were also set. Using the derived variables, we have drawn the rehabilitation inpatient classification model reflecting the clinical situation of Korea. The performance evaluation was conducted on the KRPG system. Results: The KRPG was targeted at the inpatients with brain or spinal cord injury. The etiologic disease, functional status (cognitive function, activity of daily living, muscle strength, spasticity, level and grade of spinal cord injury), and the patient's age were the variables in the rehabilitation patients. The algorithm of KRPG system after applying the derived variables and total 204 rehabilitation patient groups were developed. The KRPG explained 11.8% of variance in charge for rehabilitation inpatients. It also explained 13.8% of variance in length of stay for them. Conclusion: The KRPG version 1.0 reflecting the clinical characteristics of rehabilitation inpatients was classified as 204 groups.

지역사회획득 폐렴 환자의 퇴원시 사망 요인 분석 (A study on analysis of factors on in-hospital mortality for community-acquired pneumonia)

  • 김유미
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권3호
    • /
    • pp.389-400
    • /
    • 2011
  • 본 연구의 목적은 행정자료를 이용하여 지역사회획득 폐렴 환자의 사망 요인을 분석하기 위해 수행되었다. 2004~2006년 퇴원손상환자 조사자료 중 지역사회획득 폐렴환자 5,353건을 연구대상으로 하였으며, 사망률의 차이분석은 카이제곱 검정을 실시하였고, 사망 요인을 분석하기 위해 데이터마이닝 기법 중 의사결정나무 모형을 이용하였다. 의사결정나무 모형 중 C4.5가 성능이 우수하였는데, 입원경로, 호흡부전, 울혈성심부전을 포함하여 연령, 동반질환, 병상규모 등이 폐렴 사망의 위험 요인으로 나타났다. 본 연구는 행정데이터를 이용하여 지역사회획득 폐렴환자의 사망 위험요인을 밝히고자 하였다. 그러나 향후 병원특성, 지역특성, 의료행태 등에 대한 보다 포괄적인 변수를 포함한 후속 연구가 필요할 것으로 생각된다.

머신러닝 기반 고용량 I-131의 용량 예측 모델에 관한 연구 (A Study on Predictive Modeling of I-131 Radioactivity Based on Machine Learning)

  • 유연욱;이충운;김정수
    • 대한방사선기술학회지:방사선기술과학
    • /
    • 제46권2호
    • /
    • pp.131-139
    • /
    • 2023
  • High-dose I-131 used for the treatment of thyroid cancer causes localized exposure among radiology technologists handling it. There is a delay between the calibration date and when the dose of I-131 is administered to a patient. Therefore, it is necessary to directly measure the radioactivity of the administered dose using a dose calibrator. In this study, we attempted to apply machine learning modeling to measured external dose rates from shielded I-131 in order to predict their radioactivity. External dose rates were measured at 1 m, 0.3 m, and 0.1 m distances from a shielded container with the I-131, with a total of 868 sets of measurements taken. For the modeling process, we utilized the hold-out method to partition the data with a 7:3 ratio (609 for the training set:259 for the test set). For the machine learning algorithms, we chose linear regression, decision tree, random forest and XGBoost. To evaluate the models, we calculated root mean square error (RMSE), mean square error (MSE), and mean absolute error (MAE) to evaluate accuracy and R2 to evaluate explanatory power. Evaluation results are as follows. Linear regression (RMSE 268.15, MSE 71901.87, MAE 231.68, R2 0.92), decision tree (RMSE 108.89, MSE 11856.92, MAE 19.24, R2 0.99), random forest (RMSE 8.89, MSE 79.10, MAE 6.55, R2 0.99), XGBoost (RMSE 10.21, MSE 104.22, MAE 7.68, R2 0.99). The random forest model achieved the highest predictive ability. Improving the model's performance in the future is expected to contribute to lowering exposure among radiology technologists.

기계적 모터 고장진단을 위한 머신러닝 기법 (A Machine Learning Approach for Mechanical Motor Fault Diagnosis)

  • 정훈;김주원
    • 산업경영시스템학회지
    • /
    • 제40권1호
    • /
    • pp.57-64
    • /
    • 2017
  • In order to reduce damages to major railroad components, which have the potential to cause interruptions to railroad services and safety accidents and to generate unnecessary maintenance costs, the development of rolling stock maintenance technology is switching from preventive maintenance based on the inspection period to predictive maintenance technology, led by advanced countries. Furthermore, to enhance trust in accordance with the speedup of system and reduce maintenances cost simultaneously, the demand for fault diagnosis and prognostic health management technology is increasing. The objective of this paper is to propose a highly reliable learning model using various machine learning algorithms that can be applied to critical rolling stock components. This paper presents a model for railway rolling stock component fault diagnosis and conducts a mechanical failure diagnosis of motor components by applying the machine learning technique in order to ensure efficient maintenance support along with a data preprocessing plan for component fault diagnosis. This paper first defines a failure diagnosis model for rolling stock components. Function-based algorithms ANFIS and SMO were used as machine learning techniques for generating the failure diagnosis model. Two tree-based algorithms, RadomForest and CART, were also employed. In order to evaluate the performance of the algorithms to be used for diagnosing failures in motors as a critical railroad component, an experiment was carried out on 2 data sets with different classes (includes 6 classes and 3 class levels). According to the results of the experiment, the random forest algorithm, a tree-based machine learning technique, showed the best performance.

Predicting Stock Liquidity by Using Ensemble Data Mining Methods

  • Bae, Eun Chan;Lee, Kun Chang
    • 한국컴퓨터정보학회논문지
    • /
    • 제21권6호
    • /
    • pp.9-19
    • /
    • 2016
  • In finance literature, stock liquidity showing how stocks can be cashed out in the market has received rich attentions from both academicians and practitioners. The reasons are plenty. First, it is known that stock liquidity affects significantly asset pricing. Second, macroeconomic announcements influence liquidity in the stock market. Therefore, stock liquidity itself affects investors' decision and managers' decision as well. Though there exist a great deal of literature about stock liquidity in finance literature, it is quite clear that there are no studies attempting to investigate the stock liquidity issue as one of decision making problems. In finance literature, most of stock liquidity studies had dealt with limited views such as how much it influences stock price, which variables are associated with describing the stock liquidity significantly, etc. However, this paper posits that stock liquidity issue may become a serious decision-making problem, and then be handled by using data mining techniques to estimate its future extent with statistical validity. In this sense, we collected financial data set from a number of manufacturing companies listed in KRX (Korea Exchange) during the period of 2010 to 2013. The reason why we selected dataset from 2010 was to avoid the after-shocks of financial crisis that occurred in 2008. We used Fn-GuidPro system to gather total 5,700 financial data set. Stock liquidity measure was computed by the procedures proposed by Amihud (2002) which is known to show best metrics for showing relationship with daily return. We applied five data mining techniques (or classifiers) such as Bayesian network, support vector machine (SVM), decision tree, neural network, and ensemble method. Bayesian networks include GBN (General Bayesian Network), NBN (Naive BN), TAN (Tree Augmented NBN). Decision tree uses CART and C4.5. Regression result was used as a benchmarking performance. Ensemble method uses two types-integration of two classifiers, and three classifiers. Ensemble method is based on voting for the sake of integrating classifiers. Among the single classifiers, CART showed best performance with 48.2%, compared with 37.18% by regression. Among the ensemble methods, the result from integrating TAN, CART, and SVM was best with 49.25%. Through the additional analysis in individual industries, those relatively stabilized industries like electronic appliances, wholesale & retailing, woods, leather-bags-shoes showed better performance over 50%.