• Title/Summary/Keyword: Decision-tree model

Search Result 735, Processing Time 0.026 seconds

A Development of a Tailored Follow up Management Model Using the Data Mining Technique on Hypertension (데이터마이닝 기법을 활용한 맞춤형 고혈압 사후관리 모형 개발)

  • Park, Il-Su;Yong, Wang-Sik;Kim, Yu-Mi;Kang, Sung-Hong;Han, Jun-Tae
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.4
    • /
    • pp.639-647
    • /
    • 2008
  • This study used the characteristics of the knowledge discovery and data mining algorithms to develop tailored hypertension follow up management model - hypertension care predictive model and hypertension care compliance segmentation model - for hypertension management using the Korea National Health Insurance Corporation database(the insureds’ screening and health care benefit data). This study validated the predictive power of data mining algorithms by comparing the performance of logistic regression, decision tree, and ensemble technique. On the basis of internal and external validation, it was found that the model performance of logistic regression method was the best among the above three techniques on hypertension care predictive model and hypertension care compliance segmentation model was developed by Decision tree analysis. This study produced several factors affecting the outbreak of hypertension using screening. It is considered to be a contributing factor towards the nation’s building of a Hypertension follow up Management System in the near future by bringing forth representative results on the rise and care of hypertension.

Efficient context dependent process modeling using state tying and decision tree-based method (상태 공유와 결정트리 방법을 이용한 효율적인 문맥 종속 프로세스 모델링)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.3
    • /
    • pp.369-377
    • /
    • 2010
  • In vocabulary recognition systems based on HMM(Hidden Markov Model)s, training process unseen model bring on show a low recognition rate. If recognition vocabulary modify and make an addition then recreated modeling of executed database collected and training sequence on account of bring on additional expenses and take more time. This study suggest efficient context dependent process modeling method using decision tree-based state tying. On study suggest method is reduce recreated of model and it's offered that robustness and accuracy of context dependent acoustic modeling. Also reduce amount of model and offered training process unseen model as concerns context dependent a likely phoneme model has been used unseen model solve the matter. System performance as a result of represent vocabulary dependence recognition rate of 98.01%, vocabulary independence recognition rate of 97.38%.

Development and its APPLIcation of Computer Program for Slope Hazards Prediction using Decision Tree Model (의사결정나무모형을 이용한 급경사지재해 예측프로그램 개발 및 적용)

  • Song, Young-Suk;Cho, Yong-Chan;Seo, Yong-Seok;Ahn, Sang-Ro
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.29 no.2C
    • /
    • pp.59-69
    • /
    • 2009
  • Based on the data obtained from field investigation and soil testing to slope hazards occurrence section and non-occurrence section in crystalline rocks like gneiss, granite, and so on, a prediction model was developed by the use of a decision tree model. The classification standard of the selected prediction model is composed of the slope angle, the coefficient of permeability and the void ratio in the order. The computer program, SHAPP ver. 1.0 for prediction of slope hazards around an important national facilities using GIS technique and the developed model. To prove the developed prediction model and the computer program, the field data surveyed from Jumunjin, Gangneung city were compared with the prediction result in the same site. As the result of comparison, the real occurrence location of slope hazards was similar to the predicted section. Through the continuous study, the accuracy about prediction result of slope hazards will be upgraded and the computer program will be commonly used in practical.

A Context-Aware Information Service using FCM Clustering Algorithm and Fuzzy Decision Tree (FCM 클러스터링 알고리즘과 퍼지 결정트리를 이용한 상황인식 정보 서비스)

  • Yang, Seokhwan;Chung, Mokdong
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.7
    • /
    • pp.810-819
    • /
    • 2013
  • FCM (Fuzzy C-Means) clustering algorithm, a typical split-based clustering algorithm, has been successfully applied to the various fields. Nonetheless, the FCM clustering algorithm has some problems, such as high sensitivity to noise and local data, the different clustering result from the intuitive grasp, and the setting of initial round and the number of clusters. To address these problems, in this paper, we determine fuzzy numbers which project the FCM clustering result on the axis with the specific attribute. And we propose a model that the fuzzy numbers apply to FDT (Fuzzy Decision Tree). This model improves the two problems of FCM clustering algorithm such as elevated sensitivity to data, and the difference of the clustering result from the intuitional decision. And also, this paper compares the effect of the proposed model and the result of FCM clustering algorithm through the experiment using real traffic and rainfall data. The experimental results indicate that the proposed model provides more reliable results by the sensitivity relief for data. And we can see that it has improved on the concordance of FCM clustering result with the intuitive expectation.

A Predictive Model of Depression in Rural Elders-Decision Tree Analysis (의사결정나무 분석기법을 이용한 농촌거주 노인의 우울예측모형 구축)

  • Kim, Seong Eun;Kim, Sun Ah
    • Journal of Korean Academy of Nursing
    • /
    • v.43 no.3
    • /
    • pp.442-451
    • /
    • 2013
  • Purpose: This descriptive study was done to develop a predictive model of depression in rural elders that will guide prevention and reduction of depression in elders. Methods: A cross-sectional descriptive survey was done using face-to-face private interviews. Participants included in the final analysis were 461 elders (aged${\geq}$ 65 years). The questions were on depression, personal and environmental factors, body functions and structures, activity and participation. Decision tree analysis using the SPSS Modeler 14.1 program was applied to build an optimum and significant predictive model to predict depression in rural elders. Results: From the data analysis, the predictive model for factors related to depression in rural elders presented with 4 pathways. Predictive factors included exercise capacity, self-esteem, farming, social activity, cognitive function, and gender. The accuracy of the model was 83.7%, error rate 16.3%, sensitivity 63.3%, and specificity 93.6%. Conclusion: The results of this study can be used as a theoretical basis for developing a systematic knowledge system for nursing and for developing a protocol that prevents depression in elders living in rural areas, thereby contributing to advanced depression prevention for elders.

Research on Financial Distress Prediction Model of Chinese Cultural Industry Enterprises Based on Machine Learning and Traditional Statistical (전통적인 통계와 기계학습 기반 중국 문화산업 기업의 재무적 곤경 예측모형 연구)

  • Yuan, Tao;Wang, Kun;Luan, Xi;Bae, Ki-Hyung
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.2
    • /
    • pp.545-558
    • /
    • 2022
  • The purpose of this study is to explore a prediction model for accurately predicting Financial Difficulties of Chinese Cultural Industry Enterprises through Traditional Statistics and Machine Learning. To construct the prediction model, the data of 128 listed Cultural Industry Enterprises in China are used. On the basis of data groups composed of 25 explanatory variables, prediction models using Traditional Statistical such as Discriminant Analysis and logistic as well as Machine Learning such as SVM, Decision Tree and Random Forest were constructed, and Python software was used to evaluate the performance of each model. The results show that the Random Forest model has the best prediction performance, with an accuracy of 95%. The SVM model was followed with 93% accuracy. The Decision Tree model was followed with 92% accuracy.The Discriminant Analysis model was followed with 89% accuracy. The model with the lowest prediction effect was the Logistic model with an accuracy of 88%. This shows that Machine Learning model can achieve better prediction effect than Traditional Statistical model when predicting financial distress of Chinese cultural industry enterprises.

Prediction Models of Conflict and Intimacy in Teacher-Child Relationships: Investigation of Child Variables Based on Decision Tree Analysis (교사-유아 관계의 갈등 및 친밀감에 대한 예측 모형: 의사결정나무분석을 적용한 유아변인의 탐색)

  • Shin, Yoolim
    • Korean Journal of Childcare and Education
    • /
    • v.16 no.5
    • /
    • pp.69-86
    • /
    • 2020
  • Objective: The purpose of this research was to examine the prediction models of conflict and intimacy in teacher-child relationships based on decision tree analysis. Methods: The participants were 297 preschool children from ages three to five including 166 boys and 131 girls. Teacher-child relationships were measured by the Student-Teacher Relationship Scale(STRS). Physical aggression, relational aggression, social withdrawal, and prosocial behaviors were measured by teacher ratings. Moreover, ADHD-RS(Attentive Deficit Hyperactivity Disorder Rating Scale) was used to measure ADHD. The data was analyzed with decision tree analysis. Results: According to the prediction model for teacher-child conflict, the significant predictors were physical aggression and social withdrawal. According to the prediction model for teacher-child intimacy, the significant predictors were prosocial behaviors and relational aggression. However, children's age, gender and ADHD were not significant predictors. Conclusion/Implications: The findings suggest that social behaviors may be closely related with teacher-child relationships for preschool children. Based on the results of this study, intervention suggestions were made.

Core Keywords Extraction forEvaluating Online Consumer Reviews Using a Decision Tree: Focusing on Star Ratings and Helpfulness Votes (의사결정나무를 활용한 온라인 소비자 리뷰 평가에 영향을 주는 핵심 키워드 도출 연구: 별점과 좋아요를 중심으로)

  • Min, Kyeong Su;Yoo, Dong Hee
    • The Journal of Information Systems
    • /
    • v.32 no.3
    • /
    • pp.133-150
    • /
    • 2023
  • Purpose This study aims to develop classification models using a decision tree algorithm to identify core keywords and rules influencing online consumer review evaluations for the robot vacuum cleaner on Amazon.com. The difference from previous studies is that we analyze core keywords that affect the evaluation results by dividing the subjects that evaluate online consumer reviews into self-evaluation (star ratings) and peer evaluation (helpfulness votes). We investigate whether the core keywords influencing star ratings and helpfulness votes vary across different products and whether there is a similarity in the core keywords related to star ratings or helpfulness votes across all products. Design/methodology/approach We used random under-sampling to balance the dataset. We progressively removed independent variables based on decreasing importance through backwards elimination to evaluate the classification model's performance. As a result, we identified classification models that best predict star ratings and helpfulness votes for each product's online consumer reviews. Findings We have identified that the core keywords influencing self-evaluation and peer evaluation vary across different products, and even for the same model or features, the core keywords are not consistent. Therefore, companies' producers and marketing managers need to analyze the core keywords of each product to highlight the advantages and prepare customized strategies that compensate for the shortcomings.

Development of Medical Cost Prediction Model Based on the Machine Learning Algorithm (머신러닝 알고리즘 기반의 의료비 예측 모델 개발)

  • Han Bi KIM;Dong Hoon HAN
    • Journal of Korea Artificial Intelligence Association
    • /
    • v.1 no.1
    • /
    • pp.11-16
    • /
    • 2023
  • Accurate hospital case modeling and prediction are crucial for efficient healthcare. In this study, we demonstrate the implementation of regression analysis methods in machine learning systems utilizing mathematical statics and machine learning techniques. The developed machine learning model includes Bayesian linear, artificial neural network, decision tree, decision forest, and linear regression analysis models. Through the application of these algorithms, corresponding regression models were constructed and analyzed. The results suggest the potential of leveraging machine learning systems for medical research. The experiment aimed to create an Azure Machine Learning Studio tool for the speedy evaluation of multiple regression models. The tool faciliates the comparision of 5 types of regression models in a unified experiment and presents assessment results with performance metrics. Evaluation of regression machine learning models highlighted the advantages of boosted decision tree regression, and decision forest regression in hospital case prediction. These findings could lay the groundwork for the deliberate development of new directions in medical data processing and decision making. Furthermore, potential avenues for future research may include exploring methods such as clustering, classification, and anomaly detection in healthcare systems.

Development of Prediction Model for Prevalence of Metabolic Syndrome Using Data Mining: Korea National Health and Nutrition Examination Study (국민건강영양조사를 활용한 대사증후군 유병 예측모형 개발을 위한 융복합 연구: 데이터마이닝을 활용하여)

  • Kim, Han-Kyoul;Choi, Keun-Ho;Lim, Sung-Won;Rhee, Hyun-Sill
    • Journal of Digital Convergence
    • /
    • v.14 no.2
    • /
    • pp.325-332
    • /
    • 2016
  • The purpose of this study is to investigate the attributes influencing the prevalence of metabolic syndrome and develop the prediction model for metabolic syndrome over 40-aged people from Korea Health and Nutrition Examination Study 2012. The researcher chose the attributes for prediction model through literature review. Also, we used the decision tree, logistic regression, artificial neural network of data mining algorithm through Weka 3.6. As results, social economic status factors of input attributes were ranked higher than health-related factors. Additionally, prediction model using decision tree algorithm showed finally the highest accuracy. This study suggests that, first of all, prevention and management of metabolic syndrome will be approached by aspect of social economic status and health-related factors. Also, decision tree algorithms known from other research are useful in the field of public health due to their usefulness of interpretation.