• 제목/요약/키워드: Decision tree

검색결과 1,629건 처리시간 0.029초

Diagnostic Classification Scheme in Iranian Breast Cancer Patients using a Decision Tree

  • Malehi, Amal Saki
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권14호
    • /
    • pp.5593-5596
    • /
    • 2014
  • Background: The objective of this study was to determine a diagnostic classification scheme using a decision tree based model. Materials and Methods: The study was conducted as a retrospective case-control study in Imam Khomeini hospital in Tehran during 2001 to 2009. Data, including demographic and clinical-pathological characteristics, were uniformly collected from 624 females, 312 of them were referred with positive diagnosis of breast cancer (cases) and 312 healthy women (controls). The decision tree was implemented to develop a diagnostic classification scheme using CART 6.0 Software. The AUC (area under curve), was measured as the overall performance of diagnostic classification of the decision tree. Results: Five variables as main risk factors of breast cancer and six subgroups as high risk were identified. The results indicated that increasing age, low age at menarche, single and divorced statues, irregular menarche pattern and family history of breast cancer are the important diagnostic factors in Iranian breast cancer patients. The sensitivity and specificity of the analysis were 66% and 86.9% respectively. The high AUC (0.82) also showed an excellent classification and diagnostic performance of the model. Conclusions: Decision tree based model appears to be suitable for identifying risk factors and high or low risk subgroups. It can also assists clinicians in making a decision, since it can identify underlying prognostic relationships and understanding the model is very explicit.

신경망과 의사결정 나무를 이용한 충수돌기염 환자의 재원일수 예측모형 개발 (Length-of-Stay Prediction Model of Appendicitis using Artificial Neural Networks and Decision Tree)

  • 정석훈;한우석;서용무;이현실
    • 한국산학기술학회논문지
    • /
    • 제10권6호
    • /
    • pp.1424-1432
    • /
    • 2009
  • 충수돌기염 환자의 LoS(Length of Stay)를 예측하는 것은 병상의 운영에 적지 않은 영향을 준다. 본 논문에서는 Neural Networks와 Decision Tree를 이용하여 LoS와 연관이 높은 입력변수들을 찾아 그 의미를 분석하며, 찾아낸 입력변수들을 이용하여 다양한 LoS 예측 모형을 개발하고 그 성능을 비교하였다. 모형의 예측 정확성을 높이기 위하여 Bagging과 Boosting 등의 Ensemble 기법도 적용하였다. 실험 결과, Decision Tree 모형이 Neural Networks 모형보다 좀 더 적은 수의 속성을 가지고도 거의 통일한 예측력을 보였으며, Ensemble 기법 중에서는 Bagging 기법이 Boosting 기법보다 좋은 결과를 보여주었다. 의사결정나무 기법은 Neural Networks 기법에 비해 설명력이 있으며, 충수돌기염의 LoS 예측에 매우 효과적이었고, 중요 입력 변수의 선정에도 좋은 결과를 보여줌에 따라 향후 적극적인 기법의 도입이 필요하다고 할 수 있다.

A Comparative Study of Predictive Factors for Passing the National Physical Therapy Examination using Logistic Regression Analysis and Decision Tree Analysis

  • Kim, So Hyun;Cho, Sung Hyoun
    • Physical Therapy Rehabilitation Science
    • /
    • 제11권3호
    • /
    • pp.285-295
    • /
    • 2022
  • Objective: The purpose of this study is to use logistic regression and decision tree analysis to identify the factors that affect the success or failurein the national physical therapy examination; and to build and compare predictive models. Design: Secondary data analysis study Methods: We analyzed 76,727 subjects from the physical therapy national examination data provided by the Korea Health Personnel Licensing Examination Institute. The target variable was pass or fail, and the input variables were gender, age, graduation status, and examination area. Frequency analysis, chi-square test, binary logistic regression, and decision tree analysis were performed on the data. Results: In the logistic regression analysis, subjects in their 20s (Odds ratio, OR=1, reference), expected to graduate (OR=13.616, p<0.001) and from the examination area of Jeju-do (OR=3.135, p<0.001), had a high probability of passing. In the decision tree, the predictive factors for passing result had the greatest influence in the order of graduation status (x2=12366.843, p<0.001) and examination area (x2=312.446, p<0.001). Logistic regression analysis showed a specificity of 39.6% and sensitivity of 95.5%; while decision tree analysis showed a specificity of 45.8% and sensitivity of 94.7%. In classification accuracy, logistic regression and decision tree analysis showed 87.6% and 88.0% prediction, respectively. Conclusions: Both logistic regression and decision tree analysis were adequate to explain the predictive model. Additionally, whether actual test takers passed the national physical therapy examination could be determined, by applying the constructed prediction model and prediction rate.

A Comparative Study of Predictive Factors for Hypertension using Logistic Regression Analysis and Decision Tree Analysis

  • SoHyun Kim;SungHyoun Cho
    • Physical Therapy Rehabilitation Science
    • /
    • 제12권2호
    • /
    • pp.80-91
    • /
    • 2023
  • Objective: The purpose of this study is to identify factors that affect the incidence of hypertension using logistic regression and decision tree analysis, and to build and compare predictive models. Design: Secondary data analysis study Methods: We analyzed 9,859 subjects from the Korean health panel annual 2019 data provided by the Korea Institute for Health and Social Affairs and National Health Insurance Service. Frequency analysis, chi-square test, binary logistic regression, and decision tree analysis were performed on the data. Results: In logistic regression analysis, those who were 60 years of age or older (Odds ratio, OR=68.801, p<0.001), those who were divorced/widowhood/separated (OR=1.377, p<0.001), those who graduated from middle school or younger (OR=1, reference), those who did not walk at all (OR=1, reference), those who were obese (OR=5.109, p<0.001), and those who had poor subjective health status (OR=2.163, p<0.001) were more likely to develop hypertension. In the decision tree, those over 60 years of age, overweight or obese, and those who graduated from middle school or younger had the highest probability of developing hypertension at 83.3%. Logistic regression analysis showed a specificity of 85.3% and sensitivity of 47.9%; while decision tree analysis showed a specificity of 81.9% and sensitivity of 52.9%. In classification accuracy, logistic regression and decision tree analysis showed 73.6% and 72.6% prediction, respectively. Conclusions: Both logistic regression and decision tree analysis were adequate to explain the predictive model. It is thought that both analysis methods can be used as useful data for constructing a predictive model for hypertension.

기계 진단을 위한 적응형 의사결정 트리 알고리즘 (Adaptive Decision Tree Algorithm for Machine Diagnosis)

  • 백준걸;김강호;김창욱;김성식
    • 한국경영과학회:학술대회논문집
    • /
    • 대한산업공학회/한국경영과학회 2000년도 춘계공동학술대회 논문집
    • /
    • pp.235-238
    • /
    • 2000
  • This article presents an adaptive decision tree algorithm for dynamically reasoning machine failure cause out of real-time, large-scale machine status database. On the basis of experiment using semiconductor etching machine, it has been verified that our model outperforms previously proposed decision tree models.

  • PDF

A LEARNING SYSTEM BY MODIFYING A DECISION TREE FOR CAPP

  • 이홍희
    • 대한산업공학회지
    • /
    • 제20권3호
    • /
    • pp.125-137
    • /
    • 1994
  • Manufacturing environs constantly change, and any efficient software system to be used in manufacturing must be able to adapt to the varying situations. In a CAPP (Computer-Aided Process Planning) system, a learning capability is necessary for the CAPP system to do change along with the manufacturing system. Unfortunately only a few CAPP systems currently possess learning capabilities. This research aims at the development of a learning system which can increase the knowledge in a CAPP system. A part in the system is represented by frames and described interactively. The process information and process planning logic is represented using a decision tree. The knowledge expansion is carried out through an interactive expansion of the decision tree according to human advice. Algorithms for decision tree modification are developed. A path can be recommended for an unknown part of limited scope. The processes are selected according to the criterion such as minimum time or minimum cost. The decision tree, and the process planning and learning procedures are formally defined.

  • PDF

Applying Decision Tree Algorithms for Analyzing HS-VOSTS Questionnaire Results

  • Kang, Dae-Ki
    • 공학교육연구
    • /
    • 제15권4호
    • /
    • pp.41-47
    • /
    • 2012
  • Data mining and knowledge discovery techniques have shown to be effective in finding hidden underlying rules inside large database in an automated fashion. On the other hand, analyzing, assessing, and applying students' survey data are very important in science and engineering education because of various reasons such as quality improvement, engineering design process, innovative education, etc. Among those surveys, analyzing the students' views on science-technology-society can be helpful to engineering education. Because, although most researches on the philosophy of science have shown that science is one of the most difficult concepts to define precisely, it is still important to have an eye on science, pseudo-science, and scientific misconducts. In this paper, we report the experimental results of applying decision tree induction algorithms for analyzing the questionnaire results of high school students' views on science-technology-society (HS-VOSTS). Empirical results on various settings of decision tree induction on HS-VOSTS results from one South Korean university students indicate that decision tree induction algorithms can be successfully and effectively applied to automated knowledge discovery from students' survey data.

A Study on Machine Fault Diagnosis using Decision Tree

  • Nguyen, Ngoc-Tu;Kwon, Jeong-Min;Lee, Hong-Hee
    • Journal of Electrical Engineering and Technology
    • /
    • 제2권4호
    • /
    • pp.461-467
    • /
    • 2007
  • The paper describes a way to diagnose machine condition based on the expert system. In this paper, an expert system-decision tree is built and experimented to diagnose and to detect machine defects. The main objective of this study is to provide a simple way to monitor machine status by synthesizing the knowledge and experiences on the diagnostic case histories of the rotating machinery. A traditional decision tree has been constructed using vibration-based inputs. Some case studies are provided to illustrate the application and advantages of the decision tree system for machine fault diagnosis.

정주여건을 고려한 의사결정나무기법 활용 농촌지역 유형화 (Typical Classification of Rural Area Considering Settlement Environment by Decision Tree Method)

  • 배승종;김대식;은상규
    • 한국농공학회논문집
    • /
    • 제58권6호
    • /
    • pp.79-92
    • /
    • 2016
  • The objective of this study is to classify the types of rural areas (138 $si{\cdot}gun$) considering settlement environment by Decision Tree Method (CHAID). The CHAID method was used for decision tree algorithm and the seven dependant variables and 5 explanatory variables were selected, respectively. By decision tree method, rural areas were finally classified into six groups through three separate processes. City area, lower area in aging rate and higher area in farmland area ratio was analyzed to be relatively rich rather than other area in the case of settlement environment index. In the future, this study will be able to utilize as a reference to the planning of rural development projects.

증분 의사결정 트리 구축을 위한 연속형 속성의 다구간 이산화 (Multi-Interval Discretization of Continuous-Valued Attributes for Constructing Incremental Decision Tree)

  • 백준걸;김창욱;김성식
    • 대한산업공학회지
    • /
    • 제27권4호
    • /
    • pp.394-405
    • /
    • 2001
  • Since most real-world application data involve continuous-valued attributes, properly addressing the discretization process for constructing a decision tree is an important problem. A continuous-valued attribute is typically discretized during decision tree generation by partitioning its range into two intervals recursively. In this paper, by removing the restriction to the binary discretization, we present a hybrid multi-interval discretization algorithm for discretizing the range of continuous-valued attribute into multiple intervals. On the basis of experiment using semiconductor etching machine, it has been verified that our discretization algorithm constructs a more efficient incremental decision tree compared to previously proposed discretization algorithms.

  • PDF