• Title/Summary/Keyword: Decision -making Tree

Search Result 197, Processing Time 0.022 seconds

Diagnostic Classification Scheme in Iranian Breast Cancer Patients using a Decision Tree

  • Malehi, Amal Saki
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.14
    • /
    • pp.5593-5596
    • /
    • 2014
  • Background: The objective of this study was to determine a diagnostic classification scheme using a decision tree based model. Materials and Methods: The study was conducted as a retrospective case-control study in Imam Khomeini hospital in Tehran during 2001 to 2009. Data, including demographic and clinical-pathological characteristics, were uniformly collected from 624 females, 312 of them were referred with positive diagnosis of breast cancer (cases) and 312 healthy women (controls). The decision tree was implemented to develop a diagnostic classification scheme using CART 6.0 Software. The AUC (area under curve), was measured as the overall performance of diagnostic classification of the decision tree. Results: Five variables as main risk factors of breast cancer and six subgroups as high risk were identified. The results indicated that increasing age, low age at menarche, single and divorced statues, irregular menarche pattern and family history of breast cancer are the important diagnostic factors in Iranian breast cancer patients. The sensitivity and specificity of the analysis were 66% and 86.9% respectively. The high AUC (0.82) also showed an excellent classification and diagnostic performance of the model. Conclusions: Decision tree based model appears to be suitable for identifying risk factors and high or low risk subgroups. It can also assists clinicians in making a decision, since it can identify underlying prognostic relationships and understanding the model is very explicit.

A Study on the Prediction of Community Smart Pension Intention Based on Decision Tree Algorithm

  • Liu, Lijuan;Min, Byung-Won
    • International Journal of Contents
    • /
    • v.17 no.4
    • /
    • pp.79-90
    • /
    • 2021
  • With the deepening of population aging, pension has become an urgent problem in most countries. Community smart pension can effectively resolve the problem of traditional pension, as well as meet the personalized and multi-level needs of the elderly. To predict the pension intention of the elderly in the community more accurately, this paper uses the decision tree classification method to classify the pension data. After missing value processing, normalization, discretization and data specification, the discretized sample data set is obtained. Then, by comparing the information gain and information gain rate of sample data features, the feature ranking is determined, and the C4.5 decision tree model is established. The model performs well in accuracy, precision, recall, AUC and other indicators under the condition of 10-fold cross-validation, and the precision was 89.5%, which can provide the certain basis for government decision-making.

Economic Value Analysis of Asian Dust Forecasts Using Decision Tree-Focused on Medicine Inventory Management (의사결정트리를 활용한 황사예보의 경제적 가치 분석-의약품 재고관리문제를 중심으로)

  • Yoon, Seung-Chul;Lee, Ki-Kwang
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.37 no.1
    • /
    • pp.120-126
    • /
    • 2014
  • This paper deals with the economic value analysis of meteorological forecasts for a hypothetical inventory decision-making situation in the pharmaceutical industry. The value of Asian dust (AD) forecasts is assessed in terms of the expected value of profits by using a decision tree, which is transformed from the specific payoff structure. The forecast user is assumed to determine the inventory level by considering base profit, inventory cost, and lost sales cost. We estimate the information value of AD forecasts by comparing the two cases of decision-making with or without the AD forecast. The proposed method is verified for the real data of AD forecasts and events in Seoul during the period 2004~2008. The results indicate that AD forecasts can provide the forecast users with benefits, which have various ranges of values according to the relative rate of inventory and lost sales cost.

Splitting Decision Tree Nodes with Multiple Target Variables (의사결정나무에서 다중 목표변수를 고려한)

  • 김성준
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.05a
    • /
    • pp.243-246
    • /
    • 2003
  • Data mining is a process of discovering useful patterns for decision making from an amount of data. It has recently received much attention in a wide range of business and engineering fields Classifying a group into subgroups is one of the most important subjects in data mining Tree-based methods, known as decision trees, provide an efficient way to finding classification models. The primary concern in tree learning is to minimize a node impurity, which is evaluated using a target variable in the data set. However, there are situations where multiple target variables should be taken into account, for example, such as manufacturing process monitoring, marketing science, and clinical and health analysis. The purpose of this article is to present several methods for measuring the node impurity, which are applicable to data sets with multiple target variables. For illustrations, numerical examples are given with discussion.

  • PDF

Decision-tree Model of Treatment-seeking Behaviors after Detecting Symptoms by Korean Stroke Patients

  • Oh Hyo-Sook;Park Hyeoun-Ae
    • Journal of Korean Academy of Nursing
    • /
    • v.36 no.4
    • /
    • pp.662-670
    • /
    • 2006
  • Purpose. This study was performed to develop and test a decision-tree model of treatment-seeking behaviors about when Korean patients visit a doctor after experiencing stroke symptoms. Methods. The study used methodological triangulation. The model was developed based on qualitative data collected from in-depth interviews with 18 stroke patients. The model was tested using quantitative data collected from interviews and a structured questionnaire involving 150 stroke patients. The predictability of the decision-tree model was quantified as the proportion of participants who followed the pathway predicted by the model. Results. Decision outcomes of the model were categorized into immediate and delayed treatment-seeking behavior. The model was influenced by lowered consciousness, social-group influences, perceived seriousness of symptoms, past history of hypertension or stroke, and barriers to hospital visits. The predictability of the model was found to be 90.7%. Conclusions. The results from this study can help healthcare personnel understand the education needs of stroke patients regarding treatment-seeking behaviors, and hence aid in the development of educational strategies for stroke patients.

Forecasting Sow's Productivity using the Machine Learning Models (머신러닝을 활용한 모돈의 생산성 예측모델)

  • Lee, Min-Soo;Choe, Young-Chan
    • Journal of Agricultural Extension & Community Development
    • /
    • v.16 no.4
    • /
    • pp.939-965
    • /
    • 2009
  • The Machine Learning has been identified as a promising approach to knowledge-based system development. This study aims to examine the ability of machine learning techniques for farmer's decision making and to develop the reference model for using pig farm data. We compared five machine learning techniques: logistic regression, decision tree, artificial neural network, k-nearest neighbor, and ensemble. All models are well performed to predict the sow's productivity in all parity, showing over 87.6% predictability. The model predictability of total litter size are highest at 91.3% in third parity and decreasing as parity increases. The ensemble is well performed to predict the sow's productivity. The neural network and logistic regression is excellent classifier for all parity. The decision tree and the k-nearest neighbor was not good classifier for all parity. Performance of models varies over models used, showing up to 104% difference in lift values. Artificial Neural network and ensemble models have resulted in highest lift values implying best performance among models.

  • PDF

A Neural Network-Driven Decision Tree Classifier Approach to Time Series Identification (인공신경망 기초 의사결정트리 분류기에 의한 시계열모형화에 관한 연구)

  • 오상봉
    • Journal of the Korea Society for Simulation
    • /
    • v.5 no.1
    • /
    • pp.1-12
    • /
    • 1996
  • We propose a new approach to classifying a time series data into one of the autoregressive moving-average (ARMA) models. It is bases on two pattern recognition concepts for solving time series identification. The one is an extended sample autocorrelation function (ESACF). The other is a neural network-driven decision tree classifier(NNDTC) in which two pattern recognition techniques are tightly coupled : neural network and decision tree classfier. NNDTc consists of a set of nodes at which neural network-driven decision making is made whether the connecting subtrees should be pruned or not. Therefore, time series identification problem can be stated as solving a set of local decisions at nodes. The decision values of the nodes are provided by neural network functions attached to the corresponding nodes. Experimental results with a set of test data and real time series data show that the proposed approach can efficiently identify the time seires patterns with high precision compared to the previous approaches.

  • PDF

A decision support system (DSS) for construction risk efficiency in Taiwan

  • Tsai, Tsung-Chieh;Li, Hsiang-Wen
    • Smart Structures and Systems
    • /
    • v.21 no.2
    • /
    • pp.249-255
    • /
    • 2018
  • Many studies in risk management have been focused on management process, contract relation, and risk analysis in the past decade, but very few studies have addressed project risks from the perspective of risk efficiency. This study started with using Fault Tree Analysis to develop a framework for the decision-making support system of risk management from the perspective of risk efficiency, in order for the support system to find risk strategies of optimal combination for the project manager by the trade-off between project risk and cost of project strategies. Comprehensive and realistic risk strategies must strive for optimal decisions that minimize project risks and risk strategies cost while addressing important data such as risk causes, risk probability, risk impact and risk strategies cost. The risk management in the construction phase of building projects in Taiwan upon important data has been analyzed, that provided the data for support system to include 247 risk causes. Then, 17 risk causes were extracted to demonstrates the decision-making support system of risk management from the perspective of risk efficiency in building project of Taiwan which could reach better combination type of risk strategies for the project manager by the trade-off between risk cost and project risk.

Wine Quality Assessment Using a Decision Tree with the Features Recommended by the Sequential Forward Selection

  • Lee, Seunghan;Kang, Kyungtae;Noh, Dong Kun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.22 no.2
    • /
    • pp.81-87
    • /
    • 2017
  • Nowadays wine is increasingly enjoyed by a wider range of consumers, and wine certification and quality assessment are key elements in supporting the wine industry to develop new technologies for both wine making and selling processes. There have been many attempts to construct a more methodical approach to the assessment of wines, but most of them rely on objective decision rather than subjective judgement. In this paper, we propose a data mining approach to predict human wine taste preferences that is based on easily available analytical tests at the certification step. We used sequential forward selection and decision tree for this purpose. Experiments with the wine quality dataset from the UC Irvine Machine Learning Repository demonstrate the accuracies of 76.7% and 78.7% for red and white wines respectively.

Correlation Analysis of the Frequency and Death Rates in Arterial Intervention using C4.5

  • Jung, Yong Gyu;Jung, Sung-Jun;Cha, Byeong Heon
    • International journal of advanced smart convergence
    • /
    • v.6 no.3
    • /
    • pp.22-28
    • /
    • 2017
  • With the recent development of technologies to manage vast amounts of data, data mining technology has had a major impact on all industries.. Data mining is the process of discovering useful correlations hidden in data, extracting executable information for the future, and using it for decision making. In other words, it is a core process of Knowledge Discovery in data base(KDD) that transforms input data and derives useful information. It extracts information that we did not know until now from a large data base. In the decision tree, c4.5 algorithm was used. In addition, the C4.5 algorithm was used in the decision tree to analyze the difference between frequency and mortality in the region. In this paper, the frequency and mortality of percutaneous coronary intervention for patients with heart disease were divided into regions.