• Title/Summary/Keyword: Decision Tree analysis

Data Analysis of Industrial Accidents in Manufacturing Industries Using CHAID Algorithm (CHAID Algorithm을 이용한 제조업에서의 산업재해 데이터 분석)

  • Leem Young-Moon;Hwang Young-Seob
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2006.04a
    • /
    • pp.45-50
    • /
    • 2006
  • The main objective of this study is to provide a feature analysis of industrial accidents in manufacturing industries using the CHAID algorithm. In this study, data on 10,536 accidents were analyzed to create risk groups, including the risk of disease and accident. The sample for this work was chosen from data related to manufacturing industries in Korea over three years $(2002\sim2004)$. The resulting classification rules have been incorporated into the development of a database tool that helps quantify the associated risks and acts as an early warning system for individual industrial accidents in manufacturing industries.
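
As a rough illustration of the idea behind CHAID (Chi-squared Automatic Interaction Detection), the sketch below computes a Pearson chi-square statistic for each candidate predictor against the outcome and picks the predictor with the strongest association. The predictor names and counts are hypothetical and are not taken from the study; full CHAID also merges categories and compares adjusted p-values, which this simplified sketch omits.

```python
def chi_square(table):
    """Pearson chi-square statistic for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            if expected > 0:  # skip empty rows/columns to avoid dividing by zero
                stat += (observed - expected) ** 2 / expected
    return stat

# Hypothetical counts (not from the study): rows are categories of a predictor,
# columns are the outcome classes (accident, disease).
candidates = {
    "work_shift": [[30, 10], [10, 30]],
    "region": [[20, 20], [20, 20]],
}
scores = {name: chi_square(table) for name, table in candidates.items()}
best = max(scores, key=scores.get)  # predictor most strongly associated with the outcome
print(best, scores[best])
```

A tree-growing procedure would split on the selected predictor and repeat the same test within each resulting subgroup.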

A GA-based Binary Classification Method for Bankruptcy Prediction (도산예측을 위한 유전 알고리듬 기반 이진분류기법의 개발)

  • Min, Jae-H.;Jeong, Chul-Woo
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.33 no.2
    • /
    • pp.1-16
    • /
    • 2008
  • The purpose of this paper is to propose a new binary classification method for predicting corporate failure based on a genetic algorithm, and to validate its predictive power through empirical analysis. The proposed method establishes virtual companies that represent bankrupt and non-bankrupt companies respectively, measures the similarity between these virtual companies and the subject of prediction, and classifies the subject as either bankrupt or non-bankrupt. The values of the classification variables of the virtual companies and the weights of the variables are determined using a genetic algorithm so as to maximize the hit ratio on the training data set. To test the validity of the proposed method, we compare its prediction accuracy with that of existing methods such as multi-discriminant analysis, logistic regression, decision tree, and artificial neural network. The results show that the proposed binary classification method can serve as a promising alternative to the existing methods for bankruptcy prediction.
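
The classification step described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not specify the similarity measure, so a weighted Euclidean distance is assumed here, and the feature names, prototype values, and weights are hypothetical. The genetic algorithm itself is not implemented; `hit_ratio` is the kind of fitness value such a search would try to maximize over the prototypes and weights.

```python
import math

def weighted_distance(x, prototype, weights):
    """Weighted Euclidean distance (an assumed similarity measure; smaller = more similar)."""
    return math.sqrt(sum(w * (a - b) ** 2 for a, b, w in zip(x, prototype, weights)))

def classify(x, bankrupt_proto, healthy_proto, weights):
    """Assign the subject to whichever virtual company it is closer to."""
    d_bankrupt = weighted_distance(x, bankrupt_proto, weights)
    d_healthy = weighted_distance(x, healthy_proto, weights)
    return "bankrupt" if d_bankrupt < d_healthy else "non-bankrupt"

def hit_ratio(samples, labels, bankrupt_proto, healthy_proto, weights):
    """Fraction of training samples classified correctly (candidate GA fitness)."""
    hits = sum(
        classify(x, bankrupt_proto, healthy_proto, weights) == y
        for x, y in zip(samples, labels)
    )
    return hits / len(samples)

# Hypothetical features: (debt ratio, current ratio). All values are illustrative only.
bankrupt_proto = [0.9, 0.5]
healthy_proto = [0.4, 1.8]
weights = [0.6, 0.4]

samples = [[0.8, 0.7], [0.3, 2.0]]
labels = ["bankrupt", "non-bankrupt"]
print(hit_ratio(samples, labels, bankrupt_proto, healthy_proto, weights))
```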

Analyzing Customer Purchase Behavior of a Department Store and Applying Customer Relationship Management Strategies (백화점 고객의 구매 분석 및 고객관계관리 전략 적용)

  • Ha Sung Ho;Baek Kyung Hoon
    • Korean Management Science Review
    • /
    • v.21 no.3
    • /
    • pp.55-69
    • /
    • 2004
  • This study analyzes how customer buying-behavior patterns in a department store change over time and predicts the moving patterns of its customers. Based on these results, it suggests short-term and long-term marketing promotion strategies. RFM techniques are used for customer segmentation. Customers are clustered using Kohonen's Self-Organizing Map, a data mining technique. Then C5.0, a decision tree analysis technique, is used to predict the moving patterns of customers. Using real-world data, this study evaluates the prediction accuracy of the predictive models.
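
For readers unfamiliar with RFM (recency, frequency, monetary value), the following minimal sketch derives the three values per customer from a transaction log. The record layout, customer IDs, dates, and amounts are hypothetical and do not come from the study; the study's own scoring and segmentation rules are not described in the abstract.

```python
from datetime import date

def rfm(transactions, today):
    """Return {customer: (recency_days, frequency, monetary)}.

    transactions: iterable of (customer_id, purchase_date, amount) tuples.
    """
    summary = {}
    for customer, day, amount in transactions:
        rec = summary.setdefault(customer, {"last": day, "frequency": 0, "monetary": 0.0})
        rec["last"] = max(rec["last"], day)  # most recent purchase date
        rec["frequency"] += 1                # number of purchases
        rec["monetary"] += amount            # total amount spent
    return {
        customer: ((today - rec["last"]).days, rec["frequency"], rec["monetary"])
        for customer, rec in summary.items()
    }

# Hypothetical transaction log for illustration only.
transactions = [
    ("A", date(2004, 1, 10), 100.0),
    ("A", date(2004, 3, 1), 50.0),
    ("B", date(2003, 12, 1), 20.0),
]
scores = rfm(transactions, today=date(2004, 3, 31))
print(scores)
```

The resulting triples could then be fed to a clustering method to form customer segments.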

Fault Diagnosis of Equipment of Wastewater Treatment Plants by Vibration Signal Analysis Using Time-Series Data Mining

  • Choi, Dae-Won;Bae, Hyeon;Chun, Seung-Pyo;Kim, Sung-Shin
    • Proceedings of the Institute of Control, Robotics and Systems Conference (제어로봇시스템학회:학술대회논문집)
    • /
    • 2005.06a
    • /
    • pp.2192-2197
    • /
    • 2005
  • This paper describes how to diagnose SBR plant equipment using time-series data mining. It presents equipment diagnostics based on vibration signals acquired from each device for process control. Data transform techniques, including two data preprocessing techniques, and data mining methods were employed in the data analysis. The proposed method is suitable not only for SBR equipment but also for other industrial devices. Experiments performed on a lab-scale SBR plant show good equipment-management performance.

Study on the Comparison and Analysis of Data Mining Models for the Efficient Customer Credit Evaluation (효율적인 신용평가를 위한 데이터마이닝 모형의 비교.분석에 관한 연구)

  • 김갑식
    • Journal of Information Technology Applications and Management
    • /
    • v.11 no.1
    • /
    • pp.161-174
    • /
    • 2004
  • This study is intended to suggest the optimized data mining model for efficient customer credit evaluation in the capital finance industry. To accomplish the research objective, various data mining models for customer credit evaluation are compared and analyzed. Furthermore, existing models such as Multi-Layered Perceptrons, Multivariate Discriminant Analysis, Radial Basis Function, Decision Tree, and Logistic Regression are employed to analyze customer information in the capital finance market and the detailed data of capital financing transactions. Finally, the results from an integrated model utilizing a genetic algorithm are compared with those of each individual model mentioned above. The results reveal that the integrated model is superior to the other existing models.

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • Proceedings of the Korean Data and Information Science Society Conference (한국데이터정보과학회:학술대회논문집)
    • /
    • 2005.10a
    • /
    • pp.59-69
    • /
    • 2005
  • Data mining techniques are used to find hidden knowledge, unexpected patterns, and new rules in massive data. The methods of data mining include decision trees, association rules, clustering, neural networks, and so on. Clustering is the process of grouping data into clusters so that objects within a cluster have high similarity to one another. It has been widely used in many applications, such as pattern analysis or recognition, data analysis, image processing, and offline or online market research. We analyze the 2001 Gyeongnam social indicator survey data using the twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.

Churn Analysis for the First Successful Candidates in the Entrance Examination for K University

  • Kim, Kyu-Il;Kim, Seung-Han;Kim, Eun-Young;Kim, Hyun;Yang, Jae-Wan;Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.1
    • /
    • pp.1-10
    • /
    • 2007
  • In this paper, we focus on churn analysis of the first successful candidates in the 2006 entrance examination using Clementine, a data mining tool. The goal of this study is to apply decision trees (including the C5.0 and CART algorithms), neural networks, and logistic regression techniques to predict churn among successful candidates. Using the 2006 entrance examination data of K University, we also analyze the churning and non-churning successful candidates, why successful candidates churn, and which successful candidates are most likely to churn in the future.

An Empirical Study to Support Intellectual Property Strategy Planning in Firms : The Use of Intellectual Property Roadmap (지식재산 전략유형별 R&D 특성분석과 지식재산로드맵 활용방안)

  • Cho, Chanwoo;Lee, Sungjoo
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.41 no.6
    • /
    • pp.559-571
    • /
    • 2015
  • To strengthen their competences, most firms cooperate with external partners. This increases the possibility of unexpected conflicts between firms due to intellectual property litigation. A suitable intellectual property strategy has to be developed for firms to settle this issue. This study aims to analyze the use of intellectual property strategies in firms and suggests the concept of an IP roadmap to support intellectual property strategy planning aligned with the technology planning process. For this purpose, we derive five types of intellectual property strategy of firms using the Korea Innovation Survey. Then, we explore significant influencing factors using a decision tree and conduct an in-depth analysis of them. Lastly, based on the analysis results, we suggest the concept of an IP roadmap, which can serve as a supporting tool for developing intellectual property strategy in firms.

The Informative Support and Emotional Support Classification Model for Medical Web Forums using Text Analysis (의료 웹포럼에서의 텍스트 분석을 통한 정보적 지지 및 감성적 지지 유형의 글 분류 모델)

  • Woo, Jiyoung;Lee, Min-Jung;Ku, Yungchang
    • Journal of Information Technology Services
    • /
    • v.11 no.sup
    • /
    • pp.139-152
    • /
    • 2012
  • In medical web forums, people share medical experience and information as patients and patients' families. Some people search for medical information written in non-expert language, and some offer words of comfort to those who are suffering from diseases. Medical web forums thus play the roles of informative support and emotional support. We propose an automatic classification model that classifies articles in a medical web forum into informative support and emotional support. We extract text features of forum articles using text mining techniques from the perspective of linguistics and then perform supervised learning to classify texts into the informative support and emotional support types. We adopt the Support Vector Machine (SVM), Naive Bayes, and decision tree classifiers for automatic classification. We apply the proposed model to the HealthBoards forum, which is one of the largest and most dynamic medical web forums.

Twostep Clustering of Environmental Indicator Survey Data

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.1-11
    • /
    • 2006
  • Data mining techniques are used to find hidden knowledge, unexpected patterns, and new rules in massive data. The methods of data mining include decision trees, association rules, clustering, neural networks, and so on. Clustering is the process of grouping data into clusters so that objects within a cluster have high similarity to one another. It has been widely used in many applications, such as pattern analysis or recognition, data analysis, image processing, and offline or online market research. We analyze the 2001 Gyeongnam social indicator survey data using the twostep clustering technique for environment information. The twostep clustering is classified as a partitional clustering method. We can apply these twostep clustering outputs to environmental preservation and improvement.
