Browse > Article

A Feature Analysis of Industrial Accidents Using C4.5 Algorithm  

Leem, Young-Moon (Department of Industrial Systems Engineering, Kangnung National University)
Kwag, Jun-Koo (Department of Industrial Systems Engineering, Kangnung National University)
Hwang, Young-Seob (Department of Industrial Systems Engineering, Kangnung National University)
Publication Information
Journal of the Korean Society of Safety / v.20, no.4, 2005 , pp. 130-137 More about this Journal
Abstract
Decision tree algorithm is one of the data mining techniques, which conducts grouping or prediction into several sub-groups from interested groups. This technique can analyze a feature of type on groups and can be used to detect differences in the type of industrial accidents. This paper uses C4.5 algorithm for the feature analysis. The data set consists of 24,887 features through data selection from total data of 25,159 taken from 2 year observation of industrial accidents in Korea For the purpose of this paper, one target value and eight independent variables are detailed by type of industrial accidents. There are 222 total tree nodes and 151 leaf nodes after grouping. This paper Provides an acceptable level of accuracy(%) and error rate(%) in order to measure tree accuracy about created trees. The objective of this paper is to analyze the efficiency of the C4.5 algorithm to classify types of industrial accidents data and thereby identify potential weak points in disaster risk grouping.
Keywords
A Feature Analysis; Industrial Disasters; C4.5 Algorithm;
Citations & Related Records
연도 인용수 순위
  • Reference
1 김경배, 산업재해의 예방대책에 관한 연구, 관동대학교 석사학위논문, 2004
2 최종후, 한상태, 강현철, 김은석, 김차용, 김미경, 'SAS Enterprise Miner를 이용한 데이터 마이닝-방법론 및 활용', 자유아카데미, pp. 230-300, 2001
3 최종후, 한상태, 강현철, 김은석, '(Answer Tree를 이용한) 데이터 마이닝 의사결정나무 분석', 고려 정보 산업, pp. 17-74, 1998
4 Kamber, M., Winstone, L., Gong, W., Cheng, S., Han, J.; 'Generalization and Decision Tree Induction', Efficient Classification in Data Mining. Proceedings of the International Workshop Issue of Data Engineering (RIDE' 97) Birmingham, UK. pp. 111-120, 1997
5 Pietersma, D., Lacroix, R., Lefebvre, D., Wade, K. M., 'Induction and Evaluation of Decision Tree for Lactation Curve Analysis', Comput. Electron. Agric. 38, pp. 19-32, 2003   DOI   ScienceOn
6 오희경, 최형인, '데이터 마이닝 분류 모델 비교 및 분석', 서울대학교 석사 학위 논문, 2002
7 김종현, 우리나라 산업재해 통계를 이용한 재해 실태분석과 통계제도의 개선방향, 경일대학교 석사학위논문, pp. 40-60, 1998
8 권혜숙, '데이터마이닝 패키지에서 분류나무 알고리즘의 비교 연구', 서울대학교 석사 학위 논문, 2002
9 임영문, 최영두, '연관규칙을 이용한 데이터 분석에 관한 연구', 산업경영시스템학회지, Vol. 23, No. 61, pp 115-126, 2001
10 Chong Yau Fu.; 'Combining Loglinear Model with Classification and Regression Tree (CART) : an Application to Birth Data', Computational Statistics & Data Analysis 4, 2004
11 노동부, 2004년판 노동백서, pp 56-65, 2004