Browse > Article
http://dx.doi.org/10.14346/JKOSOS.2021.36.5.52

Machine Learning Approach to Classifying Fatal and Non-Fatal Accidents in Industries  

Kang, Sungsik (Department of Safety Engineering, Pukyong National University)
Chang, Seong Rok (Department of Safety Engineering, Pukyong National University)
Suh, Yongyoon (Department of Industrial and Systems Engineering, Dongguk University(Seoul Campus))
Publication Information
Journal of the Korean Society of Safety / v.36, no.5, 2021 , pp. 52-60 More about this Journal
Abstract
As the prevention of fatal accidents is considered an essential part of social responsibilities, both government and individual have devoted efforts to mitigate the unsafe conditions and behaviors that facilitate accidents. Several studies have analyzed the factors that cause fatal accidents and compared them to those of non-fatal accidents. However, studies on mathematical and systematic analysis techniques for identifying the features of fatal accidents are rare. Recently, various industrial fields have employed machine learning algorithms. This study aimed to apply machine learning algorithms for the classification of fatal and non-fatal accidents based on the features of each accident. These features were obtained by text mining literature on accidents. The classification was performed using four machine learning algorithms, which are widely used in industrial fields, including logistic regression, decision tree, neural network, and support vector machine algorithms. The results revealed that the machine learning algorithms exhibited a high accuracy for the classification of accidents into the two categories. In addition, the importance of comparing similar cases between fatal and non-fatal accidents was discussed. This study presented a method for classifying accidents using machine learning algorithms based on the reports on previous studies on accidents.
Keywords
machine learning; narrative texts; textmining; fatal accidents; non-fatal accidents; classification;
Citations & Related Records
연도 인용수 순위
  • Reference
1 KOSHA, "Statistical Survey and Analysis of Industrial Disasters", 2019.
2 G. Ahn, M. Seo and S. Hur, "Development of Accident Classification Model and Ontology for Effective Industrial Accident Analysis based on Textmining", Journal of The Korean Society of Safety, Vol. 32, No. 5, pp. 179-185, 2017.   DOI
3 S. Wold, K. Esbensen and P. Geladi, "Principal Component Analysis", Chemometrics and Intelligent Laboratory Systems, Vol. 2, No. 1-3, pp. 37-52, 1987.   DOI
4 J. Jeong, M. Jee, M. Go, H. Kim, H. Lim, Y. Lee, and W. Kim, "Related Documents Classification System by Similarity between Documents", Journal of Broadcast Engineering, Vol. 24, No. 1, pp. 77-86, 2019.   DOI
5 W. Lee, "A deep learning analysis of the KOSPI's directions", Journal of the Korean Data and Information Science Society, Vol. 28, No. 2, pp. 287-295, 2017.   DOI
6 M. Leblanc and R. Tibshirani, "Combining Estimates in Regression and Classification", Journal of the American Statistical Association, Vol. 91, No. 436, pp. 1641-1650, 1994.   DOI
7 J. Han, J. Pei and M. Kamber, "Data Mining: Concepts and Techniques", Elsevier, 2011.
8 S. Dreiseitl and L. Ohno-Machado, "Logistic Regression and Artificial Neural Network Classification Models", Journal of Biomedical Informatics, Vol. 35, pp. 352-359, 2002.   DOI
9 Y. Song and Y. LU, "Decision Tree Methods: Applications for Classification and Prediction", Shanghai Arch Psychiatry, Vol. 25, No. 2, pp. 130-135, 2015.
10 R. Feraud and F Clerot, "A Methodology to Explain Neural Network Classification", Neural Networks, Vol. 15, No. 2, pp. 237-246, 2002.   DOI
11 J. P. Bigus, "Data Mining with Neural Networks", Macgraw-Hill, pp. 61-97, 1996.
12 H. W. Heinrich, "Industrial Accident Prevention: A Scientific Approach", McGraw-Hill, 1931.
13 B. Kim, S. Chang and Y. Suh, "Text Analytics for Classifying Types of Accident Occurrence Using Accident Report Documents", J. Korean Soc. Saf., Vol. 33, No. 3, pp. 58-64, 2018.   DOI
14 Y. M. Goh and C. U. Ubeynarayana, "Construction Accident Narrative Classification: An Evaluation of Text Mining Techniques", Accident Analysis and Prevention, Vol. 108, pp. 122-130, 2017.   DOI
15 G. Salton and M. McGill, "Introduction to Modern Information Retrieval", McGraw-Hill, NY, 1983.
16 MOEL, "Classification of Causes of Deaths in Industrial Accidents", 2017.
17 R. Kostoff, D. Toothman, H. Eberhart, and J. Jumenik, "Textmining Using Database Tomography and Bibliometrics: A Review", Technological Forecasting and Social Change, Vol. 68, No. 3, pp. 223-252, 2001.   DOI
18 A. Mathur and G. M. Foody, "Multiclass and Binary SVM Classification: Implications for Training and Classification Users", IEEE Geoscience and Remote sensing Letters, Vol. 5, No. 2, pp. 241-245, 2008.   DOI
19 S. Kang and Y. Suh, "On the Development of Risk Factor Map for Accident Analysis using Textmining and SelfOrganizing Map(SOM) Algorithms", J. Korean Soc. Saf., Vol. 33, No. 6, pp. 77-84, 2018.   DOI
20 D. Pak, M. Hwang, M. Lee, S. Woo, S. Hahn, Y. J. Lee, and J. Hwang, "Application of Text-Classification Based Machine Learning in Predicting Psychiatric Diagnosis", Korean Journal of Biological Psychiatry, Vol. 27, No. 1, pp. 18-26, 2020.