DOI QR코드

DOI QR Code

On the Development of Risk Factor Map for Accident Analysis using Textmining and Self-Organizing Map(SOM) Algorithms

재해분석을 위한 텍스트마이닝과 SOM 기반 위험요인지도 개발

  • Kang, Sungsik (Department of Safety Engineering, Pukyong National University) ;
  • Suh, Yongyoon (Department of Safety Engineering, Pukyong National University)
  • Received : 2018.09.03
  • Accepted : 2018.11.20
  • Published : 2018.12.31

Abstract

Report documents of industrial and occupational accidents have continuously been accumulated in private and public institutes. Amongst others, information on narrative-texts of accidents such as accident processes and risk factors contained in disaster report documents is gaining the useful value for accident analysis. Despite this increasingly potential value of analysis of text information, scientific and algorithmic text analytics for safety management has not been carried out yet. Thus, this study aims to develop data processing and visualization techniques that provide a systematic and structural view of text information contained in a disaster report document so that safety managers can effectively analyze accident risk factors. To this end, the risk factor map using text mining and self-organizing map is developed. Text mining is firstly used to extract risk keywords from disaster report documents and then, the Self-Organizing Map (SOM) algorithm is conducted to visualize the risk factor map based on the similarity of disaster report documents. As a result, it is expected that fruitful text information buried in a myriad of disaster report documents is analyzed, providing risk factors to safety managers.

Keywords

HOJHB0_2018_v33n6_77_f0001.png 이미지

Fig. 1. Conceptual scheme of self-organizing map.

HOJHB0_2018_v33n6_77_f0002.png 이미지

Fig. 2. Flowchart for the methodology.

HOJHB0_2018_v33n6_77_f0003.png 이미지

Fig. 3. Number of accident documents.

HOJHB0_2018_v33n6_77_f0004.png 이미지

Fig. 4. Word cloud of accident keywords.

HOJHB0_2018_v33n6_77_f0005.png 이미지

Fig. 5. Result of Document-Term matrix.

HOJHB0_2018_v33n6_77_f0006.png 이미지

Fig. 6. SOM-based risk factor map.

HOJHB0_2018_v33n6_77_f0007.png 이미지

Fig. 8. Dynamic analysis for keyword change in each cluster.

HOJHB0_2018_v33n6_77_f0008.png 이미지

Fig. 7. Major clusters of accidents.

Table 1. Analysis of large cells

HOJHB0_2018_v33n6_77_t0001.png 이미지

Table 2. Analysis of major clusters

HOJHB0_2018_v33n6_77_t0002.png 이미지

References

  1. Ministry of Employment and Labor, "Classification of Causes of Deaths in Industrial Accidents", 2013.
  2. B. Kim, S. Chang and Y. Suh, "Text Analytics for Classifying Types of Accident Occurrence Using Accident Report Documents", J. Korean Soc. Saf., Vol. 33, No. 3, pp. 58-64, 2018. https://doi.org/10.14346/JKOSOS.2018.33.3.58
  3. G. Choi, "Cause Analysis of Accidents Associated with Industrial Machines and Devices", J. Korean Soc. Saf., Vol. 33, No. 1, pp. 16-21, 2018. https://doi.org/10.14346/JKOSOS.2018.33.1.16
  4. B. Song and Y. Suh, "Narrative Texts-based Anomaly Detection Using Accident Report Documents: The Case of Chemical Process Safety", Journal of Loss Prevention in the Process Industries, 2018. https://doi.org/10.1016/j.jlp.2018.08.010
  5. C. Budayan, "Comparing the Performance of Traditional Cluster Analysis, Self-Organizing Maps and Fuzzy C-means Method for Strategic Grouping", Expert Systems with Applications, Vol. 36, No. 9, pp. 11772-11781, 2009. https://doi.org/10.1016/j.eswa.2009.04.022
  6. R. Feldman and I. Dagan, "Knowledge Discovery in Textual Databases (KDT)", Proceedings of the First International Conference on Knowledge Discovery and Data Mining (KDD), pp. 112-117, 1995.
  7. E. Turban, "Decision Support and Business Intelligence Systems", Pearson Education Limited, Harlow, UK, 2010.
  8. Y. Suh, "Data Analytics for Social Risk Forecasting and Assessment of New Technology", J. Korean Soc. Saf., Vol. 32, No. 3, pp. 83-89, 2017. https://doi.org/10.14346/JKOSOS.2017.32.3.83
  9. T. Kohonen, "The Self-Organizing Map", Neurocomputing, Vol. 21, No. 1-3, pp. 1-6, 1998. https://doi.org/10.1016/S0925-2312(98)00030-7
  10. T. Kohonen, "Self-Organization and Associative Memory", Springer-Verlag, Berlin, Heidelberg, 2012.
  11. T. Kohonen, "Essentials of the Self-Organizing Map", Neural Networks, Vol. 37, pp. 52-65, 2013. https://doi.org/10.1016/j.neunet.2012.09.018
  12. J. Vesanto and E. Alhoniemi, "Clustering of the Self-Organizing Map", IEEE Transactions on Neural Network, Vol.11, No.3, pp. 586-600, 2000. https://doi.org/10.1109/72.846731