DOI QR코드

DOI QR Code

Syntactic and semantic information extraction from NPP procedures utilizing natural language processing integrated with rules

  • Choi, Yongsun (Department of Industrial and Management Engineering, Inje University) ;
  • Nguyen, Minh Duc (Department of Information and Communication Systems, Inje University) ;
  • Kerr, Thomas N. Jr. (Department of Operations Procedures, James A. FitzPatrick NPP)
  • Received : 2020.06.19
  • Accepted : 2020.08.09
  • Published : 2021.03.25

Abstract

Procedures play a key role in ensuring safe operation at nuclear power plants (NPPs). Development and maintenance of a large number of procedures reflecting the best knowledge available in all relevant areas is a complex job. This paper introduces a newly developed methodology and the implemented software, called iExtractor, for the extraction of syntactic and semantic information from NPP procedures utilizing natural language processing (NLP)-based technologies. The steps of the iExtractor integrated with sets of rules and an ontology for NPPs are described in detail with examples. Case study results of the iExtractor applied to selected procedures of a U.S. commercial NPP are also introduced. It is shown that the iExtractor can provide overall comprehension of the analyzed procedures and indicate parts of procedures that need improvement. The rich information extracted from procedures could be further utilized as a basis for their enhanced management.

Keywords

Acknowledgement

This study was supported in part by Korea Hydro & Nuclear Power Co., Ltd., Republic of Korea. (No. 17-Tech-14).

References

  1. International Atomic Energy Agency, Good Practices with Respect to the Development and Use of Nuclear Power Plant Procedures, IAEA-TECDOC1058, Vienna, 1998.
  2. Procedure Professionals Association, Procedure Writer's Manual, PPA AP-907-005, 2016. Revision 2.
  3. U.S. Department of Energy, Writer's Guide for Technical Procedures, DOE-STD1029-92, Revision 0, 1998.
  4. Procedure Professionals Association, Functional Requirements for Advanced and Adaptive Smart Documents, PPA AP-907-005.001, 2017. Revision 0.
  5. S.C. Peres, N. Quddus, P. Kannan, L. Ahmed, P. Ritchey, W. Johnson, S. Rahmani, M.S. Mannan, A summary and synthesis of procedural regulations and standards - informing a procedures writer's guide, J. Loss Prev. Process. Ind. 44 (2016) 726-734, https://doi.org/10.1016/j.jlp.2016.08.003.
  6. K. Thomas, B. Hallbert, Long-term Instrumentation, Information, and Vontrol Systems (II&C) Modernization Future Vision and Strategy, INL/EXT-11-24154, Revision 2, Idaho National Laboratory, 2013.
  7. P.H. Seong, H.G. Kang, M.G. Na, J.H. Kim, G. Heo, Y. Jung, Advanced MMIS toward substantial reduction in human errors in NPPs, Nucl. Eng. Technol. 45 (2013) 125-140, https://doi.org/10.5516/NET.04.2013.700.
  8. S.J. Lee, P.H. Seong, Design of an integrated operator support system for advanced NPP MCRs: issues and perspectives, in: H. Yoshikawa, Z. Zhang (Eds.), Prog. Nucl. Saf. Symbiosis Sustain. Adv. Digit. Instrumentation, Control Inf. Syst. Nucl. Power Plants, Springer Japan, Tokyo, 2014, pp. 11-26, https://doi.org/10.1007/978-4-431-54610-8_2.
  9. International Atomic Energy Agency, Information Technology for Nuclear Power Plant Configuration Management, 2010. IAEA-TECDOC-1651.
  10. Procedure Professionals Association, Procedure Process Description, 2016. PPA AP-907-001, Revision 2.
  11. J. Cowie, Y. Wilks, Information extraction, Commun. ACM 39 (1996) 80-91. https://doi.org/10.1145/234173.234209
  12. S. Sarawagi, Information extraction, found, Trends Databases. 1 (2008) 261-377. https://doi.org/10.1561/1900000003
  13. M. Mannai, W.B.A. Karaa, H.H. Ben Ghezala, Information extraction approaches: a survey, in: D.K. Mishra, A.T. Azar, A. Joshi (Eds.), Inf. Commun. Technol., Springer Singapore, Singapore, 2018, pp. 289-297.
  14. D.E. Appelt, B. Onyshkevych, The common pattern specification language, in: Proc. A Work. Held Balt. Maryl. Oct. 13-15, 1998, Association for Computational Linguistics, USA, 1998, pp. 23-30, https://doi.org/10.3115/1119089.1119095.
  15. H. Cunningham, D. Maynard, V. Tablan, Jape: a Java Annotation Patterns Engine, Univ. of Sheffield, Department of Computer Science, 2000. Technical report CSe00-10.
  16. J. Zhang, N.M. El-Gohary, Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking, J. Comput. Civ. Eng. 30 (2016) 4015014. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000346
  17. N. Ireson, F. Ciravegna, M.E. Califf, D. Freitag, N. Kushmerick, A. Lavelli, Evaluating machine learning for information extraction, Proc. 22nd Int. Conf. Mach. Learn. (2005) 345-352.
  18. P. Zhou, N. El-Gohary, Ontology-based automated information extraction from building energy conservation codes, Autom. ConStruct. 74 (2017) 103-117, https://doi.org/10.1016/j.autcon.2016.09.004.
  19. W. Wong, W. Liu, M. Bennamoun, Ontology learning from text: a look back and into the future, ACM Comput. Surv. 44 (2012).
  20. N.F. Noy, D.L. McGuinness, Ontology Development 101: A Guide to Creating Your First Ontology, KSL-01-05, Stanford knowledge systems laboratory, Stanford, CA, 2001.
  21. C. Arora, M. Sabetzadeh, L. Briand, F. Zimmer, Automated checking of conformance to requirements templates using natural language processing, IEEE Trans. Software Eng. 41 (2015) 944-968. https://doi.org/10.1109/TSE.2015.2428709
  22. Y. Zhao, X. Diao, J. Huang, C. Smidts, Automated identification of causal relationships in nuclear power plant event reports, Nucl. Technol. 205 (2019) 1021-1034, https://doi.org/10.1080/00295450.2019.1580967.
  23. A. Jain, M. Ganesamoorty, NukeBERT: a Pre-trained Language Model for Low Resource Nuclear Domain, 2020. ArXiv Prepr. ArXiv2003.13821.
  24. J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2018. ArXiv Prepr. ArXiv1810.04805.
  25. P. Shi, J. Huo, Q. Wang, Constructing ontology for knowledge sharing of materials failure analysis, Data Sci. J. 12 (2014) 181-190. https://doi.org/10.2481/dsj.12-047
  26. N.M. Meenachi, M.S. Baba, Development of semantic web-based knowledge management for nuclear reactor (KMNuR) portal, DESIDOC J. Libr. Inf. Technol. 34 (2014) 426-434. https://doi.org/10.14429/djlit.34.7002
  27. A. Pruttianan, N. Lau, V. Tech, S. Anders, M.B. Weinger, Ontology to guide scenario design to evaluate new technologies for control room modernization, in: Nucl. Plant Instrument, Control Hum. Mach. Interface Technol., 2017, pp. 206-214.
  28. R. Elhdad, N. Chilamkurti, T. Torabi, An ontology-based framework for process monitoring and maintenance in petroleum plant, J. Loss Prev. Process. Ind. 26 (2013) 104-116. https://doi.org/10.1016/j.jlp.2012.10.001
  29. T. Sobral, T. Galvao, J. Borges, An ontology-based approach to knowledgeassisted integration and visualization of urban mobility data, Expert Syst. Appl. 150 (2020) 113260, https://doi.org/10.1016/j.eswa.2020.113260.
  30. Y. Wu, V. Ebrahimipour, S. Yacout, Ontology-based modeling of aircraft to support maintenance management system, IIE Annu. Conf. Proc. (2014) 1159-1168.
  31. R.C. Ward, A. Sorokine, Nuclear Power Plant Ontology in OWL Format, ORNL/LTR-2012/467, Oak Ridge National Laboratory (ORNL), U.S. Department of Energy, 2012.
  32. M.A. Musen, The protege project: a look back and a look forward, AI Matters 1(2015) 4-12. https://doi.org/10.1145/2757001.2757003
  33. Gate, ANNIE: a Nearly-New Information Extraction System, 2018. https://gate.ac.uk/ie/annie.htm.
  34. Stanford NLP Group, Stanford Named Entity Recognizer NER. https://nlp.stanford.edu/software/CRF-NER.shtml, 2018.
  35. Microsoft, NET API, 2018. https://docs.microsoft.com/en-us/dotnet/api/microsoft.office.interop.word?view=word-pia.
  36. C. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. Bethard, D. McClosky, The Stanford CoreNLP natural language processing toolkit, in: Proc. 52nd Annu. Meet. Assoc. Comput. Linguist. Syst. Demonstr., 2014, pp. 55-60.
  37. H. Ryu, Open Korean Text Processor, 2018. https://github.com/open-koreantext/open-korean-text.
  38. C.D. Manning, C.D. Manning, H. Schutze, Foundations of Statistical Natural Language Processing, MIT press, 1999.
  39. D. Jurafsky, J.H. Martin, Speech and Language Processing: an Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, second ed., Prentice-Hall, Upper Saddle River, NJ, 2009.
  40. M.P. Marcus, M.A. Marcinkiewicz, B. Santorini, Building a large annotated corpus of English: the penn treebank, comput, Linguist 19 (1993) 313-330.
  41. M.-C. De Marneffe, C.D. Manning, Stanford Typed Dependencies Manual, Technical Report, Stanford University, 2008.
  42. D. Sarkar, Text Analytics with Python: a Practical Real-World Approach to Gaining Actionable Insights from Your Data, Apress, 2016.
  43. P. Paroubek, Evaluating part-of-speech tagging and parsing, in: Eval. Text Speech Syst., Springer, 2007, pp. 99-124.
  44. G.A. Miller, WordNet: a lexical database for English, Commun. ACM 38 (1995) 39-41. https://doi.org/10.1145/219717.219748
  45. U.S. Nuclear Regulatory Commission, APR1400 Design Control Document and Environmental Report, 2018. https://www.nrc.gov/reactors/new-reactors/design-cert/apr1400/dcd.html.
  46. Korea Hydro, Nuclear Power, Glossary of Nuclear Power, 2017.
  47. J. Oxstrand, K. LeBlanc, Computer-based Procedure for Field Activities: Results from Three Evaluations at Nuclear Power Plants, Idaho National Laboratory, Idaho Falls, ID (United States), 2014.
  48. J. Piskorski, R. Yangarber, Information extraction: past, present and future, in: Multi-Source, Multiling. Inf. Extr. Summ., Springer, 2013, pp. 23-49.
  49. D.C. Wimalasuriya, D. Dou, Ontology-based information extraction: an introduction and a survey of current approaches, J. Inf. Sci. 36 (2010) 306-323, https://doi.org/10.1177/0165551509360123.
  50. Electric Power Research Institute, Improving the Execution and Productivity of Maintenance with Electronic Work Packages: a Mobile Work Management Initiative, 2015.