DOI QR코드

DOI QR Code

Artificial Neural Network-based Weight Factor Determination Method for the Enhanced XML Schema Matching of Bridge Engineering Documents

교량 건설 문서의 강화된 XML 스키마 매칭을 위한 인공신경망 기반의 요소 가중치 선정 방안

  • Park, Sang I. (Research Institute for Safety Performance, Korea Authority of Land & Infrastructure Safety) ;
  • Kwon, Tae-Ho (School of Civil & Environmental Engineering, Yonsei University) ;
  • Park, Junwon (Department of Civil Engineering, Myongji College) ;
  • Seo, Kyung-Wan (School of Civil & Environmental Engineering, Yonsei University) ;
  • Yoon, Young-Cheol (Department of Civil Engineering, Myongji College)
  • 박상일 (국토안전관리원 안전성능연구소) ;
  • 권태호 (연세대학교 건설환경공학과) ;
  • 박준원 (명지전문대학 토목공학과) ;
  • 서경완 (연세대학교 건설환경공학과) ;
  • 윤영철 (명지전문대학 토목공학과)
  • Received : 2022.01.18
  • Accepted : 2022.02.21
  • Published : 2022.02.28

Abstract

Bridge engineering documents have essential contents that must be referenced continuously throughout a structure's entire life cycle, but research related to the quality of the contents is still lacking. XML schema matching is an excellent technique to improve the quality of stored data; however, it takes excessive computing time when applied to documents with many contents and a deep hierarchical structure, such as bridge engineering documents. Moreover, it requires a manual parametric study for matching elements' weight factors, maintaining a high matching accuracy. This study proposes an efficient weight-factor determination method based on an artificial neural network (ANN) model using the simplified XML schema-matching method proposed in a previous research to reduce the computing time. The ANN model was generated and verified using 580 data of document properties, weight factors, and matching accuracy. The proposed ANN-based schema-matching method showed superiority in terms of accuracy and efficiency compared with the previous study on XML schema matching for bridge engineering documents.

Keywords

Acknowledgement

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1F1A1048259).

References

  1. S. Liu, C. A. McMahon, M. J. Darlington, S. J. Culley and P. J. Wild, "A Computational Framework for Retrieval of Document Fragments based on Decomposition Schemes in Engineering Information Management", Adv. Eng. Inform., Vol. 20, No. 4, pp. 401-413, 2006. https://doi.org/10.1016/j.aei.2006.05.008
  2. Y. Zhu, R. R. Issa and R. F. Cox, "Web-based Construction Document Processing via Malleable Frame", J. Comput. Civil. Eng., Vol. 15, No. 3, pp. 157-169, 2001. https://doi.org/10.1061/(ASCE)0887-3801(2001)15:3(157)
  3. M. Zhiliang, L. Heng, Q. Shen and Y. Jun, "Using XML to Support Information Exchange in Construction Projects", Autom. Constr., Vol. 13, No. 5, pp. 629-637, 2004. https://doi.org/10.1016/j.autcon.2004.04.010
  4. M. Zhiliang, K. D. Wong, L. Heng and Y. Jun, "Utilizing Exchanged Documents in Construction Projects for Decision Support based on Data Warehousing Technique", Autom. Constr., Vol. 14, No. 3, pp. 405-412, 2005. https://doi.org/10.1016/j.autcon.2004.08.016
  5. Y. Suh, "Data Analytics for Social Risk Forecasting and Assessment of New Technology", J. Korean Soc. Saf., Vol. 32, No. 3, pp. 83-89, 2017. https://doi.org/10.14346/JKOSOS.2017.32.3.83
  6. B. S. Kim, S. Chang and Y. Suh, "Text Analytics for Classifying Types of Accident Occurrence using Accident Report Documents", J. Korean Soc. Saf., Vol. 33, No. 3, pp. 58-64, 2018. https://doi.org/10.14346/JKOSOS.2018.33.3.58
  7. Y. Rezgui, "Ontology-centered Knowledge Management using Information Retrieval Techniques", J. Comput. Civil. Eng., Vol. 20, No. 4, pp. 261-270, 2006. https://doi.org/10.1061/(ASCE)0887-3801(2006)20:4(261)
  8. L. Soibelman, J. Wu, C. Caldas, I. Brilakis and K.-Y. Lin, "Management and Analysis of Unstructured Construction Data Types", Adv. Eng. Inform., Vol. 22, No. 1, pp. 15-27, 2008. https://doi.org/10.1016/j.aei.2007.08.011
  9. S. Liu, C. A. McMahon and S. J. Culley, "A Review of Structured Document Retrieval (SDR) Technology to Improve Information Access Performance in Engineering Document Management", Comput. Ind., Vol. 59, No. 1, pp. 3-16, 2008. https://doi.org/10.1016/j.compind.2007.08.001
  10. J. Zhang and N. M. El-Gohary, "Semantic NLP-based Information Extraction from Construction Regulatory Documents for Automated Compliance Checking", J. Comput. Civil. Eng., Vol. 30, No. 2, p. 04015014, 2016. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000346
  11. M. Al Qady and A. Kandil, "Concept Relation Extraction from Construction Documents using Natural Language Processing", J. Constr. Eng. Manage., Vol. 136, No. 3, pp. 294-302, 2010. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000131
  12. D. A. Salama and N. M. El-Gohary, "Automated Compliance Checking of Construction Operation Plans using a Deontology for the Construction Domain", J. Comput. Civil. Eng., Vol. 27, No. 6, pp. 681-698, 2013. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000298
  13. B.-G. Kim, S. I. Park, H.-J. Kim and S.-H. Lee, "Automatic Extraction of Apparent Semantic Structure from Text Contents of a Structural Calculation Document", J. Comput. Civil. Eng., Vol. 24, No. 3, pp. 313-324, 2010. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000047
  14. N. M. El-Gohary and T. E. El-Diraby, "Domain Ontology for Processes in Infrastructure and Construction", J. Constr. Eng. Manage., Vol. 136, No. 7, pp. 730-744, 2010. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000178
  15. K. Liu and N. El-Gohary, "Ontology-based Semi-supervised Conditional Random Fields for Automated Information Extraction from Bridge Inspection Reports", Autom. Constr, Vol. 81, pp. 313-327, 2017. https://doi.org/10.1016/j.autcon.2017.02.003
  16. E. Rahm and P. A. Bernstein, "A Survey of Approaches to Automatic Schema matching", the VLDB Journal, Vol. 10, No. 4, pp. 334-350, 2001. https://doi.org/10.1007/s007780100057
  17. J. Pouliot, S. Larrivee, C. Ellul and A. Boudhaim, "Exploring Schema Matching to Compare Geospatial Standards: Application to Underground Utility Networks", Intl. Arch. Photogramm, Remote Sens. Spat. Inf. Sci., Vol. 42, No. 4/W10, pp. 157-164, 2018.
  18. F. Troncoso-Pastoriza, P. Eguia-Oller, R. P. Diaz-Redondo and E. Granada-Alvarez, "Generation of BIM Data based on the Automatic Detection, Identification and Localization of Lamps in Buildings", Sust. Cities Soc., Vol. 36, pp. 59-70, 2018. https://doi.org/10.1016/j.scs.2017.10.015
  19. S. Yang, R. Wei, and A. Shigarov, "Semantic Interoperability for Electronic Business through a Novel Cross-context Semantic Document Exchange Approach", in Proceedings of the ACM Symposium on Document Engineering 2018, 2018, pp. 1-10.
  20. S. Yi, B. Huang and W. T. Chan, "XML Application Schema Matching using Similarity Measure and Relaxation Labeling", Inf. Sci., Vol. 169, No. 1-2, pp. 27-46, 2005. https://doi.org/10.1016/j.ins.2004.02.013
  21. S. I. Park, B.-G. Kim and S.-H. Lee, "An Efficient Application of XML Schema Matching Technique to Structural Calculation Document of Bridge", KSCE J. Civ. Eng., Vol. 32, No. 1D, pp. 51-59, 2012.
  22. S. I. Park and S.-H. Lee, "Heuristic Solution using Decision Tree Model for Enhanced XML Schema Matching of Bridge Structural Calculation Documents", Front. Struct. Civ. Eng., Vol. 14, No. 6, pp. 1403-1417, 2021.
  23. W.-S. Li and C. Clifton, "SEMINT: A Tool for Identifying Attribute Correspondences in Heterogeneous Databases using Neural Networks", Data Knowl. Eng., Vol. 33, No. 1, pp. 49-84, 2000. https://doi.org/10.1016/S0169-023X(99)00044-0
  24. J. Madhavan, P. A. Bernstein and E. Rahm, "Generic Schema Matching with Cupid", in vldb, 2001, Vol. 1: Citeseer, pp. 49-58.
  25. A. Doan, J. Madhavan, P. Domingos and A. Halevy, "Learning to Map between Otologies on the Semantic Web", Proceedings of the 11th International Conference on World Wide Web, pp. 662-673, 2002.
  26. S. Melnik, H. Garcia-Molina and E. Rahm, "Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching", Proceedings 18th International Conference on Data Engineering, IEEE, pp. 117-128, 2002.
  27. J. Lin, "Multiple-objective Problems: Pareto-optimal Solutions by Method of Proper Equality Constraints", IEEE Trans. Autom. Control, Vol. 21, No. 5, pp. 641-650, 1976. https://doi.org/10.1109/TAC.1976.1101338
  28. U. Fayyad, G. Piatetsky-Shapiro and P. Smyth, "From Data Mining to Knowledge Discovery in Databases", AI Mag., Vol. 17, No. 3, pp. 37-37, 1996.
  29. P. Adriaans and D. Zantinge, "Data Mining Addison Wesley Longman Limited", Edinbourgh Gate, Harlow, CM20 2JE, England, 1996.
  30. F. Burden and D. Winkler, "Bayesian Regularization of Neural Networks", Artif. Neural Netw., pp. 23-42, 2008.
  31. D. J. MacKay, "A Practical Bayesian Framework for Backpropagation Networks", Neural Comput., Vol. 4, No. 3, pp. 448-472, 1992. https://doi.org/10.1162/neco.1992.4.3.448
  32. D. M. Hawkins, S. C. Basak and D. Mills, "Assessing Model fit by Cross-validation", J. Chem. Inf. Comput. Sci. Vol. 43, No. 2, pp. 579-586, 2003. https://doi.org/10.1021/ci025626i
  33. B. G. Kim, S. I. Park, H. J. Kim and S. H. Lee, "Automatic Extraction of Apparent Semantic Structure from Text Contents of a Structural Calculation Document", J. Comput. Civil. Eng., Vol. 24, No. 3, pp. 313-324, 2010. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000047