Browse > Article
http://dx.doi.org/10.9708/jksci.2022.27.09.077

Automatic Creation of SHACL Schemas for Validation of RDF Knowledge Graph Structures Based on RML Mappings  

Choi, Ji-Woong (School of Computer Science and Engineering, Soongsil University)
Abstract
In this paper, we propose a system which automatically generates SHACL schemas to describe and validate RDF knowledge graphs constructed by RML mappings. Unlike existing studies, the proposed system generates the schemas based on not only RML mapping rules but also metadata extracted from RML mapping input data in various formats such as CSV, JSON, XML or databases. Therefore, our schemas include the constraints on data type, string length, value range and cardinality, which were not present in the existing schemas. And we solves the problem with "repeated properties" which overlooked in existing studies. Through a conformance test consisting of 297 cases, we show that the proposed system generates correct constraints for the graphs. The proposed system can contribute to automation of the tedious and error-prone existing manual validation processes.
Keywords
RML; SHACL; RDF; Knowledge Graph; Validation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. Heyvaert, A. Dimou and B. D. Meester, "RML Test Cases," https://rml.io/test-cases/
2 S. Das, S. Sundara and R. Cyganiak, "R2RML: RDB to RDF Mapping Language," https://www.w3.org/TR/r2rml/
3 H. Paulheim, "Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods," Semantic Web, Vol. 8, No. 3, pp. 489-508, December 2016. DOI: 10.3233/SW-160218   DOI
4 L. Bellomarini, E. Sallinger, and S. Vahdati, "Knowledge Graphs: The Layered Perspective," Knowledge Graphs and Big Data Processing, Vol. 12072, No. 2, pp. 20-34, July 2020. DOI: 10.1007/978-3-030-53199-7_2   DOI
5 A. Dimou, "High-quality knowledge graphs generation: R2rml and rml comparison, rules validation and inconsistency resolution," Applications and Practices in Ontology Design, Extraction, and Reasoning, Vol. 49, No. 4, pp. 55-72, November 2020. DOI: 10.3233/SSW200035   DOI
6 T. Delva, and S. M. Oo, "RML2SHACL," https://github.com/RMLio/RML2SHACL/
7 R. Reinanda, E. Meij, and M. d. Rijke, "Knowledge Graphs: An Information Retrieval Perspective," Foundations and Trends® in Information Retrieval, Vol. 14, No. 4, pp. 289-444, October 2020. DOI: 10.1561/1500000063   DOI
8 T. Delva, B. D. Smedt, S. M. Oo, D. V. Assche, S. Lieber, and A. Dimou, "RML2SHACL: RDF Generation Is Shaping Up," Proceedings of the 11th on Knowledge Capture Conference (K-CAP '21), pp. 153-160, New York, USA, December 2021. DOI: 10.1145/3460210.3493562Q   DOI
9 D. Beckett, T. Berners-Lee, E. Prud'hommeaux, and G. Carothers, "RDF 1.1 Turtle," https://www.w3.org/TR/turtle/
10 E. Iglesias, S. Jozashoori, D. Chaves-Fraga, D. Collarana, and M. Vida, "SDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphs," Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 3039-3046, Virtual Event, Ireland, October, 2020. DOI: 10.1145/3340531.3412881   DOI
11 H. Knublauch and D. Kontokostas, "Shapes Constraint Language (SHACL)," https://www.w3.org/TR/shacl/
12 J. E. L. Gayo, E. Prud'hommeaux, I. Boneva, and D. Kontokostas, "Validating RDF Data," Springer Nature, pp. 251-253, 2022.
13 G. Carothers, "RDF 1.1 N-Quads," https://www.w3.org/TR/n-quads/
14 B. Abu-Salih, M. Al-Tawil, I. Aljarah, H. Faris, P. Wongthongtham, K. Y. Chan, and A. Beheshti, "Relational Learning Analysis of Social Politics using Knowledge Graph Embedding," Data Mining and Knowledge Discovery, Vol. 35, No. 4, pp. 1497-1536, July 2021. DOI: 10.1007/s10618-021-00760-w   DOI
15 G. Alor-Hernandez, J. L. Sanchez-Cervantes, A. Rodriguez-Gonzalez, and R. Valencia-Garcia, "Current Trends in Semantic Web Technologies: Theory and Practice," Springer, pp. 25-56, 2019.
16 R. Reda, F. Piccinini, and A. Carbonaro, "Semantic Modelling of Smart Healthcare Data," Proceedings of SAI Intelligent Systems Conference, pp. 399-411, London, United Kingdom, September 2018. DOI: 10.1007/978-3-030-01057-7_32   DOI