DOI QR코드

DOI QR Code

A Study on Distributed Parallel SWRL Inference in an In-Memory-Based Cluster Environment

인메모리 기반의 클러스터 환경에서 분산 병렬 SWRL 추론에 대한 연구

  • Received : 2017.05.19
  • Accepted : 2017.12.25
  • Published : 2018.03.15

Abstract

Recently, there are many of studies on SWRL reasoning engine based on user-defined rules in a distributed environment using a large-scale ontology. Unlike the schema based axiom rules, efficient inference orders cannot be defined in SWRL rules. There is also a large volumet of network shuffled data produced by unnecessary iterative processes. To solve these problems, in this study, we propose a method that uses Map-Reduce algorithm and distributed in-memory framework to deduce multiple rules simultaneously and minimizes the volume data shuffling occurring between distributed machines in the cluster. For the experiment, we use WiseKB ontology composed of 200 million triples and 36 user-defined rules. We found that the proposed reasoner makes inferences in 16 minutes and is 2.7 times faster than previous reasoning systems that used LUBM benchmark dataset.

최근 들어 대용량 온톨로지를 사용하여 분산 환경에서 사용자 정의 규칙을 기반으로 하는 SWRL 추론엔진에 대한 연구가 다양하게 진행되고 있다. 스키마를 기반으로 하는 공리 규칙과 다르게 SWRL 규칙들은 미리 효율적인 추론 순서를 정의할 수 없다. 또한 불필요한 반복과정으로 인해 많은 양의 네트워크 셔플링이 발생한다. 이러한 문제점들을 해결하기 위해서 본 논문에서는 Map-Reduce 알고리즘과 인메모리 기반의 분산처리 프레임워크를 활용하여 동시에 여러 규칙을 추론할 수 있고, 클러스터의 노드간에 발생하는 데이터 셔플링의 양을 최소화할 수 있는 방법을 제안한다. 제안하는 방법의 성능을 측정하기 위해 약 2억 트리플로 구성된 WiseKB 온톨로지와 36개의 사용자 정의 규칙을 사용하여 실험을 진행했고 약 16분이 소요되었다. 또한 LUBM 벤치 마크 데이터를 이용한 비교 실험에서 기존 연구보다 2.7배 높은 성능을 보였다.

Keywords

Acknowledgement

Supported by : 한국연구재단

References

  1. Vavilapalli, Vinod Kumar, et al., "Apache hadoop yarn: Yet another resource negotiator," Proc. of the 4th annual Symposium on Cloud Computing, ACM, 2013.
  2. Zaharia, Matei, et al., "Spark: cluster computing with working sets," Proc. of the 2nd USENIX conference on Hot topics in cloud computing, Vol. 10, 2010.
  3. Silver, David, et al., "Mastering the game of Go with deep neural networks and tree search," Nature 529.7587 (2016): 484-489. https://doi.org/10.1038/nature16961
  4. Ferrucci, David A., "Introduction to "this is watson," IBM Journal of Research and Development 56.3.4 (2012): 1-1. https://doi.org/10.1147/JRD.2012.2225975
  5. Horrocks, Ian, et al., "SWRL: A semantic web rule language combining OWL and RuleML," W3C Member submission 21 (2004): 79.
  6. Dean, Jeffrey, and Sanjay Ghemawat, "MapReduce: simplified data processing on large clusters," Communications of the ACM 51.1 (2008): 107-113. https://doi.org/10.1145/1327452.1327492
  7. Urbani, Jacopo, et al., "OWL reasoning with WebPIE: calculating the closure of 100 billion triples," Extended Semantic Web Conference, Springer, Berlin, Heidelberg, 2010.
  8. Wan-Gon Lee, Sung-Hyuk Bang and Young-Tack Park, "Large Scale Incremental Reasoning using SWRL Rules in a Distributed Framework," Journal of KIISE, Vol. 44, No. 4, pp. 383-391, Apr, 2017. https://doi.org/10.5626/JOK.2017.44.4.383
  9. Wu, Haijiang, et al., "Scalable Horn-Like Rule Inference of Semantic Data Using MapReduce," International Conference on Knowledge Science, Engineering and Management. Springer International Publishing, 2014.
  10. RuleML: http://ruleml.org/
  11. Sirin, Evren, et al., "Pellet: A practical owl-dl reasoner," Web Semantics: science, services and agents on the World Wide Web 5.2 (2007)