DOI QR코드

DOI QR Code

Implementation of Policy based In-depth Searching for Identical Entities and Cleansing System in LOD Cloud

LOD 클라우드에서의 연결정책 기반 동일개체 심층검색 및 정제 시스템 구현

  • Received : 2018.03.04
  • Accepted : 2018.05.24
  • Published : 2018.06.30

Abstract

This paper suggests that LOD establishes its own link policy and publishes it to LOD cloud to provide identity among entities in different LODs. For specifying the link policy, we proposed vocabulary set founded on RDF model as well. We implemented Policy based In-depth Searching and Cleansing(PISC for short) system that proceeds in-depth searching across LODs by referencing the link policies. PISC has been published on Github. LODs have participated voluntarily to LOD cloud so that degree of the entity identity needs to be evaluated. PISC, therefore, evaluates the identities and cleanses the searched entities to confine them to that exceed user's criterion of entity identity level. As for searching results, PISC provides entity's detailed contents which have been collected from diverse LODs and ontology customized to the content. Simulation of PISC has been performed on DBpedia's 5 LODs. We found that similarity of 0.9 of source and target RDF triples' objects provided appropriate expansion ratio and inclusion ratio of searching result. For sufficient identity of searched entities, 3 or more target LODs are required to be specified in link policy.

본 연구에서는 동일연결트리플들을 생성하는 대신 각 LOD마다 연결정책을 수립, 공개하고 검색 시점에서 참조하는 방식으로 개체간의 동일성을 파악하는 방안과 이러한 연결정책을 명세하기 위한 어휘를 제안하였다. 또한, 연졀정책이 운영되는 환경에서 여러 LOD들에 걸친 심층검색이 실질적으로 진행되는 것을 확인하기 위하여 PISC(Policy based In-depth Searching and Cleansing)을 구현하였으며 이를 Github에 공개하였다. LOD 클라우드는 여러 LOD들의 자발적인 참여로 이루어짐에 따라 검색된 개체들의 동일성에 대한 평가가 필요하다. 이에, PISC는 개체간 동일성 평가를 통하여 사용자가 요구한 동일수준 이상의 개체들로 정제된 검색결과를 제공한다. 검색결과로는 RDF로 모델링된 개체별 상세 검색내용과 이에 대한 의미적 구조인 온톨로지를 함께 제공된다. PISC에 대한 실험은 DBpedia의 5개 LOD를 대상으로 진행하였으며 소스와 타겟 RDF 트리플 목적어의 유사도를 0.9 정도로 요구할 경우 검색결과가 적절한 확장률과 포함률을 가지는 것으로 확인하였다. 또한, 연결정책에는 3개 이상의 타겟LOD를 명세할 경우 동일성이 충분히 검증된 개체들을 확보할 수 있는 것으로 확인하였다.

Keywords

References

  1. Heath and C. Bizer, Linked Data: Evolving the Web into a Global Data Space, Morgan & Claypool, pp. 56-71, 2011
  2. T. B. Lee, "Semantic Web Road Map", https://www.w3.org/DesignIssues/Semantic.html, 1998
  3. A. Abele and J. McCrae, "The Linked Open Data cloud diagram, 2017". http://lod-cloud.net/, 2018
  4. Harth, A., et al., Linked Data Management, 1st Ed., 20-25. CRC Press, pp. 31-68, 2014
  5. Heath, T. and Bizer, C. Linked Data: Evolving the Web into a Global Data Space, Morgan & Claypool, pp. 178-220, 2011
  6. A. Dean and H. James, Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL, Elsevier, pp 132-143, 2011
  7. N. Konstantinou, N., Materializing the Web of Linked Data, 1st Ed., Springer, pp. 118-132., 2015
  8. C. Bizer, "Is the Semantic Web what we expected, 2017". https://www.slideshare.net/bizer/is-the-semantic-web-what-we-expected-adoption-patterns-and-contentdriven-challenges-iswc-2016-keynote, 2016
  9. W3C, "What is Linked Data, 2017", https://www.w3.org/standards/semanticweb/data, 2017
  10. J. Volz J., et al., "Silk - A Link Discovery Framework for the Web of Data", Proc. of the 2nd Workshop on Linked Data on the Web 2009, pp. 238-247, 2009. http://www.researchgate.net/publication/228638267_Silk-A_Link_Discovery_Framework_for_the_Web_of_Data
  11. A. Ngonga and S. Auer, "LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data", Proc. of the 22nd IJCAI, pp. 2312-2317, 2011. http://svn.aksw.org/papers/2011/WWW_LIMES/public.pdf
  12. J. Park and Y. Sohn., "A Syntax Added Link Evaluation Technique for Improving Trustworthiness of LOD's Linkages", Journal of KIISE: Databases, Vol. 41, No. 1), pp. 45-61, 2014. http://www.dbpia.co.kr/Journal/ArticleDetail/NODE02360287
  13. J. Park and Y. Sohn., "Trustworthiness Improving Link Evaluation Technique for LOD Linkages giving Considerations to the Syntactic Properties of RDFS, OWL, and OWL2", Journal of KIISE: Databases, Vol. 41, No. 4, pp. 226-241, 2014. http://www.dbpia.co.kr/Journal/ArticleDetail/NODE02457716
  14. S. Brin, et al., "The PageRank Citation Ranking-Bringing Order to the Web", http://ilpubs.stanford.edu/422/1/1999-66.pdf. 1998