DOI QR코드

DOI QR Code

A Combinational Method to Determining Identical Entities from Heterogeneous Knowledge Graphs

  • Kim, Haklae (Korea Institute of Science and Technology Information)
  • Received : 2017.12.07
  • Accepted : 2018.07.09
  • Published : 2018.09.30

Abstract

With the increasing demand for intelligent services, knowledge graph technologies have attracted much attention. Various application-specific knowledge bases have been developed in industry and academia. In particular, open knowledge bases play an important role for constructing a new knowledge base by serving as a reference data source. However, identifying the same entities among heterogeneous knowledge sources is not trivial. This study focuses on extracting and determining exact and precise entities, which is essential for merging and fusing various knowledge sources. To achieve this, several algorithms for extracting the same entities are proposed and then their performance is evaluated using real-world knowledge sources.

Keywords

References

  1. Berners-Lee, T. (2009). The semantic web: linked data. Retrieved Jun 10, 2018 from https://www.w3.org/DesignIssues/LinkedData.html.
  2. Bizer, C., Cyganiak, R., & Heath, T. (2007). How to publish linked data on the web. Retrieved Jun 10, 2018 from http://wifo5-03.informatik.uni-mannheim.de/bizer/pub/LinkedDataTutorial/.
  3. Bollacker, K., Evans, C., Paritosh, P., Sturge, T., & Taylor, J. (2008). Freebase: A collaboratively created graph database for structuring human knowledge. In SIGMOD '08: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data (pp. 1247-1250). New York, NY: ACM.
  4. Castano, S., Ferrara, A., Montanelli, S., & Lorusso, D. (2008). Instance matching for ontology population. In S. Gaglio, I. Infantino, & D. Sacca (Eds.), Proceedings of the Sixteenth Italian Symposium on Advanced Database Systems (pp. 121-132). Mondello, Italy: SEBD.
  5. Ding, L., Shinavier, J., Shangguan, Z., & McGuinness, D. L. (2010). SameAs networks and beyond: Analyzing deployment status and implications of owl: sameAs in linked data. In P. F. Patel-Schneider, Y. Pan, P. Hitzler, P. Mika, L. Zhang, J. Z. Pan,...B. Glimm (Eds.), International Semantic Web Conference (pp. 145-160). Berlin: Springer.
  6. Dong, X., Gabrilovich, E., Heitz, G., Horn, W., Lao, N., Murphy, K.,...Zhang, W. (2014). Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 601-610).
  7. Enriquez, J. G., Mayo, F. J. D., Cuaresma, M. J. E., Ross, M., & Staples, G. (2017). Entity reconciliation in big data sources: A systematic mapping study. Expert Systems with Applications, 80, 14-27.
  8. Farber, M., Ell, B., Menne, C., & Rettinger, A. (2015). A comparative survey of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO. Semantic Web Journal, 1, 1-5.
  9. Gabrilovich, E., & Usunier, N. (2016). Constructing and mining web-scale knowledge graphs. In R. Perego, F. Sebastiani, J. A. Aslam, I. Ruthven, & J. Zobel (Eds.), SIGIR '16 Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval (pp. 1195-1197). New York, NY: ACM.
  10. Gottron, T., & Staab, S. (2014). Linked open data. In Encyclopedia of social network analysis and mining (pp. 811-813). New York, NY: Springer.
  11. Halpin, H., Hayes, P., McCusker, J. P., McGuinness, D., & Thompson, H. S. (2010). When owl:sameAs isn't the same: An analysis of identity in linked data. In Proceedings of the 9th International Semantic Web Conference (ISWC) (pp. 53-59). Berlin, Heidelberg: IOS Press.
  12. Hogan, A., Zimmermann, A., Umbrich, J., Polleres, A., & Decker, S. (2012). Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corpora. Journal of Web Semantics, 10, 76-110.
  13. Hors, A. L., & Speicher, S. (2014). Using read-write linked data for application integration. In A. Harth, K. Hose, & R. Schenkel (Eds.), Linked data management (pp. 459-483). Lyon, France: Chapman and Hall/CRC.
  14. Idrissou, A. K., Hoekstra, R., van Harmelen, F., Khalili, A., & den Besselaar, P. V. (2017). Is my sameAs the same as your sameAs? Lenticular lenses for context-specific identity. In O. Corcho, K. Janowicz, G. Rizzo, I. Tiddi, & D. Garijo (Eds.), K-CAP (pp. 23:1-23:8). New York, NY: ACM.
  15. Kim, H., Liang, H., & Ying, D. (2014). Knowledge extraction framework for building a largescale knowledge base. EAI Endorsed Transactions on Industrial Networks and Intelligent Systems, 16(7), 1-8.
  16. Lehmann, J., Bizer, C., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., & Hellmann, S. (2009). DBpedia: A crystallization point for the Web of Data. Journal of Web Semantics, 7, 154-165.
  17. Moaawad, M. R., Mokhtar, H. M. O., & al Feel, H. T. (2017). On-the-fly academic linked data integration. In ICCDA '17 Proceedings of the International Conference on Compute and Data Analysis (pp. 114-122). New York, NY: ACM.
  18. Nguyen, K., & Ichise, R. (2016). Linked data entity resolution system enhanced by configuration learning algorithm. IEICE Transactions, 99-D, 1521-1530.
  19. Paulheim, H. (2017). Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic Web, 8, 489-508.
  20. Stefanidis, K., Efthymiou, V., Herschel, M., & Christophides, V. (2014). Entity resolution in the web of data. In Proceedings of the 23rd International Conference on World Wide Web (WWW '14 Companion) (pp. 203-204). New York, NY: ACM.
  21. Suchanek, F., Kasneci, G., & Weikum, G. (2007). YAGO-A core of semantic knowledge. In Proceedings of International Conference on World Wide Web (pp. 697-706). New York, NY: ACM.
  22. Tanon, T. P., Vrandecic, D., Schaffert, S., Steiner, T., & Pintscher, L. (2016). From Freebase to Wikidata: The great migration. In Proceedings of the 25th International Conference on World Wide Web (WWW '16) (pp. 1419-1428). Geneva, Switzerland: International World Wide Web Conferences Steering Committee.
  23. Vrandecic, D. (2012). Wikidata: A new platform for collaborative data collection. In A. Mille, F. L. Gandon, J. Misselis, M. Rabinovich, & S. Staab (Eds.), WWW (Companion Volume) (pp. 1063-1064). New York, NY: ACM.
  24. Wang, Q., Mao, Z., Wang, B., & Guo, L. (2017). Knowledge graph embedding: A survey of approaches and applications. IEEE Transactions on Knowledge and Data Engineering, 29, 2724-2743.
  25. Yager, R. R. (1987). On the Dempster-Shafer framework and new combination rules. Information Sciences, 41(2), 93-137. https://doi.org/10.1016/0020-0255(87)90007-7