Browse > Article
http://dx.doi.org/10.5392/JKCA.2017.17.11.209

Effective Keyword Search on Semantic RDF Data  

Park, Chang-Sup (동덕여자대학교 컴퓨터학과)
Publication Information
Abstract
As a semantic data is widely used in various applications such as Knowledge Bases and Semantic Web, needs for effective search over a large amount of RDF data have been increasing. Previous keyword search methods based on distinct root semantics only retrieve a set of answer trees having different root nodes. Thus, they often find answer trees with similar meanings or low query relevance together while those with the same root node cannot be retrieved together even if they have different meanings and high query relevance. We propose a new method to find diverse and relevant answers to the query by permitting duplication of root nodes among them. We present an efficient query processing algorithm using path indexes to find top-k answers given a maximum amount of root duplication a set of answer trees can have. We show by experiments using a real dataset that the proposed approach can produce effective answer trees which are less redundant in their content nodes and more relevant to the query than the previous method.
Keywords
Semantic Data; RDF; Keyword Search; Top-k Query;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 RDF- Semantic Web Standards, https://www.w3.org/2001/sw/wiki/RDF
2 SPARQL Query Language for RDF, http://www.w3.org/TR/rdf-sparql-query/
3 V. Kacholia, S. Pandit, S. Chakrabarti, S. Sudarshan, R. Desai, and H. Karambelkar, "Bidirectional expansion for keyword search on graph databases," Proc. Of the 31st Int. Conf. on VLDB, pp.505-516, 2005.
4 M. Kargar, A. An, and X. Yu, "Efficient duplication free and minimal keyword search in graphs," IEEE Trans. on Knowledge and Data Engineering, Vol.26, No.7, pp.1657-1669, 2014.   DOI
5 T. Tran, S. Rudolph, P. Cimiano, and H. Wang, "Top-k exploration of query candidates for efficient keyword search on graph-shaped (RDF) data," Proc. of the 25th ICDE, pp.405-416, 2009.
6 W. Le, F. Li, A. Kementsietsidis, and S. Duan, "Scalable Keyword Search on Large RDF Data," IEEE Trans. On Knowledge and Data Engineering, Vol.26, No.11, pp.2774-2788, 2014.   DOI
7 S. Buttcher, C. Clarke, and G. Cormack, Information retrieval: implementing and evaluating search engine, MIT Press, 2010.
8 R. Fagin, A. Lotem, and M. Naor, "Optimal aggregation algorithms for middleware," Journal of Computer and System Sciences, Vol.66, No.4, pp.614-656, 2003.   DOI
9 H. He, H. Wang, J. Yang, and P. S. Yu, "BLINKS: ranked keyword searches on graphs," ACM SIGMOD Conference, pp.305-316, 2007.
10 B. Ding, J. X. Yu, S. Wang, L. Qin, X. Zhang, and X. Lin, "Finding top-k min-cost connected trees in databases," Proc. Of ICDE, pp.836-845, 2007.
11 B. B. Dalvi, M. Kshirsagar, and S. Sudarshan, "Keyword search on external memory data graphs," Proc. of the VLDB Endowment, Vol.1, No.11, pp.1189-1204, 2008.   DOI
12 박창섭, "그래프 데이터에 대한 비-중복적 키워드 검색 방법," 한국콘텐츠학회논문지, 제16권, 제6호, pp.205-214, 2016.   DOI
13 K. Golenberg, B. Kimelfeld, and Y. Sagiv, "Keyword proximity search in complex data graphs," Proc. of ACM SIGMOD Conference, pp.927-940, 2008.
14 J. X. Yu, Qin, and L. Chang, "Keyword search in relational databases: : a survey," Bulletin of the IEEE CS Technical Committee on Data Engineering, Vol.33, No.1, pp.67-78, 2010.
15 C. Park and S. Lim, "Efficient processing of keyword queries over graph databases for finding effective answers," Information Proc. and Management, Vol.51, No.1, pp.42-57, 2015.   DOI
16 C. Liu, L. Yao, J. Li, R. Zhou, and Z. He, "Finding smallest k-Compact tree set for keyword queries on graphs using mapreduce," World Wide Web, Vol.19, No.3, pp.499-518, 2016.   DOI