Browse > Article
http://dx.doi.org/10.5392/JKCA.2020.20.03.018

Improving Diversity of Keyword Search on Graph-structured Data by Controlling Similarity of Content Nodes  

Park, Chang-Sup (동덕여자대학교 컴퓨터학과)
Publication Information
Abstract
Recently, as graph-structured data is widely used in various fields such as social networks and semantic Webs, needs for an effective and efficient search on a large amount of graph data have been increasing. Previous keyword-based search methods often find results by considering only the relevance to a given query. However, they are likely to produce semantically similar results by selecting answers which have high query relevance but share the same content nodes. To improve the diversity of search results, we propose a top-k search method that finds a set of subtrees which are not only relevant but also diverse in terms of the content nodes by controlling their similarity. We define a criterion for a set of diverse answer trees and design two kinds of diversified top-k search algorithms which are based on incremental enumeration and A heuristic search, respectively. We also suggest an improvement on the A search algorithm to enhance its performance. We show by experiments using real data sets that the proposed heuristic search method can find relevant answers with diverse content nodes efficiently.
Keywords
Graph-structured Data; Keyword Search; Top-k Query; Diversification; $A^{\ast}$ Search;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 K. Golenberg, B. Kimelfeld, and Y. Sagiv, "Keyword proximity search in complex data graphs," Proc. of ACM SIGMOD Conference, pp.927-940, 2008.
2 C. Liu, L. Yao, J. Li, R. Zhou, and Z. He, "Finding smallest k-Compact tree set for keyword queries on graphs using map-reduce," World Wide Web, Vol.19, No.3, pp.499-518, 2016.   DOI
3 V. Kacholia, S. Pandit, S. Chakrabarti, S. Sudarshan, R. Desai, and H. Karambelkar, "Bidirectional expansion for keyword search on graph databases," Proc. of the 31st Int'l Conference on VLDB, pp.505-516, 2005.
4 H. He, H. Wang, J. Yang, and P. S. Yu, "BLINKS: ranked keyword searches on graphs," ACM SIGMOD Conference, pp.305-316, 2007.
5 박창섭, "그래프 데이터에 대한 비-중복적 키워드 검색 방법," 한국콘텐츠학회논문지, 제16권, 제6호, pp.205-214, 2016.   DOI
6 C. S. Park, "Reducing redundancy in keyword query processing on graph databases," Journal of Information Science and Engineering, Vol.34, No.2, pp.551-574, 2018.   DOI
7 C. S. Park, "Effective keyword search on graph data using limited root redundancy of answer trees," Int'l Journal of Web Information Systems, Vol.14, No.3, pp.299-316, 2018.   DOI
8 L. Qin, J. X. Yu, L. Chang, and Y. Tao, "Querying communities in relational databases," Proc. of IEEE International Conference on Data Engineering, pp.724-735. 2009.
9 M. Kargar and A. An, "Keyword search in graphs: finding r-cliques," Proc. of the VLDB Endowment, Vol.4, pp.681-692, 2011.   DOI
10 B. Ding, J. X. Yu, S. Wang, L. Qin, X. Zhang, and X. Lin, "Finding top-k min-cost connected trees in databases," Proc. of ICDE, pp.836-845, 2007.
11 W. Le, F. Li, A. Kementsietsidis, and S. Duan, "Scalable keyword search on large RDF data," IEEE Transaction on Knowledge and Data Engineering, Vol.26, No.11, pp.2774-2788, 2014.   DOI
12 A. Angel and N. Koudas, "Efficient diversity-aware search," Proc. of ACM SIGMOD Conference, pp.781-792, 2011.
13 SPARQL Query Language for RDF. http://www.w3.org/TR/rdf-sparql-query/
14 Cypher Query Language. https://neo4j.com/developer/cypher-query-language/
15 M. Kargar, A. An, and X. Yu, "Efficient duplication free and minimal keyword search in graphs," IEEE Trans. on Knowledge and Data Engineering, Vol.26, No.7, pp.1657-1669, 2014.   DOI
16 M. Zhong, Y. Wang, and Y. Zhu, "Coverage-oriented diversification of keyword search results on graphs," Proc. of Int'l Conference on Database Systems for Advanced Applications, pp.166-183, 2018.
17 A. Dass and D. Theodoratos, "Trading off popularity for diversity in the results sets of keyword queries on linked data," Proc. of Int'l Conference on Web Engineering, pp.151-170, 2017.
18 D. Rafiei, K. Bharat, and A. Shukla, "Diversifying web search results," Proc. of the 19th Int'l Conference on WWW, pp.781-790, 2010.
19 G. Capannini, F. M. Nardini, R. Perego, and F. Silvestri, "Efficient diversification of web search results," Proc. of the VLDB Endowment, Vol.4, pp.451-459, 2011.   DOI
20 C. N. Ziegler, S. M. McNee, J. A. Konstan, and G. Lausen, "Improving recommendation lists through topic diversification," Proc. of the 14th Int'l Conference on WWW, pp.22-32, 2005.