Browse > Article
http://dx.doi.org/10.5392/JKCA.2022.22.08.033

Approximate Top-k Labeled Subgraph Matching Scheme Based on Word Embedding  

Choi, Do-Jin (창원대학교 컴퓨터공학과)
Oh, Young-Ho (충북대학교 빅데이터협동과정)
Bok, Kyoung-Soo (원광대학교 인공지능융합학과)
Yoo, Jae-Soo (충북대학교 정보통신공학부)
Publication Information
Abstract
Labeled graphs are used to represent entities, their relationships, and their structures in real data such as knowledge graphs and protein interactions. With the rapid development of IT and the explosive increase in data, there has been a need for a subgraph matching technology to provide information that the user is interested in. In this paper, we propose an approximate Top-k labeled subgraph matching scheme that considers the semantic similarity of labels and the difference in graph structure. The proposed scheme utilizes a learning model using FastText in order to consider the semantic similarity of a label. In addition, the label similarity graph(LSG) is used for approximate subgraph matching by calculating similarity values between labels in advance. Through the LSG, we can resolve the limitations of the existing schemes that subgraph expansion is possible only if the labels match exactly. It supports structural similarity for a query graph by performing searches up to 2-hop. Based on the similarity value, we provide k subgraph matching results. We conduct various performance evaluations in order to show the superiority of the proposed scheme.
Keywords
Labeled Graph; Approximate Subgraph Matching; Top-k Matching; Learning Model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 X. Shan, G. Wang, L. Ding, B. Song, and Y. Xu, "Top-k Subgraph Query Based on Frequent Structure in Large-Scale Dynamic Graphs," IEEE Access, Vol.6, pp.78471-78482, 2018.   DOI
2 R. Kaur and S. Singh, "A Comparative Analysis of Structural Graph Metrics to Identify Anomalies in Online Social Networks," Computers & Electrical Engineering, Vol.57, pp.294-310, 2017.   DOI
3 A. B. Sonmez and T. Can, "Comparison of Tissue/disease Specific Integrated Networks using Directed Graphlet Signatures," BMC bioinformatics Vol.18, No.S-4, pp.41-50, 2017.   DOI
4 J. Calle-Gomez, J. Rivero, D. Cuadra, and P. Isasi, "Extending ACO for Fast Path Search in Huge Graphs and Social Networks," Expert Systems with Applications, Vol.86, pp.292-306, 2017.   DOI
5 S. Dutta, P. Nayek, and A. Bhattacharya, "Neighbor-Aware Search for Approximate Labeled Graph Matching using the Chi-Square Statistics," Proc. International Conference on World Wide Web, pp.1281-1290, 2017.
6 K. Kim, I. Seo, W. Han, J. Lee, S. Hong, H. Chafi, H. Shin, and G. Jeong, "Turboflux: A Fast Continuous Subgraph Matching System for Streaming Graph Data," Proc. International Conference on Management of Data, pp.411-426, 2018.
7 Y. Tian and J. M. Patel, "Tale: A Tool for Approximate Large Graph Matching," Proc. International Conference on Data Engineering, pp.963-972, 2008.
8 A. Sinha, X. Shen, Y. Song, H. Ma, D. Eide, B. P. Hsu, and K. Wang, "An Overview of Microsoft Academic Service (MAS) and Applications," Proc. International Conference on World Wide Web Companion, pp.243-246, 2015.
9 A. Khan, Y. Wu, C. C. Aggarwal, and X. Yan, "Nema: Fast Graph Search with Label Similarity," Proc. of the VLDB Endowment, Vol.6, No.3, pp.181-192, 2013.   DOI
10 H. Yu and D. Yuan, "Subgraph Search in Large Graphs with Result Diversification," Proc. SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp.1046-1054, 2014.
11 X. Shan, C. Jia, L. Ding, X. Ding, B. Song, "Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs," Information, Vol.10, No.2, p.61, 2019.   DOI
12 B. Du, S. Zhang, N. Cao, and H. Tong, "First: Fast Interactive Attributed Subgraph Matching," Proc. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.1447-1456, 2017.
13 A. Salamanis, D. D. Kehagias, C. K. Filelis-Papadopoulos, D. Tzovaras, and G. A. Gravvanis, "Managing Spatial Graph Dependencies in Large Volumes of Traffic Data for Travel-Time Prediction," IEEE Transactions on Intelligent Transportation Systems, Vol.17, No.6, pp.1678-1687, 2016.   DOI
14 L. Liu, Lihui, B. Du, and H. Tong. "G-Finder: Approximate Attributed Subgraph Matching," Proc. IEEE International Conference on Big Data, pp.513-522, 2019.
15 W. Chen, J. Liu, Z. Chen, X. Tang, and K. Li, "PBSM: An Efficient Top-K Subgraph Matching Algorithm," International Journal of Pattern Recognition and Artificial Intelligence, Vol.32, No.6, pp.1-23, 2018.