Browse > Article
http://dx.doi.org/10.5392/JKCA.2018.18.02.147

Incremental Frequent Pattern Detection Scheme Based on Sliding Windows in Graph Streams  

Jeong, Jaeyun (충북대학교 정보통신공학과)
Seo, Indeok (충북대학교 빅데이터학과)
Song, Heesub (충북대학교 빅데이터학과)
Park, Jaeyeol (충북대학교 정보통신공학과)
Kim, Minyeong (충북대학교 정보통신공학과)
Choi, Dojin (충북대학교 정보통신공학과)
Bok, Kyoungsoo (충북대학교 정보통신공학과)
Yoo, Jaesoo (충북대학교 정보통신공학과)
Publication Information
Abstract
Recently, with the advancement of network technologies, and the activation of IoT and social network services, many graph stream data have been generated. As the relationship between objects in the graph streams changes dynamically, studies have been conducting to detect or analyze the change of the graph. In this paper, we propose a scheme to incrementally detect frequent patterns by using frequent patterns information detected in previous sliding windows. The proposed scheme calculates values that represent whether the frequent patterns detected in previous sliding windows will be frequent in how many future silding windows. By using the values, the proposed scheme reduces the overall amount of computation by performing only necessary calculations in the next sliding window. In addition, only the patterns that are connected between the patterns are recognized as one pattern, so that only the more significant patterns are detected. We conduct various performance evaluations in order to show the superiority of the proposed scheme. The proposed scheme is faster than existing similar scheme when the number of duplicated data is large.
Keywords
Graph Analysis; Graph Stream; Frequent Pattern Detection; Incremental Processing; Sliding Window;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 유병국, 김순홍, "소셜네트워크 분석을 통한 마케팅 전략," 한국콘텐츠학회논문지, 제13권, 제5호, pp.396-407, 2013.   DOI
2 한진수, 조중권, 최도진, 임종태, 복경수, 유재수, "부하 분산을 위한 정점 절단 기반의 그래프 스트림 분할 기법," 한국정보과학회 학술발표논문, pp.206-208, 2017.
3 강필성, "사물인터넷과 빅데이터 분석 기반의 스마트공장 구현 사례 및 시사점," 한국정보화진흥원, Near & Future, Vol.20, pp.25-35, 2016.
4 C. Giannella, J. Han, J. Pei, X. Yan, and P. S. Yu, "Mining frequent patterns in data streams at multiple time granularities," Next generation data mining, pp.191-212, 2003.
5 C. K. Leung and Q. I. Khan, "DSTree: A Tree Structure for the Mining of Frequent Sets from Data Streams," Proc. International Conference on Data Mining, pp.928-932, 2006.
6 P. Braun, J. J. Cameron, A. Cuzzocrea, F. Jiang, and C. K. Leung, "Effectively and Efficiently Mining Frequent Patterns from Dense Graph Streams on Disk," Proc. International Conference in Knowledge Based and Intelligent Information and Engineering Systems, pp.338-347, 2014.
7 A. Cuzzocrea, Z. Han, F. Jiang, C. K. Leung, and H. Zhang, "Edge-based Mining of Frequent Subgraphs from Graph Streams," International Conference in Knowledge Based and Intelligent Information and Engineering Systems, pp.573-582, 2015.
8 S. K. Tanbeer, C. K. Leung, and J. J. Cameron, "Interactive Mining of Strong Friends from Social Networks and its Applications in E-Commerce," Journal of Organizational Computing and Electronic Commerce, Vol.24, No.2-3, pp.157-173, 2014.   DOI
9 서복일, 김재인, 황부현, "스트림 데이터 환경에서 배치 가중치를 이용하여 사용자 특성을 반영한 빈발항목 집합 탐사," 한국콘텐츠학회논문지, 제11권, 제1호, pp.56-64, 2011.   DOI
10 임종태, 복경수, 유재수, "대용량 그래프 환경에서 스카이라인을 이용한 서브 그래프 유사도 측정 기법," 한국콘텐츠학회 종합학술대회, pp.47-48, 2017.
11 C. C. Aggarwal, Y. Li, P. S. Yu, and R. Jin, "On dense pattern mining in graph streams," Proceedings of the VLDB Endowment, Vol.3, No.1-2, pp.975-984, 2010.   DOI
12 A. Bifet, G. Holmes, B. Pfahringer, and R. Gavalda, "Mining frequent closed graphs on evolving data streams," Proc. ACM SIGKDD international conference on Knowledge discovery and data mining, pp.591-599, 2011.
13 E. Valari, M. Kontaki, and A. N. Papadopoulos, "Discovery of top-k dense subgraphs in dynamic graph collections," Proc. International Conference on Scientific and Statistical Database Management, Springer, pp.213-230, 2012.
14 S. K. Tanbeer, F. Jiang, C. K. Leung, R. K. MacKinnon, and I. J. M. Medina, "Finding groups of friends who are significant across multiple domains in social networks," Proc. International Conference on Computational Aspects of Social Networks, pp.21-26, 2013.
15 A. Cuzzocrea, F. Furfaro, G. M. Mazzeo, and D. Sacca, "A grid framework for approximate aggregate query answering on summarized sensor network readings," Proc. OTM Workshops, pp.144-153, 2004.
16 A. Fariha, C. F. Ahmed, C. K. Leung, S. M. Abdullah, and L. Cao, "Mining frequent patterns from human interactions in meetings using directed acyclic graphs," Proc. Pacific-Asia Conference on Knowledge Discovery and Data Mining, Springer, pp.38-49, 2013.
17 F. Jiang and C. K. Leung, "Mining interesting "following" patterns from social networks," Proc. International Conference on Data Warehousing and Knowledge Discovery, Springer, pp.308-319, 2014.