Browse > Article
http://dx.doi.org/10.3745/KTSDE.2016.5.3.155

IRFP-tree: Intersection Rule Based FP-tree  

Lee, Jung-Hun (동국대학교 전산원 컴퓨터해킹보안전공)
Publication Information
KIPS Transactions on Software and Data Engineering / v.5, no.3, 2016 , pp. 155-164 More about this Journal
Abstract
For frequency pattern analysis of large databases, the new tree-based frequency pattern analysis algorithm which can compensate for the disadvantages of the Apriori method has been variously studied. In frequency pattern tree, the number of nodes is associated with memory allocation, but also affects memory resource consumption and processing speed of the growth. Therefore, reducing the number of nodes in the tree is very important in the frequency pattern mining. However, the absolute criteria which need to order the transaction items for construction frequency pattern tree has lowered the compression ratio of the tree nodes. But most of the frequency based tree construction methods adapted the absolute criteria. FP-tree is typically frequency pattern tree structure which is an extended prefix-tree structure for storing compressed frequent crucial information about frequent patterns. For construction the tree, all the frequent items in different transactions are sorted according to the absolute criteria, frequency descending order. CanTree also need to absolute criteria, canonical order, to construct the tree. In this paper, we proposed a novel frequency pattern tree construction method that does not use the absolute criteria, IRFP-tree algorithm. IRFP-tree(Intersection Rule based FP-tree). IRFP-tree is constituted with the new paradigm of the intersection rule without the use of the absolute criteria. It increased the compression ratio of the tree nodes, and reduced the tree construction time. Our method has the additional advantage that it provides incremental mining. The reported test result demonstrate the applicability and effectiveness of the proposed approach.
Keywords
Big Data Analysis; Data Mining; Frequent Pattern Analysis; FP-Tree; DRFP-Tree;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Agrawal, T. Imieliski, and A. Swami, "Mining association rules between sets of items in large databases," in Proc. ACM SIGMOD Int. Cont. Manage. Data, pp.207-216, 1993.
2 R. Agrawal and R. Srikant, "Fast algorithms for mining association rules in large databases," in Proc. Int. Conf. Very Large Data Bases, pp.487-499, 1994.
3 Jiawei Han, Jian Pei, and Yiwen Yin, "Mining Frequent Patterns without Candidate Generation," in ACM-SIGMOD, Dallas, 2000.
4 Chris Giannella, Jiawei Han, Jian Pei, Xifeng Yan, and Philip S. Yu, "Mining Frequent Patterns in Data Streams at Multiple Time Granularities; Data Mining: Next Generation Challenges and Future Directions," AAAI/MIT, 2003.
5 O. R. Zaiane and M. El-Hajj, "COFI Approach for Mining Frequent Itemsets Revisited," Proc. ACM SIGMOD Workshop Res. Issues Data Mining Knowl. Discovery, pp.70-75, 2004.
6 O. R. Zaiane and M. El-Hajj, "Cofi-tree mining: A new approach to pattern growth with reduced candidacy," in Proceedings of the IEEE ICDM Workshop on Frequent Itemset Mining Implementation(FIMI'03), 2003.
7 M. Adnan and R. Alhajj, "DRFP-tree: Disc-resident frequent pattern tree," Appl. Intell., Vol.30, No.2, pp.84-97, 2009   DOI
8 M. Adnan and R. Alhajj, "A bounded and Adaptive Memorybased Approach to Mine Frequent Patterns From Very Large Databases," IEEE Transactions on Systems, Man, and Cybernetics, Part B(2011), pp.154-172.   DOI
9 C. K. -S. Leung, Q. I. Khan, and T. Hoque, "CanTree:A Tree Structure for Efficient Incremental Mining of Frequent Patterns," Proc. IEEE Int. Conf. Data Mining, pp.274-281, 2005.
10 C. K. -S. Leung, and Q. I. Khan, "DSTree:a tree structure for the mining of frequent sets from data streams," in Proc. IEEE ICDM, pp.928-932, 2006.
11 G. Liu, H. Lu, J. X. Yu, W. Wang, and X. Xiao, "AFOPT: An efficient implementation of pattern growth approach," in Proc. FIMI, 2003.
12 Cheung, William and Osmar R. Zaiane, "Incremental mining of frequent patterns without candidate generation or support constraint," DatabaseEngineering and Applications Symposium, Proceedings. Seventh International. IEEE, pp.111-116, 2003.
13 B. Goethals, "Memory issues in frequent itemset mining," in proc. ACM SAC, pp.530-534, 2004.
14 R. Vaarandi, "A breadth-first algorithm for mining frequent patterns from event logs," in Proc. IEEE INTELLCOMM, pp.274-281, 2004.
15 G. Buehrer, S. Parthasarathy, and A. Ghoting, "Out-of-core frequent pattern mining on a commodity PC," in Proc. 12th ACM SIGKDD Int. Conf. KDD, pp.86-95, 2006.