Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2008.15-D.1.15

Frequent Patterns Mining using only one-time Database Scan  

Chai, Duck-Jin (충북대학교 BK21)
Jin, Long (한국전자통신연구원)
Lee, Yong-Mi (충북대학교 전자계산학과)
Hwang, Bu-Hyun (전남대학교 전산학과)
Ryu, Keun-Ho (충북대학교 전기전자 및 컴퓨터공학부)
Abstract
In this paper, we propose an efficient algorithm using only one-time database scan. The proposed algorithm creates the bipartite graph which indicates relationship of large items and transactions including the large items. And then we can find large itemsets using the bipartite graph. The bipartite graph is generated when database is scanned to find large items. We can't easily find transactions which include large items in the large database. In the bipartite graph, large items and transactions are linked each other. So, we can trace the transactions which include large items through the link information. Therefore the bipartite graph is a indexed database which indicates inclusion relationship of large items and transactions. We can fast find large itemsets because proposed method conducts only one-time database scan and scans indexed the bipartite graph. Also, it don't generate candidate itemsets.
Keywords
Data Mining; Association Rule; Bipartite Graph;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Srikant, Q. Vu and R. Agrawal, 'Mining association rules with item constraints,' In Proceedings of the Knowledge Discovery and Data Mining, pp.67-73, August, 1997
2 R. Agrawal, C. Aggarwal and V. V. V. Prasad, 'A tree projection algorithm for generation of frequent itemsets,' In Journal of Parallel and Distributed Computing, Volume 61, Issue 3, pp.350-371, March, 2001   DOI   ScienceOn
3 S. Sarawagi, S. Thomas and R. Agrawal, 'Integrating association rule mining with relational database systems: Alternatives and implications,' In Proceedings of the ACM SIGMOD, pp. 343-354, June, 1998
4 J. Han, J. Pei and Y. Yin, 'Mining frequent patterns without candidate generation,' In Proceedings of the ACM SIGMOD, pp.1-12, June, 2000
5 R. Agrawal, T. Imielinski, and A. Swami, 'Mining Association Rules between Sets of Items in Large Databases,' In Proceedings of the ACM SIGMOD, Washington D.C., pp.207-216, May, 1993
6 R. Agrawal, R. Srikant, 'Fast Algorithms for Mining Association Rules in Large Databases,' In Proceedings of the VLDB, Santiago, Chile, pp.487-499, September, 1994
7 G. Grahne, L. Lakshmanan and X. Wang, 'Efficient mining of constrained correlated sets,' In Proceedings of the ICDE, pp.512-521, February, 2000
8 B. Liu, W. Hsu and Y. Ma, 'Mining association rules with multiple minimum supports,' In Proceedings of the ACM SIGKDD, pp.337-341, August, 1999
9 M. Klemettinen, h. Mannila, P. Ronkainen, h. Toivonen and A.I. Verkamo, 'Finding interesting rules from large sets of discovered association rules,' In Proceedings of the CIKM, pp.401-408, November, 1994
10 B. Lent, A. Swami and J. Widom, 'Clustering association rules,' In Proceedings of the ICDE, pp.220-231, April, 1997
11 R. Ng, L. V. S. Lakshmanan, J. Han and A. Pang, 'Exploratory mining and pruning optimizations of constrained associations rules,' In Proceedings of the ACM SIGMOD, pp.13-24, June, 1998
12 J.S. Park, M.-S. Chen, and P.S. Yu, 'An Effective Hash-Based Algorithm for Mining Association Rules,' In Proceedings of the ACM SIGMOD, pp.175-186, May, 1995
13 A. Savasere, E. Omiecinski and S. Navathe, 'An efficient algorithm for mining association rules in large databases,' In Proceedings of the VLDB, pp.432-444, September, 1995