Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2012.19D.2.139

A Method for Frequent Itemsets Mining from Data Stream  

Seo, Bok-Il (전남대학교 전자컴퓨터공학부)
Kim, Jae-In (전남대학교 전자컴퓨터공학부)
Hwang, Bu-Hyun (전남대학교 전자컴퓨터공학부)
Abstract
Data Mining is widely used to discover knowledge in many fields. Although there are many methods to discover association rule, most of them are based on frequency-based approaches. Therefore it is not appropriate for stream environment. Because the stream environment has a property that event data are generated continuously. it is expensive to store all data. In this paper, we propose a new method to discover association rules based on stream environment. Our new method is using a variable window for extracting data items. Variable windows have variable size according to the gap of same target event. Our method extracts data using COBJ(Count object) calculation method. FPMDSTN(Frequent pattern Mining over Data Stream using Terminal Node) discovers association rules from the extracted data items. Through experiment, our method is more efficient to apply stream environment than conventional methods.
Keywords
Data Mining; Data Stream; Frequent Itemsets; Real-time Mining;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Pang-Ning Tan, Michael Steinbach, Vipin Kumar 저, 용환승, 나연묵, 박종수 역, "데이터 마이닝".
2 K.Y.Huang, C.H.Chang, "SMCA: A General Model for Mining Asynchronous Periodic Patterns in Temporal Databases.", IEEE Transactions on Knowledge and Data Engineering, Vol.17, No.6, 2005, June.
3 S.Laxman, P.S.Sastry, and K.Unnikrishnan, "Discovering frequent generalized episodes when events persist for different durations.", IEEE Transactions on Knowledge and Data Engineering Vol.19, 2007, Sep.
4 H. Mannila, H. Toivonen and A. I. Verkamo, "Discovering frequent episodes in sequences.", proc. of International Conference on Knowledge Discovery and Data Mining(KDD-95), 1995.
5 W.Pijls and R.Potharst, "Classification and target group selection based upon frequent patterns.", Proc. of Twelfth Belgium-netherlands artificial intelligence Conference (BNAIC00), pp.125-132, 2000.
6 H.Jin, J.Chen, H.He, C.Kelman, D.McAullay, and C.M.O'Keefe, "Signaling Potential Adverse Drug Reactions from Administrative Health Databases.", IEEE Transactions on Knowledge and Data Engineering, Vol.22, No.6, 2010, August.
7 N. Srinivasa, Q. Jiang, and L. G. Barajas, "High-Impact Event Prediction by Temporal Data Mining Through Genetic Algorithms.", proc. of 4th IEEE International Conference on Natual Computation, 2008.
8 C. F. Ahmed, S. K. Tanbeer, and B.S. Jeong, "Efficient Mining of Weighted Frequent Patterns Over Data Streams", 11th IEEE International Conference on High Performance Computing and Communications", 2009.
9 G. Chen, X. Wu, and X. Zhu, "Mining Sequential Patterns Across Data Streams," Univ. of Vermont Computer Science Technical Report(CS-05-04), 2005(3).
10 J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M.Hsu, "Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach," IEEE Transactions on Knowledge and Data Engineering, Vol.16, No.11, 2004(11).
11 C. K. S. Leung, B. Hao, "Mining of Frequent Itemsets from Streams of Uncertain Data", IEEE International Conference on Data Engineering, 2009.
12 R. Agrawal, R. Srikant, "Mining Sequential Patterns.", Proc. of 11th International Conference on Data Engineering, ICDE, pp.3-14, 1995.
13 J. Han, J. Pei, Y. Yin, and R. Mao, "Mining frequent patterns without candidate generation: a frequent pattern tree approach.", Data Mining and Knowledge Discovery, Vol.8, pp.53-87, 2004.   DOI   ScienceOn
14 R. Agrawal, T. Imielinski, and A. Swami, "Mining association rules between sets of items in large databases", proc. 12th ACM SIGMOD Int'l Conf. on Management of Data, pp.207-216, 1993.
15 R. Agrawal, R. Srikant, "Fast algorithms for mining association rules." , The VLDB Conference, Santiago, Chile, 1994. Sep.
16 Minos N. Garofalakis, Rajeev Rastogi, and Kyuseok Shim, "SPRIT: Sequential Pattern Mining with Regular Expression Constraints.", Proceedings of the 25th VLDB Conference, Edinburgh, Scotland, pp.223-234, 1999.