Browse > Article

High Utility Pattern Mining using a Prefix-Tree  

Jeong, Byeong-Soo (경희대학교 전자정보대학 컴퓨터공학)
Ahmed, Chowdhury Farhan (경희대학교 전자정보대학 컴퓨터공학)
Lee, In-Gi (이화대학교 컴퓨터공학과)
Yong, Hwan-Seong (이화대학교 컴퓨터공학과)
Abstract
Recently high utility pattern (HUP) mining is one of the most important research issuer in data mining since it can consider the different weight Haloes of items. However, existing mining algorithms suffer from the performance degradation because it cannot easily apply Apriori-principle for pattern mining. In this paper, we introduce new high utility pattern mining approach by using a prefix-tree as in FP-Growth algorithm. Our approach stores the weight value of each item into a node and utilizes them for pruning unnecessary patterns. We compare the performance characteristics of three different prefix-tree structures. By thorough experimentation, we also prove that our approach can give performance improvement to a degree.
Keywords
High utility pattern mining; data mining; transaction frequency; transaction weighted utilization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Agrawal and R. Srikant, 'Fast algorithms for mining association rules in large databases,' Proc. of the 20th Int'l Conf on Very Large Data Bases, Sep. pp.487-499, 1994
2 Y. Liu, W.-K Liao, A. Choudhary, 'A fast high utility itemsets mining algorithm,' Proc. 1st IntI. Canf. on Utility-Based Data Mining, pp.90-99, Aug. 2005.   DOI
3 B. Barber and H.J. Hamilton, 'Extracting share frequent itemsets with infrequent subsets,' Data Mining and Knowledge Discovery, vol.7, pp.153-185, 2003   DOI   ScienceOn
4 J.-L. Koh, S.-F. Shieh, 'An efficient approach for maintaining association rules based on adjusting FP-tree structures,' Proceedings of the DASFAA' 04, pp.417-424, 2004
5 XLi, Z.-H. Deng and S. Tang, 'A fast algorithm for maintenance of association rules in incremental databases,' Advanced Data Mining and Application (ADMA 06), vol.4093, pp.56-63, Jul 2006   DOI   ScienceOn
6 A. Erwin, RP. Gopalan, N.R. Achuthan, 'CTUMine: an efficient high utility itemset mining algorithm using the pattern growth approach,' Proc. of the Seventh IEEE Int. Conf. on Computer and Information Technology (CIT'07), pp.71-76, Oct. 2007   DOI
7 Y. Liu, W.-K Liao and A. Choudhary, 'A Two phase algorithm for fast discovery of high utility of itermsets,' Proc. of the 9th Pacific-Asia Conf. on Knowledge Discovery and Data Mining(PAKDD'05), pp.689-695, May 2005
8 J. Hu and A. Mojsilovic, 'High utility pattern mining: A method for discovery of high utility item sets,' Pattern Recognition, vol.40, pp. 3317- 3324, 2007   DOI   ScienceOn
9 J. Han, J. Pei, Y. Yin and R. Mao, 'Mining frequent patterns without candidate generation: a frequent-pattern tree approach,' Data Mining and Knowledge Discovery, vol.8, pp.53-87, 2004   DOI   ScienceOn
10 C.F. Ahmed, S.K Tanbeer, B.-S. Jeong and Y.-K Lee, 'Mining high utility patterns in incremental databases,' Proc of ICUIMC, pp.653-663, Feb. 2009   DOI
11 S. Zhang, J. Zhang and C. Zhang, 'EDUA: An efficient algorithm for dynamic database mining,' Information Science, vol.177, pp.2756-2767, 2007   DOI   ScienceOn
12 Y. Liu, W.-K. Liao, A. Choudhary, 'A fast high utility itemsets mining algorithm,' Proc. 1st IntI. Conf. on Utility-Based Data Mining, pp.90-99, Aug. 2005   DOI
13 H. Yao and H. J. Hamilton, 'Mining itemset utilities from transaction databases,' Data & Knowledge Engineering, vol.59, pp.603-626, 2006   DOI   ScienceOn
14 C. K-S. Leung Q.I. Khan, Z. Li and T. Hoque 'Can'Tree: a canonical-order tree for incremental frequent-pattern mining,' Knowledge and Information Systems, vol.11, no.3, pp.287-311, 2007   DOI   ScienceOn
15 R. Agrawal, T. Imielinski and A. Swami, 'Mining association rules between sets of items in large databases,' Proc. of the 12th ACM SIGMOD Int'l Conf. on Management of Data, pp. 207-216, May 1993   DOI   ScienceOn
16 S.K. Tanbeer, C.F. Ahmed, B.-S. Jeong and Y.-K. Lee, 'CP-tree: A tree structure for single pass frequent pattern mining,' Proc. of the 12th Pacific Asia Conf. on Knowledge Discovery and Data Mining (PAKDD'08), May 2008   DOI   ScienceOn
17 F. Tao, 'Weighted association rule mining using weighted support and significant framework,' Proc. of the 9th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining, pp.661-666, 2003   DOI
18 U. Yun, 'WIS: Weighted interesting sequential pattern mining with a similar level of support and/or weight,' ETRI Journal, vol.29, no.3, pp.336-352, Jun. 2007   DOI   ScienceOn
19 XLi, Z.-H. Deng and S. Tang, 'A fast algorithm for maintenance of association rules in incremental databases,' Advanced Data Mining and Application (ADMA 06), vol.4093, pp.56-63, Jul. 2006   DOI   ScienceOn
20 H. Yao and H. J. Hamilton, 'Mining itemset utilities from transaction databases,' Data & Knowledge Engineering, vol. 59, pp.603-626, 2006   DOI   ScienceOn