[KSCI] Korea Science Citation Index Service

High Utility Pattern Mining using a Prefix-Tree

Jeong, Byeong-Soo (경희대학교 전자정보대학 컴퓨터공학)
Ahmed, Chowdhury Farhan (경희대학교 전자정보대학 컴퓨터공학)
Lee, In-Gi (이화대학교 컴퓨터공학과)
Yong, Hwan-Seong (이화대학교 컴퓨터공학과)

Publication Information

Journal of KIISE:Databases / v.36, no.5, 2009 , pp. 341-351 More about this Journal

Abstract

Recently high utility pattern (HUP) mining is one of the most important research issuer in data mining since it can consider the different weight Haloes of items. However, existing mining algorithms suffer from the performance degradation because it cannot easily apply Apriori-principle for pattern mining. In this paper, we introduce new high utility pattern mining approach by using a prefix-tree as in FP-Growth algorithm. Our approach stores the weight value of each item into a node and utilizes them for pruning unnecessary patterns. We compare the performance characteristics of three different prefix-tree structures. By thorough experimentation, we also prove that our approach can give performance improvement to a degree.

Keywords

High utility pattern mining; data mining; transaction frequency; transaction weighted utilization;

Citations & Related Records

Reference

1	R. Agrawal and R. Srikant, 'Fast algorithms for mining association rules in large databases,' Proc. of the 20th Int'l Conf on Very Large Data Bases, Sep. pp.487-499, 1994
2	Y. Liu, W.-K Liao, A. Choudhary, 'A fast high utility itemsets mining algorithm,' Proc. 1st IntI. Canf. on Utility-Based Data Mining, pp.90-99, Aug. 2005. DOI
3	B. Barber and H.J. Hamilton, 'Extracting share frequent itemsets with infrequent subsets,' Data Mining and Knowledge Discovery, vol.7, pp.153-185, 2003 DOI ScienceOn
4	J.-L. Koh, S.-F. Shieh, 'An efficient approach for maintaining association rules based on adjusting FP-tree structures,' Proceedings of the DASFAA' 04, pp.417-424, 2004
5	XLi, Z.-H. Deng and S. Tang, 'A fast algorithm for maintenance of association rules in incremental databases,' Advanced Data Mining and Application (ADMA 06), vol.4093, pp.56-63, Jul 2006 DOI ScienceOn
6	A. Erwin, RP. Gopalan, N.R. Achuthan, 'CTUMine: an efficient high utility itemset mining algorithm using the pattern growth approach,' Proc. of the Seventh IEEE Int. Conf. on Computer and Information Technology (CIT'07), pp.71-76, Oct. 2007 DOI
7	Y. Liu, W.-K Liao and A. Choudhary, 'A Two phase algorithm for fast discovery of high utility of itermsets,' Proc. of the 9th Pacific-Asia Conf. on Knowledge Discovery and Data Mining(PAKDD'05), pp.689-695, May 2005
8	J. Hu and A. Mojsilovic, 'High utility pattern mining: A method for discovery of high utility item sets,' Pattern Recognition, vol.40, pp. 3317- 3324, 2007 DOI ScienceOn
9	J. Han, J. Pei, Y. Yin and R. Mao, 'Mining frequent patterns without candidate generation: a frequent-pattern tree approach,' Data Mining and Knowledge Discovery, vol.8, pp.53-87, 2004 DOI ScienceOn
10	C.F. Ahmed, S.K Tanbeer, B.-S. Jeong and Y.-K Lee, 'Mining high utility patterns in incremental databases,' Proc of ICUIMC, pp.653-663, Feb. 2009 DOI
11	S. Zhang, J. Zhang and C. Zhang, 'EDUA: An efficient algorithm for dynamic database mining,' Information Science, vol.177, pp.2756-2767, 2007 DOI ScienceOn
12	Y. Liu, W.-K. Liao, A. Choudhary, 'A fast high utility itemsets mining algorithm,' Proc. 1st IntI. Conf. on Utility-Based Data Mining, pp.90-99, Aug. 2005 DOI
13	H. Yao and H. J. Hamilton, 'Mining itemset utilities from transaction databases,' Data & Knowledge Engineering, vol.59, pp.603-626, 2006 DOI ScienceOn
14	C. K-S. Leung Q.I. Khan, Z. Li and T. Hoque 'Can'Tree: a canonical-order tree for incremental frequent-pattern mining,' Knowledge and Information Systems, vol.11, no.3, pp.287-311, 2007 DOI ScienceOn
15	R. Agrawal, T. Imielinski and A. Swami, 'Mining association rules between sets of items in large databases,' Proc. of the 12th ACM SIGMOD Int'l Conf. on Management of Data, pp. 207-216, May 1993 DOI ScienceOn
16	S.K. Tanbeer, C.F. Ahmed, B.-S. Jeong and Y.-K. Lee, 'CP-tree: A tree structure for single pass frequent pattern mining,' Proc. of the 12th Pacific Asia Conf. on Knowledge Discovery and Data Mining (PAKDD'08), May 2008 DOI ScienceOn
17	F. Tao, 'Weighted association rule mining using weighted support and significant framework,' Proc. of the 9th ACM SIGKDD Int'l Conf. on Knowledge Discovery and Data Mining, pp.661-666, 2003 DOI
18	U. Yun, 'WIS: Weighted interesting sequential pattern mining with a similar level of support and/or weight,' ETRI Journal, vol.29, no.3, pp.336-352, Jun. 2007 DOI ScienceOn
19	XLi, Z.-H. Deng and S. Tang, 'A fast algorithm for maintenance of association rules in incremental databases,' Advanced Data Mining and Application (ADMA 06), vol.4093, pp.56-63, Jul. 2006 DOI ScienceOn
20	H. Yao and H. J. Hamilton, 'Mining itemset utilities from transaction databases,' Data & Knowledge Engineering, vol. 59, pp.603-626, 2006 DOI ScienceOn

KSCI

High Utility Pattern Mining using a Prefix-Tree Prefix-Tree를 이용한 높은 유틸리티 패턴 마이닝 기법

High Utility Pattern Mining using a Prefix-Tree