[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7465/jkdi.2014.25.2.365

Comparison of confidence measures useful for classification model building

Park, Hee Chang (Department of Statistics, Changwon National University)

Publication Information

Journal of the Korean Data and Information Science Society, v.25, no.2, 2014, pp. 365-371

Abstract

Association rule mining, one of the well-studied techniques in data mining, is an exploratory data analysis method for understanding the relevance among the items in a huge database. It has been used to find relationships between sets of items based on interestingness measures such as support, confidence, lift, and similarity measures. The typical association rule technique generates the rules that satisfy minimum support and confidence values. Support and confidence are the most frequently used measures, but they have the drawback that they cannot determine the direction of an association because their values are always positive. To overcome this problem, this paper compares support, basic confidence, and three confidence measures useful for classification model building: confirmed confidence, causal confidence, and causal confirmed confidence. The results confirmed that causal confirmed confidence was the best of these measures from the viewpoint of association mining because it showed the direction of association more precisely.
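
The abstract gives no formulas, so the following is only a minimal sketch of how the compared measures might be computed for a rule X -> Y from a 2x2 contingency table. It assumes the definitions commonly given in the interestingness-measure literature: confirmed confidence as P(Y|X) - P(not Y|X), causal confidence as the average of P(Y|X) and P(not X|not Y), and causal confirmed confidence as causal confidence minus P(not Y|X). The paper's own definitions may differ, and the function name and argument names are hypothetical.

```python
def confidence_measures(n_xy, n_x_noty, n_notx_y, n_notx_noty):
    """Return support and confidence-type measures for the rule X -> Y.

    Arguments are the four cell counts of the 2x2 contingency table.
    Assumes X occurs at least once and "not Y" occurs at least once,
    so the conditional probabilities below are defined.
    """
    n = n_xy + n_x_noty + n_notx_y + n_notx_noty
    n_x = n_xy + n_x_noty            # transactions containing X
    n_noty = n_x_noty + n_notx_noty  # transactions not containing Y

    p_y_given_x = n_xy / n_x                  # basic confidence P(Y|X)
    p_noty_given_x = n_x_noty / n_x           # counterexample rate P(not Y|X)
    p_notx_given_noty = n_notx_noty / n_noty  # P(not X|not Y)

    # Assumed definition: average of the rule and its contrapositive.
    causal = 0.5 * (p_y_given_x + p_notx_given_noty)

    return {
        "support": n_xy / n,
        "confidence": p_y_given_x,
        "confirmed_confidence": p_y_given_x - p_noty_given_x,
        "causal_confidence": causal,
        "causal_confirmed_confidence": causal - p_noty_given_x,
    }


# Hypothetical counts in which X mostly occurs WITHOUT Y.
negative = confidence_measures(10, 40, 30, 20)
# Hypothetical counts in which X mostly occurs WITH Y.
positive = confidence_measures(40, 10, 20, 30)
```

With these made-up counts, support and confidence stay non-negative in both cases, while the confirmed and causal confirmed variants are negative for the first table and positive for the second. This is the sense in which such measures can indicate the direction of an association.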

Keywords

Association rule; causal confidence; causal confirmed confidence; confirmed confidence; data mining
