DOI QR코드

DOI QR Code

Negatively attributable and pure confidence for generation of negative association rules

음의 연관성 규칙 생성을 위한 음의 기여 순수 신뢰도의 제안

  • Received : 2012.08.08
  • Accepted : 2012.09.03
  • Published : 2012.09.30

Abstract

The most widely used data mining technique is to explore association rules. This technique has been used to find the relationship between items in a massive database based on the interestingness measures such as support, confidence, lift, etc. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control.In general, association rule technique generates the rule, 'If A, then B.', whereas negative association rule technique generates the rule, 'If A, then not B.', or 'If not A, then B.'. We can determine whether we promote other products in addition to promote its products only if we add negative association rules to existing association rules. In this paper, we proposed the negatively attributable and pure confidence to overcome the problems faced by negative association rule technique, and then we checked three conditions for interestingness measure. The comparative studies with negative confidence, negatively pure confidence, and negatively attributable and pure confidence are shown by numerical examples. The results show that the negatively attributable and pure confidence is better than negative confidence and negatively pure confidence.

데이터 마이닝 기법들 중에서 가장 많이 활용되고 있는 연관성 규칙은 방대한 데이터베이스에서 항목간의 관계를 흥미도 측도에 의해 명확히 수치화함으로써 그들간의 관련성을 표시해주는 기법이다. 양의 연관성 규칙 마이닝이 임의의 한 항목이 발생하면 다른 항목도 발생한다는 규칙을 생성하기 위한 기법인 반면에, 음의 연관성 규칙은 어느 항목이 발생하면 다른 항목은 발생하지 않는다는 규칙을 찾아내는 기법이다. 음의 연관성 규칙은 양의 연관성 규칙의 활용과 마찬가지로 고객의 구매 경향 및 마케팅 정책을 제시할 수 있고 교차판매와 매장 진열 등과 같이 타겟 마케팅에 활용 가능하다. 양의 연관성 규칙에 음의 연관성 규칙을 추가하게 되면 어떤 제품을 판매하기 위해서는 그 제품만 마케팅 하는 것뿐만 아니라 더 나아가 그 제품이 아닌 어느 제품을 마케팅 하는것이 필요한지를 판단할 수 있다. 본 논문에서는 기존의 음의 신뢰도의 단점을 보완할 수 있는 음의 기여 순수 신뢰도를 제안한 후, 이에 대해 흥미도 측도가 가져야 할 조건들을 조사하였으며, 예제 데이터를 활용하여 음의 기여 순수 신뢰도의 유용성을 고찰하였다.

Keywords

References

  1. Agrawal, R., Imielinski, R. and Swami, A. (1993). Mining association rules between sets of items in large databases. Proceedings of the ACM SIGMOD Conference on Management of Data, 207-216.
  2. Bala, P. K. (2009). A technique for mining negative association rules. Proceedings of the 2nd Bangalore Annual Compute Conference, 23-23.
  3. Cho, K. H. and Park, H. C. (2011). Discovery of insignificant association rule s using external variable. Journal of the Korean Data Analysis Society, 13, 1343-1352.
  4. Choi, J. H. and Park, H. C. (2008). Comparative study of quantitative data binning methods in association rule. Journal of the Korean Data & Information Science Society, 19, 903-910.
  5. Han, J. and Fu, Y. (1999). Mining multiple-level association rules in large databases. IEEE Transactions on Knowledge and Data Engineering, 11, 68-77.
  6. Han, J. and Kamber, M. (2006). Data mining : Concepts and techniques, Morgam Kaufmann, USA.
  7. Han, J., Pei, J. and Yin, Y. (2000). Mining frequent patterns without candidate generation. Proceedings of ACM SIGMOD Conference on Management of Data, 1-12.
  8. Hwang, J. and Kim, J. (2003). Target marketing using inverse association rule. Journal of Intelligence and Information Systems, 9, 195-209.
  9. Lee, J., Park, S., Kang, Y., Park, S. and Lee, J. (2003). Finding negative association rules with Boolean analyzer. Proceedings of the Korean Institute of Information Scientists and Engineers, 30, 187-189.
  10. Lim, J., Lee, K. and Cho, Y. (2010). A study of association rule by considering the frequency. Journal of the Korean Data & Information Science Society, 21, 1061-1069.
  11. Liu, B., Hsu, W. and Ma, Y. (1999). Mining association rules with multiple minimum supports. Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, 337-241.
  12. Park, H. C. (2010a). Weighted association rules considering item RFM scores. Journal of the Korean Data & Information Science Society, 21, 1147-1154.
  13. Park, H. C. (2010b). Standardization for basic association measures in association rule mining. Journal of the Korean Data & Information Science Society, 21, 891-899.
  14. Park, H. C. (2011a). Proposition of negatively pure association rule threshold. Journal of the Korean Data & Information Science Society, 22, 179-188.
  15. Park, H. C. (2011b). The proposition of attributably pure confidence in association rule mining. Journal of the Korean Data & Information Science Society, 22, 235-243.
  16. Park, H. C. (2011c). The application of some similarity measures to association rule thresholds. Journal of the Korean Data Analysis Society, 13, 1331-1342.
  17. Pei, J., Han, J. and Mao, R. (2000). CLOSET: An efficient algorithm for mining frequent closed itemsets. Proceedings of ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 21-30.
  18. Piatetsky-Shapiro, G. (1991). Discovery, analysis and presentation of strong rules. Knowledge Discovery in Databases, AAAI/MIT Press, 229-248.
  19. Saygin Y., Vassilios S. V. and Clifton C.(2002). Using unknowns to prevent discovery of association rules. Proceedings of 2002 Conference on Research Issues in Data Engineering, 45-54.
  20. Shang, S., Dong, X., Geng, R. and Zhao, L. (2008). Mining negative association rules in multi-database. Proceedings of the Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 596- 599.
  21. Sim, A., Indrawan, M. and Srinivasan, B. (2008). The importance of negative associations and the discovery of association rule pairs. International Journal of Business Intelligence and Data Mining, 3, 158-176. https://doi.org/10.1504/IJBIDM.2008.020517

Cited by

  1. Evaluation on validity of health literacy measurement scale vol.24, pp.2, 2013, https://doi.org/10.7465/jkdi.2013.24.2.257
  2. The development of symmetrically and attributably pure confidence in association rule mining vol.25, pp.3, 2014, https://doi.org/10.7465/jkdi.2014.25.3.601
  3. The proposition of cosine net confidence in association rule mining vol.25, pp.1, 2014, https://doi.org/10.7465/jkdi.2014.25.1.97