• 제목/요약/키워드: Pure confidence

검색결과 46건 처리시간 0.022초

The Proposition of Conditionally Pure Confidence in Association Rule Mining

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권4호
    • /
    • pp.1141-1151
    • /
    • 2008
  • Data mining is the process of sorting through large amounts of data and picking out useful information. One of the well-studied problems in data mining is the exploration of association rules. An association rule technique finds the relation among each items in massive volume database. Some interestingness measures have been developed in association rule mining. Interestingness measures are useful in that it shows the causes for pruning uninteresting rules statistically or logically. This paper propose a conditional pure confidence to evaluate association rules and then describe some properties for a proposed measure. The comparative studies with confidence and pure confidence are shown by numerical example. The results show that the conditional pure confidence is better than confidence or pure confidence.

  • PDF

The development of symmetrically and attributably pure confidence in association rule mining (연관성 규칙에서 활용 가능한 대칭적 기여 순수 신뢰도의 개발)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권3호
    • /
    • pp.601-609
    • /
    • 2014
  • The most widely used data mining technique for big data analysis is to generate meaningful association rules. This method has been used to find the relationship between set of items based on the association criteria such as support, confidence, lift, etc. Among them, confidence is the most frequently used, but it has the drawback that we can not know the direction of association by it. The attributably pure confidence was developed to compensate for this drawback, but the value was changed by the position of two item sets. In this paper, we propose four symmetrically and attributably pure confidence measures to compensate the shortcomings of confidence and the attributably pure confidence. And then we prove three conditions of interestingness measure by Piatetsky-Shapiro, and comparative studies with confidence, attributably pure confidence, and four symmetrically and attributably pure confidence measures are shown by numerical examples. The results show that the symmetrically and attributably pure confidence measures are better than confidence and the attributably pure confidence. Also the measure NSAPis found to be the best among these four symmetrically and attributably pure confidence measures.

Negatively attributable and pure confidence for generation of negative association rules (음의 연관성 규칙 생성을 위한 음의 기여 순수 신뢰도의 제안)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제23권5호
    • /
    • pp.939-948
    • /
    • 2012
  • The most widely used data mining technique is to explore association rules. This technique has been used to find the relationship between items in a massive database based on the interestingness measures such as support, confidence, lift, etc. Association rules are frequently used by retail stores to assist in marketing, advertising, floor placement, and inventory control.In general, association rule technique generates the rule, 'If A, then B.', whereas negative association rule technique generates the rule, 'If A, then not B.', or 'If not A, then B.'. We can determine whether we promote other products in addition to promote its products only if we add negative association rules to existing association rules. In this paper, we proposed the negatively attributable and pure confidence to overcome the problems faced by negative association rule technique, and then we checked three conditions for interestingness measure. The comparative studies with negative confidence, negatively pure confidence, and negatively attributable and pure confidence are shown by numerical examples. The results show that the negatively attributable and pure confidence is better than negative confidence and negatively pure confidence.

The proposition of attributably pure confidence in association rule mining (연관 규칙 마이닝에서 기여 순수 신뢰도의 제안)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권2호
    • /
    • pp.235-243
    • /
    • 2011
  • The most widely used data mining technique is to explore association rules. This technique has been used to find the relationship between each set of items based on the association thresholds such as support, confidence, lift, etc. There are many interestingness measures as the criteria for evaluating association rules. Among them, confidence is the most frequently used, but it has the drawback that it can not determine the direction of the association. The net confidence measure was developed to compensate for this drawback, but it is useless in the case that the value of positive confidence is the same as that of negative confidence. This paper propose a attributably pure confidence to evaluate association rules and then describe some properties for a proposed measure. The comparative studies with confidence, net confidence, and attributably pure confidence are shown by numerical example. The results show that the attributably pure confidence is better than confidence or net confidence.

The proposition of compared and attributably pure confidence in association rule mining (연관 규칙 마이닝에서 비교 기여 순수 신뢰도의 제안)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권3호
    • /
    • pp.523-532
    • /
    • 2013
  • Generally, data mining is the process of analyzing big data from different perspectives and summarizing it into useful information. The most widely used data mining technique is to generate association rules, and it finds the relevance between two items in a huge database. This technique has been used to find the relationship between each set of items based on the interestingness measures such as support, confidence, lift, etc. Among many interestingness measures, confidence is the most frequently used, but it has the drawback that it can not determine the direction of the association. The attributably pure confidence and compared confidence are able to determine the direction of the association, but their ranges are not [-1, +1]. So we can not interpret the degree of association operationally by their values. This paper propose a compared and attributably pure confidence to compensate for this drawback, and then describe some properties for a proposed measure. The comparative studies with confidence, compared confidence, attributably pure confidence, and a proposed measure are shown by numerical example. The results show that the a compared and attributably pure confidence is better than any other confidences.

The application for predictive similarity measures of binary data in association rule mining (이분형 예측 유사성 측도의 연관성 평가 기준 적용 방안)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권3호
    • /
    • pp.495-503
    • /
    • 2011
  • The most widely used data mining technique is to find association rules. Association rule mining is the method to quantify the relationship between each set of items in very huge database based on the association thresholds. There are some basic association thresholds to explore meaningful association rules ; support, confidence, lift, etc. Among them, confidence is the most frequently used, but it has the drawback that it can not determine the direction of the association. The net confidence and the attributably pure confidence were developed to compensate for this drawback, but they have other drawbacks.In this paper we consider some predictive similarity measures for binary data in cluster analysis and multi-dimensional analysis as association threshold to compensate for these drawbacks. The comparative studies with net confidence, attributably pure confidence, and some predictive similarity measures are shown by numerical example.

Proposition of negatively pure association rule threshold (음의 순수 연관성 규칙 평가 기준의 제안)

  • Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권2호
    • /
    • pp.179-188
    • /
    • 2011
  • Association rule represents the relationship between items in a massive database by quantifying their relationship, and is used most frequently in data mining techniques. In general, association rule technique generates the rule, 'If A, then B.', whereas negative association rule technique generates the rule, 'If A, then not B.', or 'If not A, then B.'. We can determine whether we promote other products in addition to promote its products only if we add negative association rules to existing association rules. In this paper, we proposed the negatively pure association rules by negatively pure support, negatively pure confidence, and negatively pure lift to overcome the problems faced by negative association rule technique. In checking the usefulness of this technique through numerical examples, we could find the direction of association by the sign of the negatively pure association rule measure.

Clothing Purchase Behavior according to Consumer Self-Confidence (소비자 자신감에 따른 의복구매행동)

  • Jeon, Kyung-Sook
    • Journal of the Korean Home Economics Association
    • /
    • 제45권6호
    • /
    • pp.1-9
    • /
    • 2007
  • Even though self-confidence is a personal factor of a people, it works as behavioristic factor in consumer behavior. In this study, the influence of consumer self-confidence on clothing purchase behavior was investigated. A total of 284 data sets were analyzed after collecting questionnaires from college students in Seoul using convenient sampling method. For data analysis, chi-square test, analysis of variance, reliability test and factor analysis were performed by SPSSWIN program. The results were as followed. First, the clothing purchase places were affected by the consumers' level of self-confidence. The more confident consumers preferred internet shopping and Dongdaemun market to large-scale shops. The discount stores were selected by the less confident consumers. Second, information search was one of the main reasons to visit internet shopping mall by the more confident consumers. Third, the more confident consumers showed the higher level of clothing involvement than the less confident consumers. Finally, unplanned purchases, such as pure impulse buying and reminder buying were more likely to occur by the more confident consumers with less purchase conflicts.

An Analysis on Efficiency for the Environmental Friendly Agricultural Product of Strawberry in GyeongBuk Province (경북지역 친환경딸기 농가의 인증유형에 따른 효율성 분석)

  • Lee, Sang-Ho;Song, Kyung-Hwan
    • Korean Journal of Organic Agriculture
    • /
    • 제21권4호
    • /
    • pp.487-500
    • /
    • 2013
  • The purpose of this study is to estimate efficiency of environmental-friendly agricultural product by using Data Envelopment Analysis. A proposed method employs a bootstrapping approach to generating efficiency estimates through Monte Carlo simulation resampling process. The technical efficiency, pure technical efficiency, and scale efficiency measure of strawberry by pesticide-free certification is 0.967, 0.995, 0.968 respectively. However those of bias-corrected estimates are 0.918, 0.983, 0.934. We know that the DEA estimator is an upward biased estimator. In technical efficiency, average lower and upper confidence bounds of 0.807 and 0.960. According to these results, the DEA bootstrapping model used here provides bias-corrected and confidence intervals for the point estimates, it is more preferable.

An Analysis of the Efficiency of Watermelon Using the Bootstrapping DEA Model (시설수박의 출하시기별 효율성 분석)

  • Lee, Sang-Ho
    • Korean Journal of Organic Agriculture
    • /
    • 제26권1호
    • /
    • pp.33-41
    • /
    • 2018
  • The paper aims to estimate efficiency of watermelon by using a bootstrapping approach to generating efficiency estimates through Monte Carlo simulation resampling process. We use the input-output data for watermelon 107 farmers. The main results are as follows. The estimates of efficiency depends on the methodology. The estimates of general DEA is greater than the bootstrapping method. The technical efficiency and pure technical efficiency measure of watermelon is 0.72, 0.82 respectively. However the bias-corrected estimates are less than those of DEA. We know that the DEA estimator is an upward biased estimator. According to these results, the DEA bootstrapping model used here provides bias-corrected and confidence intervals for the point estimates, it is more preferable.