• Title/Summary/Keyword: 측도함수

Search Result 54, Processing Time 0.018 seconds

Index of union and other accuracy measures (Index of Union와 다른 정확도 측도들)

  • Hong, Chong Sun;Choi, So Yeon;Lim, Dong Hui
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.4
    • /
    • pp.395-407
    • /
    • 2020
  • Most classification accuracy measures for optimal threshold are divided into two types: one is expressed with cumulative distribution functions and probability density functions, the other is based on ROC curve and AUC. Unal (2017) proposed the index of union (IU) as an accuracy measure that considers two types to get them. In this study, ten kinds of accuracy measures (including IU) are divided into six categories, and the advantages of the IU are studied by comparing the measures belonging to each category. The optimal thresholds of these measures are obtained by setting various normal mixture distributions; subsequently, the first and second type of errors as well as the error sums corresponding to each threshold are calculated. The properties and characteristics of the IU statistic are explored by comparing the discriminative power of other accuracy measures based on error values.The values of the first type error and error sum of IU statistic converge to those of the best accuracy measures of the second category as the mean difference between the two distributions increases. Therefore, IU could be an accuracy measure to evaluate the discriminant power of a model.

범주형 자료에서 연관성 측도들의 비교 분석

  • 홍종선;임한승
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.3
    • /
    • pp.645-661
    • /
    • 1997
  • 연속형 변수들의 상관관계와 범주형 변수들의 연관성 측도들을 비교 연구하였다. 이 연구를 위하여 연속형 변수들이며 +1에서 -1까지 완벽한 상관관계를 갖고 있는 2 변량 정규분포를 이용하여 2$\times$2 분할표와 확장하여 일반적인 I$\times$J 분할표를 대신하는 3$\times$3 분할표를 생성하였다. 2 차원 분할표에서 정의된 연관성 측도들을 구하여 논의하였는데 2$\times$2 분할표에서는 교차적비 $\alpha$ 통계량과 교차적비의 함수로 표현되는 Yule [1912]의 Q와 Y의 통계량 그리고 상관계수 R 통계량과 R 통계량의 함수인 P 통계량을 설명하고 생성된 분할표에서 구한 통계량값을 분석하였으며, 3$\times$3 분할표에서는 Pearson의 독립성 검정통계량 $X^2$의 함수로 표현되는 P. T. V 통계량과 Goodman과 Kruskal [1954]의 $\lambda_{C/R}$통계량과 Light와 Margolin [1971]의 $\tau_{R/C}$ 통계량을 설명하고 그 값들을 Pearson의 상관계수와 비교 분석하였다.

  • PDF

Evaluation of Uncertainty Importance Measure for Monotonic Function (단조함수에 대한 불확실성 중요도 측도의 평가)

  • Cho, Jae-Gyeun
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.15 no.5
    • /
    • pp.179-185
    • /
    • 2010
  • In a sensitivity analysis, an uncertainty importance measure is often used to assess how much uncertainty of an output is attributable to the uncertainty of an input, and thus, to identify those inputs whose uncertainties need to be reduced to effectively reduce the uncertainty of output. A function is called monotonic if the output is either increasing or decreasing with respect to any of the inputs. In this paper, for a monotonic function, we propose a method for evaluating the measure which assesses the expected percentage reduction in the variance of output due to ascertaining the value of input. The proposed method can be applied to the case that the output is expressed as linear and nonlinear monotonic functions of inputs, and that the input follows symmetric and asymmetric distributions. In addition, the proposed method provides a stable uncertainty importance of each input by discretizing the distribution of input to the discrete distribution. However, the proposed method is computationally demanding since it is based on Monte Carlo simulation.

Fuzzy Measures Defined by the Semi-Normed Fuzzy Integrals (준 노름 퍼지 적분에 의해 정의된 퍼지 측도)

  • Kim, Mi-Hye;Lee, Soon-Seok
    • The Journal of the Korea Contents Association
    • /
    • v.2 no.4
    • /
    • pp.99-103
    • /
    • 2002
  • In this paper, we investigate for how to define a fuzzy measure by using the semi-normed fuzzy integral of a given measurable function with respect to another given fuzzy measure when t-seminorm is continuous. Let (X, F, g) be a fuzzy measure space, h$\in$L$^\circ$(X), and $\top$ be a continuous t-seminorm.. Then the set function $\nu$ defined by $\nu$(A)=$\int _A$h$\top$g for any $A\in$F is a fuzzy measure on (X, F).

  • PDF

Optimal threshold using the correlation coefficient for the confusion matrix (혼동행렬의 상관계수를 이용한 최적분류점)

  • Hong, Chong Sun;Oh, Se Hyeon;Choi, Ye Won
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.77-91
    • /
    • 2022
  • The optimal threshold estimation is considered in order to discriminate the mixture distribution in the fields of Biostatistics and credit evaluation. There exists well-known various accuracy measures that examine the discriminant power. Recently, Matthews correlation coefficient and the F1 statistic were studied to estimate optimal thresholds. In this study, we explore whether these accuracy measures are appropriate for the optimal threshold to discriminate the mixture distribution. It is found that some accuracy measures that depend on the sample size are not appropriate when two sample sizes are much different. Moreover, an alternative method for finding the optimal threshold is proposed using the correlation coefficient that defines the ratio of the confusion matrix, and the usefulness and utility of this method are also discusses.

Optimal Criterion of Classification Accuracy Measures for Normal Mixture (정규혼합에서 분류정확도 측도들의 최적기준)

  • Yoo, Hyun-Sang;Hong, Chong-Sun
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.3
    • /
    • pp.343-355
    • /
    • 2011
  • For a data with the assumption of the mixture distribution, it is important to find an appropriate threshold and evaluate its performance. The relationship is found of well-known nine classification accuracy measures such as MVD, Youden's index, the closest-to-(0, 1) criterion, the amended closest-to-(0, 1) criterion, SSS, symmetry point, accuracy area, TA, TR. Then some conditions of these measures are categorized into seven groups. Under the normal mixture assumption, we calculate thresholds based on these measures and obtain the corresponding type I and II errors. We could explore that which classification measure has minimum type I and II errors for estimated mixture distribution to understand the strength and weakness of these classification measures.

Signed Hellinger measure for directional association (연관성 방향을 고려한 부호 헬링거 측도의 제안)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.2
    • /
    • pp.353-362
    • /
    • 2016
  • By Wikipedia, data mining is the process of discovering patterns in a big data set involving methods at the intersection of association rule, decision tree, clustering, artificial intelligence, machine learning. and database systems. Association rule is a method for discovering interesting relations between items in large transactions by interestingness measures. Association rule interestingness measures play a major role within a knowledge discovery process in databases, and have been developed by many researchers. Among them, the Hellinger measure is a good association threshold considering the information content and the generality of a rule. But it has the drawback that it can not determine the direction of the association. In this paper we proposed a signed Hellinger measure to be able to interpret operationally, and we checked three conditions of association threshold. Furthermore, we investigated some aspects through a few examples. The results showed that the signed Hellinger measure was better than the Hellinger measure because the signed one was able to estimate the right direction of association.

Risk Difference, Relative Risk, and Odds Ratio: A Graphic Approach (위험도차이, 상대위험률, 그리고 교차비:그래프 방법)

  • Cho Tae-Kyoung
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.163-170
    • /
    • 2006
  • The argument concerning the choice of effect measure for epidemiologic data or clinic data has been renewed. But the relationships among effect measures can be confusing if effect measures are expressed by conventional mathematical functions alone. In this article, risk difference(RD), relative risk(RR), and odds ratios(OR) for binary data are presented by radar diagram instead of mathematical functions and the relationships among them are showed using radar diagram. This radar diagram is offered flexible conceptual tool to understand effect measures, DR, RR, and OR for binary data.

A New Similarity Measure based on RMF and It s Application to Linguistic Approximation (상대적 소수 함수에 기반을 둔 새로운 유사성 측도와 언어 근사에의 응용)

  • Choe, Dae-Yeong
    • The KIPS Transactions:PartB
    • /
    • v.8B no.5
    • /
    • pp.463-468
    • /
    • 2001
  • We propose a new similarity measure based on relative membership function (RMF). In this paper, the RMF is suggested to represent the relativity between fuzzy subsets easily. Since the shape of the RMF is determined according to the values of its parameters, we can easily represent the relativity between fuzzy subsets by adjusting only the values of its parameters. Hence, we can easily reflect the relativity among individuals or cultural differences when we represent the subjectivity by using the fuzzy subsets. In this case, these parameters may be regarded as feature points for determining the structure of fuzzy subset. In the sequel, the degree of similarity between fuzzy subsets can be quickly computed by using the parameters of the RMF. We use Euclidean distance to compute the degree of similarity between fuzzy subsets represented by the RMF. In the meantime, we present a new linguistic approximation method as an application area of the proposed similarity measure and show its numerical example.

  • PDF

A Didactical Analysis on Circular Measure (호도법에 관한 교수학적 고찰)

  • Kang, Mee-Kwang
    • The Mathematical Education
    • /
    • v.50 no.3
    • /
    • pp.355-365
    • /
    • 2011
  • The purpose of this study is to provide mathematical knowledge for supporting the didactical knowledge on circular measure and radian in the high school curriculum. We show that circular measure related to arcs can be mathematically justified as an angular measure and radian is a well defined concept to be able to reconcile the values of trigonometric functions and ones of circular functions, which are real variable functions. Radian has two-fold intrinsic attributes of angular measure and arc measure on the unit circle, in particular, the latter property plays a very important role in simplifying the trigonometric derivatives. To improve students's low academic achievement in trigonometry section, the useful advantage and the background over the introduction of radian should be preferentially taught and recognized to students. We suggest some teaching plans to practice in the class of elementary and middle school for enhancing teachers' and students' understanding of radian.