• Title/Summary/Keyword: Kolmogorov-smirnov statistic

Search Result 28, Processing Time 0.023 seconds

Modified Test Statistic for Identity of Two Distribution on Credit Evaluation (신용평가에서 두 분포의 동일성 검정에 대한 수정통계량)

  • Hong, C.S.;Park, H.S.
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.237-248
    • /
    • 2009
  • The probability of default on the credit evaluation study is represented as a linear combination of two distributions of default and non-default, and the distribution of the probability of default are generally known in most cases. Except the well-known Kolmogorov-Smirnov statistic for testing the identity of two distribution, Kuiper, Cramer-Von Mises, Anderson-Darling, and Watson test statistics are introduced in this work. Under the assumption that the population distribution is known, modified Cramer-Von Mises, Anderson-Darling, and Watson statistics are proposed. Based on score data generated from various probability density functions of the probability of default, the modified test statistics are discussed and compared.

Nonparametric homogeneity tests of two distributions for credit rating model validation (신용평가모형에서 두 분포함수의 동일성 검정을 위한 비모수적인 검정방법)

  • Hong, Chong-Sun;Kim, Ji-Hoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.2
    • /
    • pp.261-272
    • /
    • 2009
  • Kolmogorov-Smirnov (K-S) statistic has been widely used for testing homogeneity of two distributions in the credit rating models. Joseph (2005) used K-S statistic to obtain validation criteria which is most well-known. There are other homogeneity test statistics such as the Cramer-von Mises, Anderson-Darling, and Watson statistics. In this paper, these statistics are introduced and applied to obtain criterion of these statistics by extending Joseph (2005)'s work. Another set of alternative criterion is suggested according to various sample sizes, type a error rates, and the ratios of bads and goods by using the simulated data under the similar situation as real credit rating data. We compare and explore among Joseph's criteria and two sets of the proposed criterion and discuss their applications.

  • PDF

Goodness-of-Fit Test for the Normality based on the Generalized Lorenz Curve

  • Cho, Youngseuk;Lee, Kyeongjun
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.4
    • /
    • pp.309-316
    • /
    • 2014
  • Testing normality is very important because the most common assumption is normality in statistical analysis. We propose a new plot and test statistic to goodness-of-fit test for normality based on the generalized Lorenz curve. We compare the new plot with the Q-Q plot. We also compare the new test statistic with the Kolmogorov-Smirnov (KS), Cramer-von Mises (CVM), Anderson-Darling (AD), Shapiro-Francia (SF), and Shapiro-Wilks (W) test statistic in terms of the power of the test through by Monte Carlo method. As a result, new plot is clearly classified normality and non-normality than Q-Q plot; in addition, the new test statistic is more powerful than the other test statistics for asymmetrical distribution. We check the proposed test statistic and plot using Hodgkin's disease data.

Testing Goodness of Fit in Nonparametric Function Estimation Techniques for Proportional Hazards Model

  • Kim, Jong-Tae
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.2
    • /
    • pp.435-444
    • /
    • 1997
  • The objective of this study is to investigate the problem of goodness of fit testing based on nonparametric function estimation techniques for the random censorship model. The small and large sample properties of the proposed test, $E_{mn}$, were investigated and it is shown that under the proportional hazard model $E_{mn}$ has higher power compared to the powers of the Kolmogorov -Smirnov, Kuiper, Cramer-von Mises, and analogue of the Cramer-von Mises type test statistic.

  • PDF

Confidence Intervals for Distribution Function

  • Choi, J.R.;Kang, M.K.;Chu, I.S.
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.1
    • /
    • pp.311-315
    • /
    • 1997
  • In this note we consider confidence interval based on Kolmogorov-Smirnov statistic. In order to obtain confidence interval we need percentage points of the statistics. Bootstrap method is examined whether it is useful to determine the points. It is concluded that the method is useful for observations with many ties, whereas it gives less conserbative points for continuous distributions.

  • PDF

Likelihood ratio in estimating Chi-square parameter

  • Rahman, Mezbahur
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.3
    • /
    • pp.587-592
    • /
    • 2009
  • The most frequent use of the chi-square distribution is in the area of goodness-of-t of a distribution. The likelihood ratio test is a commonly used test statistic as the maximum likelihood estimate in statistical inferences. The recently revised versions of the likelihood ratio test statistics are used in estimating the parameter in the chi-square distribution. The estimates are compared with the commonly used method of moments and the maximum likelihood estimate.

  • PDF

Likelihood ratio in estimating gamma distribution parameters

  • Rahman, Mezbahur;Muraduzzaman, S. M.
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.2
    • /
    • pp.345-354
    • /
    • 2010
  • The Gamma Distribution is widely used in Engineering and Industrial applications. Estimation of parameters is revisited in the two-parameter Gamma distribution. The parameters are estimated by minimizing the likelihood ratios. A comparative study between the method of moments, the maximum likelihood method, the method of product spacings, and minimization of three different likelihood ratios is performed using simulation. For the scale parameter, the maximum likelihood estimate performs better and for the shape parameter, the product spacings estimate performs better. Among the three likelihood ratio statistics considered, the Anderson-Darling statistic has inferior performance compared to the Cramer-von-Misses statistic and the Kolmogorov-Smirnov statistic.

On the Small Sample Distribution and its Consistency with the Large Sample Distribution of the Chi-Squared Test Statistic for a Two-Way Contigency Table with Fixed Margins (주변값이 주어진 이원분할표에 대한 카이제곱 검정통계량의 소표본 분포 및 대표본 분포와의 일치성 연구)

  • Park, Cheol-Yong;Choi, Jae-Sung;Kim, Yong-Gon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.11 no.1
    • /
    • pp.83-90
    • /
    • 2000
  • The chi-squared test statistic is usually employed for testing independence of two categorical variables in a two-way contingency table. It is well known that, under independence, the test statistic has an asymptotic chi-squared distribution under multinomial or product-multinomial models. For the case where both margins fixed, the sampling model of the contingency table is a multiple hypergeometric distribution and the chi-squared test statistic follows the same limiting distribution. In this paper, we study the difference between the small sample and large sample distributions of the chi-squared test statistic for the case with fixed margins. For a few small sample cases, the exact small sample distribution of the test statistic is directly computed. For a few large sample sizes, the small sample distribution of the statistic is generated via a Monte Carlo algorithm, and then is compared with the large sample distribution via chi-squared probability plots and Kolmogorov-Smirnov tests.

  • PDF

Comparison of several criteria for ordering independent components (독립성분의 순서화 방법 비교)

  • Choi, Eunbin;Cho, Sulim;Park, Mira
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.889-899
    • /
    • 2017
  • Independent component analysis is a multivariate approach to separate mixed signals into original signals. It is the most widely used method of blind source separation technique. ICA uses linear transformations such as principal component analysis and factor analysis, but differs in that ICA requires statistical independence and non-Gaussian assumptions of original signals. PCA have a natural ordering based on cumulative proportion of explained variance; howerver, ICA algorithms cannot identify the unique optimal ordering of the components. It is meaningful to set order because major components can be used for further analysis such as clustering and low-dimensional graphs. In this paper, we compare the performance of several criteria to determine the order of the components. Kurtosis, absolute value of kurtosis, negentropy, Kolmogorov-Smirnov statistic and sum of squared coefficients are considered. The criteria are evaluated by their ability to classify known groups. Two types of data are analyzed for illustration.

Classical and Bayesian methods of estimation for power Lindley distribution with application to waiting time data

  • Sharma, Vikas Kumar;Singh, Sanjay Kumar;Singh, Umesh
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.193-209
    • /
    • 2017
  • The power Lindley distribution with some of its properties is considered in this article. Maximum likelihood, least squares, maximum product spacings, and Bayes estimators are proposed to estimate all the unknown parameters of the power Lindley distribution. Lindley's approximation and Markov chain Monte Carlo techniques are utilized for Bayesian calculations since posterior distribution cannot be reduced to standard distribution. The performances of the proposed estimators are compared based on simulated samples. The waiting times of research articles to be accepted in statistical journals are fitted to the power Lindley distribution with other competing distributions. Chi-square statistic, Kolmogorov-Smirnov statistic, Akaike information criterion and Bayesian information criterion are used to access goodness-of-fit. It was found that the power Lindley distribution gives a better fit for the data than other distributions.