• Title/Summary/Keyword: kolmogorov-smirnov test

Comprehensive comparison of normality tests: Empirical study using many different types of data

  • Lee, Chanmi;Park, Suhwi;Jeong, Jaesik
    • Journal of the Korean Data and Information Science Society / v.27 no.5 / pp.1399-1412 / 2016
  • We compare many normality tests that draw on different sources of information in the given data: the Anderson-Darling test, Kolmogorov-Smirnov test, Cramer-von Mises test, Shapiro-Wilk test, Shapiro-Francia test, Lilliefors test, Jarque-Bera test, D'Agostino's D, Doornik-Hansen test, energy test, and Martinez-Iglewicz test. For comparison, the tests are applied to various types of data generated from skewed distributions, asymmetric distributions, and distributions with different lengths of support. We then summarize the comparison results in terms of two criteria: type I error control and power. The best test depends on the shape of the data distribution, which implies that no single test is the most powerful for all distributions.
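
The two comparison criteria named in this abstract, type I error control and power, can be estimated by simulation. The following is only a minimal sketch, not the authors' study design: it uses the Jarque-Bera statistic with its asymptotic chi-square(2) critical value, and the helper names (`jarque_bera`, `rejection_rate`), sample size, replication count, and the exponential alternative are all illustrative assumptions.

```python
import math
import random


def jarque_bera(xs):
    """Jarque-Bera statistic: n/6 * (S^2 + (K - 3)^2 / 4)."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    skew = m3 / m2 ** 1.5
    kurt = m4 / m2 ** 2
    return n / 6.0 * (skew ** 2 + (kurt - 3.0) ** 2 / 4.0)


def rejection_rate(sampler, n, reps, alpha, rng):
    """Fraction of simulated samples for which normality is rejected."""
    # The chi-square(2) survival function is exp(-x/2), so the asymptotic
    # critical value at level alpha is -2 * ln(alpha).
    critical = -2.0 * math.log(alpha)
    rejections = 0
    for _ in range(reps):
        sample = [sampler(rng) for _ in range(n)]
        if jarque_bera(sample) > critical:
            rejections += 1
    return rejections / reps


rng = random.Random(0)  # fixed seed so the sketch is reproducible
# Type I error: data really are normal, so every rejection is a false alarm.
type1 = rejection_rate(lambda r: r.gauss(0.0, 1.0), n=100, reps=2000, alpha=0.05, rng=rng)
# Power: data come from a skewed (exponential) distribution.
power = rejection_rate(lambda r: r.expovariate(1.0), n=100, reps=2000, alpha=0.05, rng=rng)
print(f"estimated type I error: {type1:.3f}, estimated power: {power:.3f}")
```

Repeating the same loop with other test statistics and other alternative distributions would give the kind of table the abstract describes; the asymptotic critical value used here is only approximate for finite samples.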

Goodness-of-Fit Test for the Normality based on the Generalized Lorenz Curve

  • Cho, Youngseuk;Lee, Kyeongjun
    • Communications for Statistical Applications and Methods / v.21 no.4 / pp.309-316 / 2014
  • Testing normality is very important because normality is the most common assumption in statistical analysis. We propose a new plot and test statistic for goodness-of-fit testing of normality based on the generalized Lorenz curve. We compare the new plot with the Q-Q plot. We also compare the power of the new test statistic with that of the Kolmogorov-Smirnov (KS), Cramer-von Mises (CVM), Anderson-Darling (AD), Shapiro-Francia (SF), and Shapiro-Wilk (W) test statistics using the Monte Carlo method. As a result, the new plot distinguishes normality from non-normality more clearly than the Q-Q plot; in addition, the new test statistic is more powerful than the other test statistics for asymmetric distributions. We check the proposed test statistic and plot using Hodgkin's disease data.

Computer Programs for Nonparametric Tests (비모수적(非母數的) 통계(統計) 프로그램의 개발(開發))

  • Bae, Do-Seon;Jang, Jung-Sun;Kim, Sang-Bok
    • Journal of Korean Institute of Industrial Engineers / v.12 no.2 / pp.101-108 / 1986
  • Computer programs for the IBM PC/XT/AT or compatibles are presented for running 9 nonparametric tests. They include the sign test, Wilcoxon signed rank test, Mann-Whitney-Wilcoxon test, Kruskal-Wallis test, Kolmogorov-Smirnov one-sample and two-sample tests, Kendall and Spearman rank correlation coefficient tests, and the chi-square test for contingency tables. Each program is written in BASIC and combined into a statistical package, 'NONPARA', which is easily accessible through menu programs. The algorithms on which each test is based are also explained, and 3 examples are given.
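
For orientation, the one-sample and two-sample Kolmogorov-Smirnov statistics mentioned in this abstract can be computed in a few lines. This is a sketch in Python rather than the BASIC of the original package, and the function names `ks_one_sample` and `ks_two_sample` are illustrative, not taken from NONPARA. Only the statistics are computed; p-values and critical values are left out.

```python
from bisect import bisect_right


def ks_one_sample(sample, cdf):
    """Largest gap between the empirical CDF and a fully specified CDF."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs, start=1):
        f = cdf(x)
        # The empirical CDF jumps from (i - 1)/n to i/n at x, so both
        # sides of the jump must be compared with the hypothesized CDF.
        d = max(d, i / n - f, f - (i - 1) / n)
    return d


def ks_two_sample(a, b):
    """Largest gap between the empirical CDFs of two samples."""
    a, b = sorted(a), sorted(b)
    d = 0.0
    for x in a + b:
        fa = bisect_right(a, x) / len(a)  # share of a that is <= x
        fb = bisect_right(b, x) / len(b)  # share of b that is <= x
        d = max(d, abs(fa - fb))
    return d


# Uniform(0, 1) CDF restricted to [0, 1] for the one-sample check.
print(ks_one_sample([0.25, 0.75], lambda x: min(max(x, 0.0), 1.0)))  # 0.25
print(ks_two_sample([1, 2, 3], [4, 5, 6]))  # 1.0
```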

Comparison of Differences in Levels per Group on Math Self-Regulated Learning Factors of High School Students (고등학생의 수학 자기조절 학습 요인에 대한 집단별 수준 차이 비교)

  • Yoo, Ki Jong
    • Journal for History of Mathematics / v.34 no.1 / pp.21-37 / 2021
  • The purpose of the present study is to compare differences in levels between groups of high school students on the self-regulated learning factors for mathematics. For this purpose, a self-regulated learning measurement tool was developed and surveys were conducted. The statistical analysis used frequency analysis, the Kolmogorov-Smirnov normality test, the Mann-Whitney U test, and the Kruskal-Wallis H test. The results show that self-efficacy exhibits statistically significant differences in self-regulated learning levels regardless of the group classification, whereas test anxiety does not show statistically significant differences in self-regulated learning levels regardless of the group classification.

Determination of Probable Rainfall Intensity Formulas for Designing Storm Sewer Systems at Incheon District (우수거 설계를 위한 인천지방에서의 확률강우강도식의 산정)

  • Ahn, Tae-Jin;Kim, Kyung-Sub
    • Journal of Korean Society of Water and Wastewater / v.12 no.3 / pp.99-106 / 1998
  • This paper presents a procedure for determining the design rainfall depth and the design rainfall intensity for the Incheon city area in Korea. In this study, eight probability distributions are considered to estimate the probable rainfall depths for 11 different durations. The Kolmogorov-Smirnov test and the chi-square test are adopted to test each distribution. The probable rainfall intensity formulas are then determined by i) the least squares (LS) method, ii) the least median squares (LMS) method, iii) the reweighted least squares method based on the LMS (RLS), and iv) the constrained regression (CR) model. The Talbot, Sherman, Japanese, and Unified types are considered to determine the best type for the Incheon station. The root mean squared (RMS) errors are computed to test the formulas derived by the four methods. It is found that the Unified type is the most reliable and that all methods presented herein are acceptable for determining the coefficients of rainfall intensity formulas from an engineering point of view.
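
As an illustration of the least squares step and the RMS error check, the sketch below fits a Talbot-type formula, I = a / (t + b), by ordinary least squares on the linearized form 1/I = t/a + b/a. It is an assumption-laden example, not the paper's procedure: the function names are hypothetical, the linearization is one of several possible LS formulations, and the durations and intensities are synthetic values generated from known coefficients, not Incheon data.

```python
import math


def fit_talbot(durations, intensities):
    """Fit I = a / (t + b) via linear regression of 1/I on t."""
    ys = [1.0 / i for i in intensities]
    n = len(durations)
    t_mean = sum(durations) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in durations)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(durations, ys))
    slope = sxy / sxx                     # equals 1 / a
    intercept = y_mean - slope * t_mean   # equals b / a
    a = 1.0 / slope
    b = intercept * a
    return a, b


def rms_error(durations, intensities, a, b):
    """Root mean squared error of the fitted formula on the original scale."""
    sq = [(i - a / (t + b)) ** 2 for t, i in zip(durations, intensities)]
    return math.sqrt(sum(sq) / len(sq))


# Synthetic example: intensities generated exactly from a = 5000, b = 40.
durations = [10.0, 20.0, 30.0, 60.0, 120.0, 180.0]  # minutes (illustrative)
intensities = [5000.0 / (t + 40.0) for t in durations]

a_hat, b_hat = fit_talbot(durations, intensities)
print(f"a = {a_hat:.3f}, b = {b_hat:.3f}, "
      f"RMS error = {rms_error(durations, intensities, a_hat, b_hat):.6f}")
```

With noisy observations the linearized fit weights short and long durations differently from a fit on the original scale, which is one reason to compare several estimation methods by RMS error.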

Test of Exponentiality in Step Stress Accelerated Life test Model based on Kullback-Leibler Information Function (쿨백-라이블러 정보함수 이용한 단계 스트레스 가속수명모형의 지수성 검정)

  • 박병구;윤상철
    • Journal of Korean Society for Quality Management / v.31 no.4 / pp.194-202 / 2003
  • In this paper, we propose goodness-of-fit test statistics for exponentiality in accelerated life test data based on Kullback-Leibler information functions. The acceleration model is assumed to be a tampered random variable model. The procedure is applicable whether or not the exponential parameter for the accelerated life test data is specified under the null hypothesis. We compare the power of the proposed test statistics with that of the Kolmogorov-Smirnov, Cramer-von Mises, and Anderson-Darling statistics in small samples.

Goodness of Fit Test of Normality Based on Kullback-Leibler Information

  • Kim, Jong-Tae;Lee, Woo-Dong;Ko, Jung-Hwan;Yoon, Yong-Hwa;Kang, Sang-Gil
    • Communications for Statistical Applications and Methods / v.6 no.3 / pp.909-918 / 1999
  • Arizono and Ohta (1989) studied a goodness-of-fit test of normality using the entropy estimator proposed by Vasicek (1976). More recently, van Es (1992) and Correa (1995) each proposed an estimator of entropy. In this paper, we propose goodness-of-fit test statistics for normality based on the estimators of Vasicek, van Es, and Correa. We compare the power of the proposed test statistics with that of the Kolmogorov-Smirnov, Kuiper, Cramer-von Mises, Watson, Anderson-Darling, and Finkelstein and Schafer statistics.
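
To make the entropy-based idea concrete, the sketch below computes Vasicek's spacing estimator of entropy and the ratio exp(H) / s, which is largest when the data are normal because the normal distribution maximizes entropy for a given variance. It covers only the Vasicek-type statistic; the van Es and Correa variants proposed in the paper are not reproduced, and the function names, window size `m`, and sample sizes are illustrative assumptions. Critical values depend on n and m and are omitted.

```python
import math
import random


def vasicek_entropy(xs, m):
    """Vasicek (1976) spacing estimator of differential entropy."""
    x = sorted(xs)
    n = len(x)
    total = 0.0
    for i in range(n):
        # Order statistics outside the sample are clamped to the extremes.
        lower = x[max(i - m, 0)]
        upper = x[min(i + m, n - 1)]
        total += math.log(n / (2.0 * m) * (upper - lower))
    return total / n


def entropy_normality_statistic(xs, m):
    """exp(entropy estimate) / sample sd; small values suggest non-normality."""
    n = len(xs)
    mean = sum(xs) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in xs) / n)
    return math.exp(vasicek_entropy(xs, m)) / sd


rng = random.Random(1)  # fixed seed for reproducibility
normal_sample = [rng.gauss(0.0, 1.0) for _ in range(200)]
expo_sample = [rng.expovariate(1.0) for _ in range(200)]

k_normal = entropy_normality_statistic(normal_sample, m=8)
k_expo = entropy_normality_statistic(expo_sample, m=8)
# For the true distributions the ratio is sqrt(2*pi*e) (about 4.13) under
# normality and e (about 2.72) under an exponential distribution.
print(f"normal sample: {k_normal:.3f}, exponential sample: {k_expo:.3f}")
```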

Adjusted ROC and CAP Curves (조정된 ROC와 CAP 곡선)

  • Hong, Chong-Sun;Kim, Ji-Hun;Choi, Jin-Soo
    • The Korean Journal of Applied Statistics / v.22 no.1 / pp.29-39 / 2009
  • ROC and CAP curves, among other tools, are used in credit rating work to explore the discriminatory power between defaults and non-defaults, based on the distribution of the probability of default. ROC and CAP curves are plotted in terms of various ratios of the probability of default. Each point on the ROC and CAP curves is calculated from a cutting point (score) for classifying defaults and non-defaults. In this paper, adjusted ROC and CAP curves are proposed by using functions of ratios of the probability of default. It is possible to recognize the score corresponding to a point on these adjusted curves, and we can identify the best score, which shows the optimal discriminatory power. Moreover, we discuss the relationships between the best score obtained from the adjusted ROC and CAP curves and the score corresponding to the Kolmogorov-Smirnov statistic used to test the homogeneity of the distribution functions of the defaults and non-defaults.

Optimal Threshold from ROC and CAP Curves (ROC와 CAP 곡선에서의 최적 분류점)

  • Hong, Chong-Sun;Choi, Jin-Soo
    • The Korean Journal of Applied Statistics / v.22 no.5 / pp.911-921 / 2009
  • Receiver Operating Characteristic (ROC) and Cumulative Accuracy Profile (CAP) curves are two methods used to assess the discriminatory power of different credit-rating approaches. The points of optimal classification accuracy on an ROC curve and of maximal profit on a CAP curve can be found by using iso-performance tangent lines, which are based on the standard notion of accuracy. In this paper, we offer an alternative accuracy measure called the true rate. Using this rate, one can obtain alternative optimal threshold points on both ROC and CAP curves. For most real populations of borrowers, the number of defaults is much smaller than that of non-defaults, and in such cases the true rate may be more efficient than the accuracy rate in terms of cost functions. Moreover, it is shown that the alternative scores of optimal classification accuracy and maximal profit are identical, and that this single score coincides with the score corresponding to the Kolmogorov-Smirnov statistic used to test the homogeneity of the distribution functions of the defaults and non-defaults.
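
The score corresponding to the Kolmogorov-Smirnov statistic can be located directly from the two empirical distribution functions. The sketch below assumes that low scores indicate default and uses made-up scores; the function name `ks_threshold` is hypothetical, and the paper's true rate measure is not reproduced here. At each candidate cutoff it compares the share of defaults and the share of non-defaults at or below the cutoff, which is the vertical gap between the ROC curve and the diagonal.

```python
def ks_threshold(default_scores, nondefault_scores):
    """Return (KS statistic, cutoff score at which it is attained)."""
    best_gap, best_score = 0.0, None
    for c in sorted(set(default_scores) | set(nondefault_scores)):
        # Empirical CDFs of the two groups evaluated at cutoff c.
        cdf_default = sum(s <= c for s in default_scores) / len(default_scores)
        cdf_nondefault = sum(s <= c for s in nondefault_scores) / len(nondefault_scores)
        gap = abs(cdf_default - cdf_nondefault)
        if gap > best_gap:
            best_gap, best_score = gap, c
    return best_gap, best_score


# Illustrative scores only: defaults tend to have lower scores.
defaults = [1, 2, 3, 4]
non_defaults = [3, 4, 5, 6, 7, 8]

ks, cutoff = ks_threshold(defaults, non_defaults)
print(f"KS = {ks:.3f} at score {cutoff}")  # KS = 0.667 at score 4
```

In this toy data, classifying every borrower with a score of 4 or below as a default captures all defaults while flagging one third of the non-defaults, which is where the two distribution functions are furthest apart.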

Reliability-Based Managing Criteria for Cable Tension Force in Cable-stayed Bridges (신뢰성에 기초한 사장교 케이블 장력 관리기준치 설정)

  • Cho, Hyo-Nam;Kang, Kyung-Koo;Cha, Cheol-Joon
    • Journal of the Korea Institute for Structural Maintenance and Inspection / v.9 no.3 / pp.129-138 / 2005
  • This paper presents a methodology for determining optimal managing criteria for cable tension force in cable-stayed bridges using acceleration data acquired by a monitoring system. Many long-span bridges in Korea are equipped with monitoring systems. These systems are installed to diagnose abnormal behavior or damage in bridges and to warn the bridge management agency of them. In cable-stayed bridges, the cable tension force can be an important indicator of abnormal behavior because of the geometric configuration of the bridge. If the management value of the cable tension force is set too high or too low, the monitoring system cannot warn properly of abnormal behavior of the bridge. Generally, the management value is set by empirical or engineering judgment, but in this paper a new methodology for determining the managing criteria for cable tension force is proposed based on a probability distribution model for tension force and reliability analysis. The proposed methodology is applied to a real concrete cable-stayed bridge in order to investigate its applicability.