• Title/Abstract/Keywords: Test Statistics

Search results: 6,465 items (processing time 0.029 seconds)

Robust Variable Selection in Classification Tree

  • 장정이;정광모
    • Korean Statistical Society: Conference Proceedings
    • /
    • Proceedings of the Korean Statistical Society 2001 Autumn Conference
    • /
    • pp.89-94
    • /
    • 2001
  • In this study we focus on variable selection in growing decision trees. Some splitting rules and variable selection algorithms are discussed. We propose a competitive variable selection method based on the Kruskal-Wallis test, which is a nonparametric version of the ANOVA F-test. Through a Monte Carlo study we note that CART has a serious bias in variable selection towards categorical variables having many values, and that QUEST, which uses the F-test, is not very powerful at selecting informative variables under heavy-tailed distributions.
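
As a rough illustration of the idea in this abstract, the sketch below ranks candidate numeric predictors by the Kruskal-Wallis H statistic computed against the class labels and picks the largest. It is only a minimal sketch under stated assumptions, not the authors' algorithm: the function names (`average_ranks`, `kruskal_wallis_h`, `select_variable`) are hypothetical, no tie correction is applied, and comparing raw H values is reasonable only because every variable is tested against the same class labels and so shares the same degrees of freedom.

```python
from collections import defaultdict


def average_ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def kruskal_wallis_h(values, labels):
    """Kruskal-Wallis H statistic (no tie correction, an assumption here)."""
    n = len(values)
    ranks = average_ranks(values)
    rank_sums = defaultdict(float)
    counts = defaultdict(int)
    for r, g in zip(ranks, labels):
        rank_sums[g] += r
        counts[g] += 1
    s = sum(rank_sums[g] ** 2 / counts[g] for g in counts)
    return 12.0 / (n * (n + 1)) * s - 3 * (n + 1)


def select_variable(columns, labels):
    """Pick the candidate column name with the largest H statistic."""
    return max(columns, key=lambda name: kruskal_wallis_h(columns[name], labels))


# Hypothetical toy data: x1 separates the two classes, x2 does not.
labels = ["a", "a", "a", "b", "b", "b"]
columns = {"x1": [1, 2, 3, 4, 5, 6], "x2": [1, 6, 2, 5, 3, 4]}
print(select_variable(columns, labels))
```

Because the statistic depends on the data only through ranks, a few extreme values in a heavy-tailed predictor cannot dominate it the way they can dominate an F statistic.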


A Unit Root Test Based on Bootstrapping

  • Shin, Key-Il;Kang, Hee-Jeong
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 3, No. 1
    • /
    • pp.257-265
    • /
    • 1996
  • We consider a nonstationary autoregressive process with infinite variance of error. In the case of infinite variance, the limiting distribution of the estimated coefficient is different from that under the finite variance assumption. In this paper we show through some empirical studies that the bootstrap method can be used to approximate the distribution of the ordinary least squares estimator of the coefficient in the first-order random walk process with infinite variance, and we suggest a test procedure based on the bootstrap method for the unit root test.
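
To make the general idea concrete, the following is a minimal sketch of a generic residual bootstrap for a left-tailed unit root test, assuming a first-order model without intercept. It is not the procedure from the paper and does not address the infinite-variance issues the abstract raises. The function names, the statistic n(rho_hat - 1), and the defaults `n_boot=999` and `seed=0` are assumptions made for illustration.

```python
import random


def ols_rho(y):
    """OLS slope from regressing y_t on y_{t-1} without an intercept."""
    num = sum(a * b for a, b in zip(y[:-1], y[1:]))
    den = sum(a * a for a in y[:-1])
    return num / den


def bootstrap_unit_root_pvalue(y, n_boot=999, seed=0):
    """Left-tailed bootstrap p-value for H0: rho = 1 against rho < 1."""
    rng = random.Random(seed)
    n = len(y) - 1
    stat = n * (ols_rho(y) - 1.0)
    # Impose the null: under rho = 1 the innovations are the first differences.
    diffs = [b - a for a, b in zip(y[:-1], y[1:])]
    mean = sum(diffs) / n
    centered = [d - mean for d in diffs]
    count = 0
    for _ in range(n_boot):
        # Rebuild a random walk from resampled, centered innovations.
        path = [y[0]]
        for _ in range(n):
            path.append(path[-1] + rng.choice(centered))
        if n * (ols_rho(path) - 1.0) <= stat:
            count += 1
    return (count + 1) / (n_boot + 1)


# Hypothetical short series, used only to show the call.
series = [0.0, 1.0, 0.5, 1.5, 1.0, 2.0, 2.5, 1.5]
print(bootstrap_unit_root_pvalue(series))
```

Resampling is done under the null (the pseudo-series are rebuilt as random walks), so the bootstrap distribution approximates the null distribution of the statistic rather than its distribution under the fitted coefficient.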


A Study on Distribution Based on the Normalized Sample Lorenz Curve

  • Suk-Bok Kang;Cho, Young-Suk
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 8, No. 1
    • /
    • pp.185-192
    • /
    • 2001
  • Using the Lorenz curve, which has proved to be a powerful tool for measuring income inequality within a population of income receivers, we propose the normalized sample Lorenz curve for the goodness-of-fit test, which is a very important test in statistical analysis. For two Hodgkin's disease data sets, we compare the Q-Q plot and the proposed normalized sample Lorenz curve.
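
For orientation, the sketch below computes the ordinary sample Lorenz curve, that is, the cumulative share of the total held by the smallest fraction of observations. The normalization proposed in the paper is not described in this abstract and is not reproduced here. The function name `sample_lorenz_curve` is hypothetical, and the code assumes nonnegative data with a positive total.

```python
def sample_lorenz_curve(data):
    """Return points (p, L(p)) of the sample Lorenz curve, starting at (0, 0)."""
    xs = sorted(data)
    n = len(xs)
    total = sum(xs)
    points = [(0.0, 0.0)]
    running = 0.0
    for i, x in enumerate(xs, start=1):
        running += x
        # Smallest i of n observations hold running/total of the sum.
        points.append((i / n, running / total))
    return points


# Hypothetical toy sample.
for p, share in sample_lorenz_curve([4, 1, 3, 2]):
    print(p, share)
```

A perfectly equal sample gives points on the diagonal; the further the curve sags below it, the more unequal the sample.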


Asymmetric Modeling in Beta-ARCH Processes

  • S. Y. Hwang;Kahng, Myung-Wook
    • Journal of the Korean Statistical Society
    • /
    • Vol. 31, No. 4
    • /
    • pp.459-468
    • /
    • 2002
  • A class of asymmetric beta-ARCH processes is proposed and connections to traditional ARCH models are explained. Geometric ergodicity of the model is discussed. Conditional least squares as well as maximum likelihood estimators of parameters and their limit results are also presented. A test for symmetry of the model is studied, and the limiting power of the test statistic is given.

Test for Independence in Bivariate Pareto Model with Bivariate Random Censored Data

  • Cho, Jang-Sik;Kwon, Yong-Man;Choi, Seung-Bae
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 15, No. 1
    • /
    • pp.31-39
    • /
    • 2004
  • In this paper, we consider a two-component system in which the lifetimes follow a bivariate Pareto model with bivariate random censored data. We assume that the censoring times are independent of the lifetimes of the two components. We develop a large-sample test for independence between the two components. We also present a simulation study of the test based on the asymptotic normal distribution.


Influence Analysis of the Likelihood Ratio Test in Multivariate Behrens-Fisher Problem

  • Jung, Kang-Mo;Kim, Myung-Geun
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 6, No. 3
    • /
    • pp.939-946
    • /
    • 1999
  • We propose methods for detecting influential observations that have a large influence on the likelihood ratio test statistic for the multivariate Behrens-Fisher problem. For this purpose we derive the influence curve and the derivative influence of the likelihood ratio test statistic. An illustrative example is given to show the effectiveness of the proposed methods on the identification of influential observations.


Error cause analysis of Pearson test statistics for k-population homogeneity test

  • 허순영
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 24, No. 4
    • /
    • pp.815-824
    • /
    • 2013
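  • In large-scale sample surveys such as national surveys, complex sample designs that combine stratification, clustering, systematic sampling, and unequal probability sampling are commonly used to secure the representativeness of the sample. In categorical data analysis based on such complex sample designs, the traditional Pearson test, which assumes independent observations and a multinomial distribution, can give distorted test results. For the k-population homogeneity test of categorical survey data from a complex sample design, this study derives the Wald test statistic, a design-based consistent statistic, and itemizes the sources of error that can arise when the traditional Pearson test statistic is used instead. It derives an expression that decomposes the error into the effect of bias in the variance, the effect of bias in the estimator, and the remaining effect in which the variance bias and the estimator bias are confounded. To check empirically the relative size of each term's effect on the Pearson chi-square test statistic, an empirical analysis was carried out using data from the second year of the fourth Korea National Health and Nutrition Examination Survey. The results clearly confirmed that, although there are differences across variables, the effect of the variance bias is generally larger than the effect of the estimator bias.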
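
For reference, the sketch below computes the traditional Pearson chi-square statistic for homogeneity of k populations from a k-by-c table of counts. This is the simple-random-sampling statistic that, according to the abstract, can be distorted under complex designs. It is not the design-based Wald statistic or the error decomposition derived in the paper. The function name `pearson_homogeneity_chi2` and the toy table are assumptions.

```python
def pearson_homogeneity_chi2(table):
    """Pearson chi-square statistic and degrees of freedom for a k x c count table.

    Rows are populations, columns are categories. Assumes every row and
    column total is positive.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under homogeneity of the row populations.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(col_totals) - 1)
    return stat, df


# Hypothetical 2 x 2 table: two populations, two categories.
stat, df = pearson_homogeneity_chi2([[10, 20], [20, 10]])
print(stat, df)
```

The expected counts use only the table margins, so nothing in the statistic reflects stratification, clustering, or unequal selection probabilities.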

On the distribution-free tests for umbrella alternatives in a randomized block design

  • 김동희;김영철
    • The Korean Journal of Applied Statistics (응용통계연구)
    • /
    • Vol. 5, No. 1
    • /
    • pp.41-57
    • /
    • 1992
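  • We propose distribution-free tests for umbrella alternatives in a randomized block design and examine the asymptotic properties of the proposed test statistics and their asymptotic relative efficiency with respect to the parametric method. Using small-sample Monte Carlo experiments, we obtain the empirical power of the proposed test statistics and of Puri's parametric statistic with 4 blocks and 5 treatments and with 3 blocks and 5 treatments, where the given peak is 2, 3, or 4, and with 2 blocks and 4 treatments, where the given peak is 3. The proposed test statistics are shown to be efficient under heavy-tailed distributions.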


Genetic association tests when a nuisance parameter is not identifiable under no association

  • Kim, Wonkuk;Kim, Yeong-Hwa
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 24, No. 6
    • /
    • pp.663-671
    • /
    • 2017
  • Some genetic association tests include an unidentifiable nuisance parameter under the null hypothesis of no association. When the mode of inheritance (MOI) is not specified in a case-control design, the Cochran-Armitage (CA) trend test contains an unidentifiable nuisance parameter. The transmission disequilibrium test (TDT) in a family-based association study that includes the unaffected also contains an unidentifiable nuisance parameter. Hypothesis tests that include an unidentifiable nuisance parameter are typically performed by taking a supremum of the CA tests or TDT over reasonable values of the parameter. The p-values of the supremum test statistics cannot be obtained from a normal or chi-square distribution. A common method is to use Davies's upper bound of the p-value instead of an exact asymptotic p-value. In this paper, we provide a unified sine-cosine process expression of the CA trend test that does not specify the MOI and the TDT that includes the unaffected. We also present a closed-form expression of the exact asymptotic formulas to calculate the p-values of the supremum tests when the score function can be written as a linear form in an unidentifiable parameter. We illustrate how to use the derived formulas using a pharmacogenetics case-control dataset and an attention deficit hyperactivity disorder family-based example.
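
As a rough illustration of the supremum construction described in this abstract, the sketch below computes the standard CA trend Z statistic for a 2-by-3 case-control genotype table with scores (0, x, 1) and takes the largest absolute value over a grid of x in [0, 1]. The grid search, the function names, and the toy counts are assumptions. The paper's sine-cosine representation and its exact asymptotic p-value formulas are not reproduced, so no p-value is computed here.

```python
import math


def ca_trend_z(cases, controls, weights):
    """Cochran-Armitage trend Z statistic for genotype counts and scores."""
    r_total, s_total = sum(cases), sum(controls)
    n = r_total + s_total
    totals = [r + s for r, s in zip(cases, controls)]
    sw_r = sum(w * r for w, r in zip(weights, cases))
    sw_n = sum(w * t for w, t in zip(weights, totals))
    sw2_n = sum(w * w * t for w, t in zip(weights, totals))
    u = n * sw_r - r_total * sw_n
    var = (r_total * s_total / n) * (n * sw2_n - sw_n ** 2)
    return u / math.sqrt(var)


def sup_ca_trend(cases, controls, grid_size=101):
    """Largest |Z| over scores (0, x, 1) with x on an even grid in [0, 1]."""
    best = 0.0
    for k in range(grid_size):
        x = k / (grid_size - 1)
        best = max(best, abs(ca_trend_z(cases, controls, (0.0, x, 1.0))))
    return best


# Hypothetical genotype counts (0, 1, 2 copies of the risk allele).
cases, controls = (10, 20, 30), (30, 20, 10)
print(ca_trend_z(cases, controls, (0.0, 0.5, 1.0)))
print(sup_ca_trend(cases, controls))
```

Here x = 0 corresponds to a recessive coding, x = 0.5 to an additive coding, and x = 1 to a dominant coding. As the abstract notes, the supremum cannot be referred to a standard normal or chi-square distribution.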