• 제목/요약/키워드: two sample t-test

검색결과 578건 처리시간 0.026초

Effective Sample Sizes for the Test of Mean Differences Based on Homogeneity Test

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • 제12권3호
    • /
    • pp.91-99
    • /
    • 2019
  • Many researchers in various study fields use the two sample t-test to confirm their treatment effects. The two sample t-test is generally used for small samples, and assumes that two independent random samples are selected from normal populations, and the population variances are unknown. Researchers often conduct F-test, the test of equality of variances, before testing the treatment effects, and the test statistic or confidence interval for the two sample t-test has two formats according to whether the variances are equal or not. Researchers using the two sample t-test often want to know how large sample sizes they need to get reliable test results. This research gives some guidelines for sample sizes to them through simulation works. The simulation had run for normal populations with the different ratios of two variances for different sample sizes (${\leq}30$). The simulation results are as follows. First, if one has no idea equality of variances but he/she can assume the difference is moderate, it is safe to use sample size at least 20 in terms of the nominal level of significance. Second, the power of F-test for the equality of variances is very low when the sample sizes are small (<30) even though the ratio of two variances is equal to 2. Third, the sample sizes at least 10 for the two sample t-test are recommendable in terms of the nominal level of significance and the error limit.

Comparison of the Power of Bootstrap Two-Sample Test and Wilcoxon Rank Sum Test for Positively Skewed Population

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • 제15권1호
    • /
    • pp.9-18
    • /
    • 2022
  • This research examines the power of bootstrap two-sample test, and compares it with the powers of two-sample t-test and Wilcoxon rank sum test, through simulation. For simulation work, a positively skewed and heavy tailed distribution was selected as a population distribution, the chi-square distributions with three degrees of freedom, χ23. For two independent samples, the fist sample was selected from χ23. The second sample was selected independently from the same χ23 as the first sample, and calculated d+ax for each sampled value x, a randomly selected value from χ23. The d in d+ax has from 0 to 5 by 0.5 interval, and the a has from 1.0 to 1.5 by 0.1 interval. The powers of three methods were evaluated for the sample sizes 10,20,30,40,50. The null hypothesis was the two population medians being equal for Bootstrap two-sample test and Wilcoxon rank sum test, and the two population means being equal for the two-sample t-test. The powers were obtained using r program language; wilcox.test() in r base package for Wilcoxon rank sum test, t.test() in r base package for the two-sample t-test, boot.two.bca() in r wBoot pacakge for the bootstrap two-sample test. Simulation results show that the power of Wilcoxon rank sum test is the best for all 330 (n,a,d) combinations and the power of two-sample t-test comes next, and the power of bootstrap two-sample comes last. As the results, it can be recommended to use the classic inference methods if there are widely accepted and used methods, in terms of time, costs, sometimes power.

두 모집단 모평균 비교의 지도에 관한 연구 (A Study on Teaching Method of Two-Sample Test for Population Mean Difference)

  • 김용태;이장택
    • 한국수학교육학회지시리즈A:수학교육
    • /
    • 제45권2호
    • /
    • pp.145-154
    • /
    • 2006
  • The main purpose of this study is to investigate the effect of departures from normality and equal variance on the two-sample test when the variances are unknown. We have found that type I error brought about a little bit change which is ignorable in relation to kurtosis. But the change of type I error was mainly based on the skewness of the parent population. In introductory statistics classes where data analysis includes techniques for detecting skewness of two populations, we recommend the two-sample t-test when maximal skewness of two populations is smalter than the value 4 when the variances seem equal. Furthermore, our simulations reveal that the two-sample t-test appears somewhat more robust than that of z-test if the assumption of equal variance is satisfied. In the case of unequal variance, the two-sample t-test appears somewhat more robust provided the t-statistic using Satterthwaite's approximate degrees of freedom.

  • PDF

Effect of Positively Skewed Distribution on the Two sample t-test: Based on Chi-square Distribution

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • 제14권3호
    • /
    • pp.123-129
    • /
    • 2021
  • This research examines the effect of positively skewed population distribution on the two sample t-test through simulation. For simulation work, two independent samples were selected from the same chi-square distributions with 3, 5, 10, 15, 20, 30 degrees of freedom and sample sizes 3, 5, 10, 15, 20, 30, respectively. Chi-square distribution is largely skewed to the right at small degrees of freedom and getting symmetric as the degrees of freedom increase. Simulation results show that the sampled populations are distributed positively skewed like chi-square distribution with small degrees of freedom, the F-test for the equality of variances shows poor performances even at the relatively large degrees of freedom and sample sizes like 30 for both, and so it is recommended to avoid using F-test. When two population variances are equal, the skewness of population distribution does not affect on the t-test in terms of the confidence level. However even though for the highly positively skewed distribution and small sample sizes like three or five the t-test achieved the nominal confidence level, the error limits are very large at small sample size. Therefore, if the sampled population is expected to be highly skewed to the right, it will be recommended to use relatively large sample size, at least 20.

Asymptotic Relative Efficiency of t-test Following Transformations

  • Yeo, In-Kwon
    • Journal of the Korean Statistical Society
    • /
    • 제26권4호
    • /
    • pp.467-476
    • /
    • 1997
  • The two-sample t-test is not expected to be optimal when the two samples are not drawn from normal populations. According to Box and Cox (1964), the transformation is estimated to enhance the normality of the tranformed data. We investigate the asymptotic relative efficiency of the ordinary t-test versus t-test applied transformation introduced by Yeo and Johnson (1997) under Pitman local alternatives. The theoretical and simulation studies show that two-sample t-test using transformed date gives higher power than ordinary t-test for location-shift models.

  • PDF

잡음영상에서 에지검출을 위한 이표본 검정법 (Two-sample Tests for Edge Detection in Noisy Images)

  • 임동훈;박은희
    • 응용통계연구
    • /
    • 제14권1호
    • /
    • pp.149-160
    • /
    • 2001
  • 본 논문에서는 이표본 위치문제에서 대표적인 윌콕슨 검정법과 T 검정법을 사용하여 잡음영상에서 에지를 검출하고자 한다. 에지높이모수를 사용하여 얻은 수정된 농도값상에서 검정통계량을 계산하고 유의수준에 의해 결정된 임계값과 비교하여 에지유무를 판정한다. 영상실험을 통하여 얻은 에지맵과 객관적인 척도하에서 에지검출 성능을 비교분석한다.

  • PDF

두 집단 평균 차이 검정에서 분산의 동질성에 관한 소고 (Note on the Equality of Variances in Two Sample t-Test)

  • 김상철;임요한
    • Communications for Statistical Applications and Methods
    • /
    • 제17권1호
    • /
    • pp.79-88
    • /
    • 2010
  • 기초통계학의 수업에서 두 집단간 평균의 차이를 검정함에 있어 두 집단의 분산의 동질성 여부에 따라 다른 통계 절차를 사용할 것을 제안하고 있다. 이러한 이유로 통계 분석에 사용되는 SAS나 SPSS 등의 패키지에서는 두 집단의 평균 차이의 검정에 앞서 분산의 동질성 검정을 선행할 것을 제안한다. 하지만, 이전의 몇몇 연구에서 알려진 바와 같이 이러한 이 단계 검정 절차는 검정의 유의수준(제 1종의 오류)을 제어하기가 어렵다. 본 글에서는 이 단계 검정을 행함에 있어 1 단계와 2 단계의 유의수준 ${\alpha}_1$${\alpha}_2$를 조절하여 전체 검정의 유의수준을 주어진 ${\alpha}$ 이하로 제어하는 절차를 소개한다.

Improved Statistical Testing of Two-class Microarrays with a Robust Statistical Approach

  • Oh, Hee-Seok;Jang, Dong-Ik;Oh, Seung-Yoon;Kim, Hee-Bal
    • Interdisciplinary Bio Central
    • /
    • 제2권2호
    • /
    • pp.4.1-4.6
    • /
    • 2010
  • The most common type of microarray experiment has a simple design using microarray data obtained from two different groups or conditions. A typical method to identify differentially expressed genes (DEGs) between two conditions is the conventional Student's t-test. The t-test is based on the simple estimation of the population variance for a gene using the sample variance of its expression levels. Although empirical Bayes approach improves on the t-statistic by not giving a high rank to genes only because they have a small sample variance, the basic assumption for this is same as the ordinary t-test which is the equality of variances across experimental groups. The t-test and empirical Bayes approach suffer from low statistical power because of the assumption of normal and unimodal distributions for the microarray data analysis. We propose a method to address these problems that is robust to outliers or skewed data, while maintaining the advantages of the classical t-test or modified t-statistics. The resulting data transformation to fit the normality assumption increases the statistical power for identifying DEGs using these statistics.

영상에서 윈도우 배치에 따른 통계적 에지검출 비교 (Window Configurations Comparison Based on Statistical Edge Detection in Images)

  • 임동훈
    • Communications for Statistical Applications and Methods
    • /
    • 제16권4호
    • /
    • pp.615-625
    • /
    • 2009
  • 본 논문에서는 영상의 에지검출을 하는데 사용되는 여러 가지 윈도우 배치(window configurations)하에서 통계학의 이표본 위치문제(two-sample location problem)에서 대표적인 Wilcoxon 검정과 T-검정에 기초한 에지검출법에 대해 논의하고자 한다. 영상의 에지검출하는데 윈도우 배치 선택은 에지검출 성능을 결정하는 중요한 요소이다. 본 논문에서 에지는 선택된 윈도우 배치 하에서 에지-높이 모수(edge-height parameter)를 사용한 에지 모형 하에서 두 근방 영역간의 유의한 차이가 있는지를 검정함으로서 결정한다. 영상 실험에서 윈도우 배치에 따른 통계적 검정에 의한 에지검출 성능은 에지 맵(edge map)을 통한 정성적인 비교와 객관적인 척도하에서 정량적인 비교 그리고 CPU 계산시간까지 고려하여 분석하였다.

독립인 두 모집단 설계에서의 표본수 비교 (Sample size comparison for two independent populations)

  • 고해원;김동재
    • Journal of the Korean Data and Information Science Society
    • /
    • 제21권6호
    • /
    • pp.1243-1251
    • /
    • 2010
  • 임상시험을 시행하는 경우 위약을 신약과 비교하는 경우가 대다수이다. 기존에 독립인 두 모 집단의 표본수를 계산하는 방법으로 모수적 방법에서는 t검정을 이용하였고, 비모수적 방법에서는 Wilcoxon 순위합검정 (Wilcoxon, 1945)을 이용하였다. 본 논문에서는 Orban과 Wolfe (1982)가 제안한 선형위치통계량의 검정법과, Kim (1994)이 선형위치통계량에 기초하여 계산한 검정력의 결과를 이용하여 표본수 구하는 방법을 제안한다. 또한 앞서 제안한 방법의 표본수를 기존의 Wilcoxon 순위합검정을 이용하여 Wang 등 (2003)이 제안한 공식을 이용한 표본수, 그리고 모수적 방법을 이용한 t검정의 표본수와 비교하였다.