• Title/Summary/Keyword: two-sample

Search Result 5,682, Processing Time 0.032 seconds

Effective Sample Sizes for the Test of Mean Differences Based on Homogeneity Test

  • Heo, Sunyeong
    • Journal of Integrative Natural Science
    • /
    • v.12 no.3
    • /
    • pp.91-99
    • /
    • 2019
  • Many researchers in various study fields use the two sample t-test to confirm their treatment effects. The two sample t-test is generally used for small samples, and assumes that two independent random samples are selected from normal populations, and the population variances are unknown. Researchers often conduct F-test, the test of equality of variances, before testing the treatment effects, and the test statistic or confidence interval for the two sample t-test has two formats according to whether the variances are equal or not. Researchers using the two sample t-test often want to know how large sample sizes they need to get reliable test results. This research gives some guidelines for sample sizes to them through simulation works. The simulation had run for normal populations with the different ratios of two variances for different sample sizes (${\leq}30$). The simulation results are as follows. First, if one has no idea equality of variances but he/she can assume the difference is moderate, it is safe to use sample size at least 20 in terms of the nominal level of significance. Second, the power of F-test for the equality of variances is very low when the sample sizes are small (<30) even though the ratio of two variances is equal to 2. Third, the sample sizes at least 10 for the two sample t-test are recommendable in terms of the nominal level of significance and the error limit.

Comparison of the Power of Bootstrap Two-Sample Test and Wilcoxon Rank Sum Test for Positively Skewed Population

  • Heo, Sunyeong
    • Journal of Integrative Natural Science
    • /
    • v.15 no.1
    • /
    • pp.9-18
    • /
    • 2022
  • This research examines the power of bootstrap two-sample test, and compares it with the powers of two-sample t-test and Wilcoxon rank sum test, through simulation. For simulation work, a positively skewed and heavy tailed distribution was selected as a population distribution, the chi-square distributions with three degrees of freedom, χ23. For two independent samples, the fist sample was selected from χ23. The second sample was selected independently from the same χ23 as the first sample, and calculated d+ax for each sampled value x, a randomly selected value from χ23. The d in d+ax has from 0 to 5 by 0.5 interval, and the a has from 1.0 to 1.5 by 0.1 interval. The powers of three methods were evaluated for the sample sizes 10,20,30,40,50. The null hypothesis was the two population medians being equal for Bootstrap two-sample test and Wilcoxon rank sum test, and the two population means being equal for the two-sample t-test. The powers were obtained using r program language; wilcox.test() in r base package for Wilcoxon rank sum test, t.test() in r base package for the two-sample t-test, boot.two.bca() in r wBoot pacakge for the bootstrap two-sample test. Simulation results show that the power of Wilcoxon rank sum test is the best for all 330 (n,a,d) combinations and the power of two-sample t-test comes next, and the power of bootstrap two-sample comes last. As the results, it can be recommended to use the classic inference methods if there are widely accepted and used methods, in terms of time, costs, sometimes power.

Two-Sample Inference for Quantiles Based on Bootstrap for Censored Survival Data

  • Kim, Ji-Hyun
    • Journal of the Korean Statistical Society
    • /
    • v.22 no.2
    • /
    • pp.159-169
    • /
    • 1993
  • In this article, we consider two sample problem with randomly right censored data. We propse two-sample confidence intervals for the difference in medians or any quantiles, based on bootstrap. The bootstrap version of two-sample confidence intervals proposed in this article is simple to apply and do not need the assumption of the shift model, so that for the non-shift model, the density estimation is not necessary, which is an attractive feature in small to moderate sized sample case.

  • PDF

Unbiased Balanced Half-Sample Variance Estimation in Stratified Two-stage Sampling

  • Kim, Kyu-Seong
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.4
    • /
    • pp.459-469
    • /
    • 1998
  • Balanced half sample method is a simple variance estimation method for complex sampling designs. Since it is simple and flexible, it has been widely used in large scale sample surveys. However, the usual BHS method overestimate the true variance in without replacement sampling and two-stage cluster sampling. Focusing on this point , we proposed an unbiased BHS variance estimator in a stratified two-stage cluster sampling and then described an implementation method of the proposed estimator. Finally, partially BHS design is explained as a tool of reducing the number of replications of the proposed estimator.

  • PDF

The Test Statistic of the Two Sample Locally Optimum Rank Detector for Random Signals in Weakly Dependent Noise Models (약의존성 잡음에서 두 표본을 쓰는 국소 최적 확률 신호 검파기의 검정 통계량)

  • Bae, Jin-Soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.8C
    • /
    • pp.709-712
    • /
    • 2010
  • In this paper, the two sample locally optimum rank detector is obtained in the weakly dependent noise with non-zero temporal correlation between noise observations. The test statistic of the locally optimum rank detector is derived from the Neyman-Pearson lemma suitable for the two sample observation models, where it is assumed that reference observations are available in addition to regular observations. Two-sample locally optimum rank detecter shows the same performance with the one-sample locally optimum rank detector asymptotically. The structure of the two-sample rank detector is simpler than that of the one-sample rank detector because the sign statistic is not processed separately.

Simulations of Two-Dimensional Electronic Correlation Spectra

  • Kim, Hak Jin;Jeon, Seong Jun
    • Bulletin of the Korean Chemical Society
    • /
    • v.22 no.8
    • /
    • pp.807-815
    • /
    • 2001
  • Two-dimensional (2D) correlation method, which generates the synchronous and the asynchronous 2D spectrum by complex cross correlation of the Fourier transformed spectra, is an analysis method for the changes of the sample spectrum induced by vari ous perturbations. In the present work, the 2D electronic correlation spectra have been simulated for the cases where the sample spectrum composed of two gaussian bands changes linearly. When only the band amplitudes of the sample spectrum change, the synchronous spectrum shows strong peaks at the band centers of the sample spectrum, but the asynchronous spectrum does not make peaks. When the sample spectrum shifts without changing intensity and width, the synchronous spectrum shows peaks around the initial and final positions of the band maximum and the asynchronous spectrum shows long peaks spanning the shifting range. The band width change produces the complex 2D correlation spectra. When the sample spectrum shifts with band broadening, the width change by 50% of full width at half maximum (FWHM) does not give so large an effect on the correlation spectrum as the spectral shift by one half of FWHM of the sample spectrum.

Effect of Positively Skewed Distribution on the Two sample t-test: Based on Chi-square Distribution

  • Heo, Sunyeong
    • Journal of Integrative Natural Science
    • /
    • v.14 no.3
    • /
    • pp.123-129
    • /
    • 2021
  • This research examines the effect of positively skewed population distribution on the two sample t-test through simulation. For simulation work, two independent samples were selected from the same chi-square distributions with 3, 5, 10, 15, 20, 30 degrees of freedom and sample sizes 3, 5, 10, 15, 20, 30, respectively. Chi-square distribution is largely skewed to the right at small degrees of freedom and getting symmetric as the degrees of freedom increase. Simulation results show that the sampled populations are distributed positively skewed like chi-square distribution with small degrees of freedom, the F-test for the equality of variances shows poor performances even at the relatively large degrees of freedom and sample sizes like 30 for both, and so it is recommended to avoid using F-test. When two population variances are equal, the skewness of population distribution does not affect on the t-test in terms of the confidence level. However even though for the highly positively skewed distribution and small sample sizes like three or five the t-test achieved the nominal confidence level, the error limits are very large at small sample size. Therefore, if the sampled population is expected to be highly skewed to the right, it will be recommended to use relatively large sample size, at least 20.

A TWO-SAMPLE CONDITIONAL UNRELATED QUESTION MODEL

  • Lee, Gi-Sung;Hong, Ki-Hak
    • Journal of applied mathematics & informatics
    • /
    • v.9 no.2
    • /
    • pp.825-835
    • /
    • 2002
  • In this paper, we extend the conditional unrelated question model which was suggested by Lee and Hong(2000) to two-sample case when there is no information about the true proportion of the unrelated character Y. Conditions are obtained under which the proposed model is more efficient than Carr et al.\`s conditional modal and Greenberg et al.'s two-sample unrelated question model.

A Study on Teaching Method of Two-Sample Test for Population Mean Difference (두 모집단 모평균 비교의 지도에 관한 연구)

  • Kim Yong-Tae;Lee Jang-Taek
    • The Mathematical Education
    • /
    • v.45 no.2 s.113
    • /
    • pp.145-154
    • /
    • 2006
  • The main purpose of this study is to investigate the effect of departures from normality and equal variance on the two-sample test when the variances are unknown. We have found that type I error brought about a little bit change which is ignorable in relation to kurtosis. But the change of type I error was mainly based on the skewness of the parent population. In introductory statistics classes where data analysis includes techniques for detecting skewness of two populations, we recommend the two-sample t-test when maximal skewness of two populations is smalter than the value 4 when the variances seem equal. Furthermore, our simulations reveal that the two-sample t-test appears somewhat more robust than that of z-test if the assumption of equal variance is satisfied. In the case of unequal variance, the two-sample t-test appears somewhat more robust provided the t-statistic using Satterthwaite's approximate degrees of freedom.

  • PDF

Effects of the Reference Sample Size on the Performance of the Two-Sample Rank Detector (두 표본 순위 검파에서 기준 표본 크기가 검파기 성능에 미치는 영향)

  • Bae, Jinsoo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.8
    • /
    • pp.1515-1517
    • /
    • 2015
  • The effects of the reference sample size on the detection probability of the two-sample rank detector is investigated in this paper. The larger reference sample size shows the better performance of the detector. The effect is also shown to be saturated as the reference sample size becomes larger.