DOI QR코드

DOI QR Code

Effect of complex sample design on Pearson test statistic for homogeneity

복합표본자료에서 동질성검정을 위한 피어슨 검정통계량의 효과

  • Heo, Sun-Yeong (Department of Statistics, Changwon National University) ;
  • Chung, Young-Ae (Department of Childhood Education, Changwon National University)
  • Received : 2012.06.25
  • Accepted : 2012.07.19
  • Published : 2012.07.31

Abstract

This research is for comparison of test statistics for homogeneity when the data is collected based on complex sample design. The survey data based on complex sample design does not satisfy the condition of independency which is required for the standard Pearson multinomial-based chi-squared test. Today, lots of data sets ara collected by complex sample designs, but the tests for categorical data are conducted using the standard Pearson chi-squared test. In this study, we compared the performance of three test statistics for homogeneity between two populations using data from the 2009 customer satisfaction evaluation survey to the service from Gyeongsangnam-do regional offices of education: the standard Pearson test, the unbiasedWald test, and the Pearsontype test with survey-based point estimates. Through empirical analyses, we fist showed that the standard Pearson test inflates the values of test statistics very much and the results are not reliable. Second, in the comparison of Wald test and Pearson-type test, we find that the test results are affected by the number of categories, the mean and standard deviation of the eigenvalues of design matrix.

복합표본설계에 기초한 범주형 조사자료는 통상적인 피어슨 카이제곱검정에 필요한 조건을 만족하지 못한다. 그러나 많은 조사연구에서 복잡한 표본설계 방법을 적용하고 있지만, 종래의 피어슨 검정결과를 제시하고 있다. 본 연구는 복합표본설계에 의한 범주형자료의 동질성검정에 대한 실증분석을 통해, 종래의 피어슨 검정과 불편검정인 왈드검정, 표본설계를 반영한 비율추정치를 사용하는 피어슨 검정을 비교하였다. 분석결과, 종래의 피어슨검정은 표본설계를 반영하는 검정들에 비해 통계량 값이 매우 크고, 유의확률이 심각하게 작게 나타나는 것을 확인하였다. 복합표본설계를 반영하되 추정량의 분산을 아는 경우와 모르는 경우의 비교에서는 범주수, 설계효과행렬의 고유치들의 평균과 표준편차에 영향을 받는 것을 확인하였다.

Keywords

References

  1. Chung, Y., Jung, D. and Heo, S. (2009). 2009 Customer satisfaction evaluation survey to the service from Gyeongsangnam-do regional offices of education, Changwon National Univeristy, Gyeongnam.
  2. Heo. S. and Chang. D. (2010). A sample survey design for service satisfaction evaluation of regional education offices. Journal of the Korean Data & Information Science Society, 21, 671-678.
  3. Holt, D., Scott, A. J. and Ewings, P. D. (1980). Chi-squared tests with survey data. Journal of the Royal Statistical Society A, 143, 302-320.
  4. Heo, S. (2006). Power analysis of the Rao-Scott first-order adjustment to the Pearson test for homogeneity. Proceedings of Joint Statistical Meeting, Seattle, U.S.A., 3126-3129.
  5. Kim, D. H., Cho, K. H., Hwang, J. S. and Jung, K. H. (2009). A sample design for life and attitude survey of Gyeongbuk people. Journal of the Korean Data & Information Science Society, 20, 1165-1167.
  6. Kim, D. H., Hwang, J. S. and Kwak, S. G. (2010). A sample design for the survey on actual state of SMEs. Journal of the Korean Data & Information Science Society, 21, 1021-1029.
  7. Lee, C., Kang, H. and Sim, S. (2012). An implementation of the sample size and the power for testing mean and proportion. Journal of the Korean Data & Information Science Society, 23, 53-61. https://doi.org/10.7465/jkdi.2012.23.1.053
  8. Lavrakas, P. J. (2008). Encyclopedia of survey research methods, Vol.2, SAGE Publication, Inc., London.
  9. Rao, J. N. K. and Scott, A. J. (1981). The analysis of categorical data from complex sample surveys: Chi-squared tests for goodness of fit the independence in two-way tables. Journal of the American Statistical Association, 76, 221-230. https://doi.org/10.1080/01621459.1981.10477633
  10. Rao, J. N. K. and Scott, A. J. (1984). On chi-squared test for multiway contingency tables with cell proportions estimated from survey data. The Annals of Statistics, 12, 46-60. https://doi.org/10.1214/aos/1176346391
  11. Rao, J. N. K. and Scott, A. J. (1987). On simple adjustments to chi-square tests with sample survey data. The Annals of Statistics, 15, 385-397. https://doi.org/10.1214/aos/1176350273
  12. Shao, J. (1996). Resampling methods in sample surveys (with discussion). Statistics, 27, 203-254. https://doi.org/10.1080/02331889708802523

Cited by

  1. A study of the factors influential on a health-related quality of life using complex sample design vol.25, pp.4, 2014, https://doi.org/10.7465/jkdi.2014.25.4.829
  2. The unit-nonresponse status and use of weight in the KCYPS vol.25, pp.6, 2014, https://doi.org/10.7465/jkdi.2014.25.6.1397
  3. Error cause analysis of Pearson test statistics for k-population homogeneity test vol.24, pp.4, 2013, https://doi.org/10.7465/jkdi.2013.24.4.815
  4. Empirical Analysis on Rao-Scott First Order Adjustment for Two Population Homogeneity test Based on Stratified Three-Stage Cluster Sampling with PPS vol.7, pp.3, 2014, https://doi.org/10.13160/ricns.2014.7.3.208
  5. 성향점수를 이용한 운동강도가 고혈압에 미치는 영향 vol.28, pp.1, 2012, https://doi.org/10.7465/jkdi.2017.28.1.109
  6. 국가표본조사자료 기반 청소년 성경험의 개인 및 가족 요인 분석 vol.28, pp.1, 2017, https://doi.org/10.7465/jkdi.2017.28.1.21