DOI QR코드

DOI QR Code

A Study on the Power Comparison between Logistic Regression and Offset Poisson Regression for Binary Data

  • Kim, Dae-Youb (Department of Statistics, Hankuk University of Foreign Studies) ;
  • Park, Heung-Sun (Department of Statistics, Hankuk University of Foreign Studies)
  • Received : 2012.01.24
  • Accepted : 2012.06.03
  • Published : 2012.07.31

Abstract

In this paper, for analyzing binary data, Poisson regression with offset and logistic regression are compared with respect to the power via simulations. Poisson distribution can be used as an approximation of binomial distribution when n is large and p is small; however, we investigate if the same conditions can be held for the power of significant tests between logistic regression and offset poisson regression. The result is that when offset size is large for rare events offset poisson regression has a similar power to logistic regression, but it has an acceptable power even with a moderate prevalence rate. However, with a small offset size (< 10), offset poisson regression should be used with caution for rare events or common events. These results would be good guidelines for users who want to use offset poisson regression models for binary data.

Keywords

References

  1. Agresti, A. (1996). An Introduction to Categorical Data Analysis, New York
  2. Breslow, N. E. and Clayton, D. G. (1993). Approximate inference in generalized linear mixed models, Journal of the American Statistical Association, 88, 9-25. https://doi.org/10.2307/2290687
  3. Cornfield, J. (1956). A statistical problem arising from retrospective studies, Proceedings of the Third Berkeley Symposium, 4, 135-148.
  4. McCullagh, P. and Nelder, J. (1989). Generalized Linear Models, Chapman and Hall, London.
  5. McNutt, L.,Wu, C., Xue, X. and Hafner, J. P. (2003). Estimating the relative risk in cohort studies and clinical trials of common outcomes, American Journal of Epidemiology, 157, 940-943. https://doi.org/10.1093/aje/kwg074
  6. Nelder, J. and Wedderburn, R. (1972). Generalized linear models, Journal of the Royal Statistical Society, Series A, 135, 370-384. https://doi.org/10.2307/2344614
  7. Petersen, M. R. and Deddens, J. A. (2008). A comparison of two methods for estimating prevalence ratios, BMC Medical Research Methodology, 8, 1-9. https://doi.org/10.1186/1471-2288-8-1
  8. SAS Institute (2008). SAS/STAT 9.2 User's Guide: The GENMOD Procedure, Cary, NC.
  9. Spiegelman, D. and Hertzmark, E. (2005). Easy SAS calculations for risk or prevalence ratios and differences, American Journal of Epidemiology, 162, 199-200. https://doi.org/10.1093/aje/kwi188
  10. Thall, P. F. and Vail, S. C. (1990). Some covariance models for longitudinal count data with overdispersion, Biometrics, 46, 657-671. https://doi.org/10.2307/2532086
  11. Zhang, J. and Yu, K. F. (1998). What's the relative risk?, JAMA, 280, 1690-1691. https://doi.org/10.1001/jama.280.19.1690
  12. Zou, G. (2004). A modified poisson regression approach to prospective studies with binary data, American Journal of Epidemiology, 159, 702-706. https://doi.org/10.1093/aje/kwh090