DOI QR코드

DOI QR Code

Modeling clustered count data with discrete weibull regression model

  • Yoo, Hanna (Department of Big Data, Busan University of Foreign Studies)
  • Received : 2021.11.22
  • Accepted : 2022.01.13
  • Published : 2022.07.31

Abstract

In this study we adapt discrete weibull regression model for clustered count data. Discrete weibull regression model has an attractive feature that it can handle both under and over dispersion data. We analyzed the eighth Korean National Health and Nutrition Examination Survey (KNHANES VIII) from 2019 to assess the factors influencing the 1 month outpatient stay in 17 different regions. We compared the results using clustered discrete Weibull regression model with those of Poisson, negative binomial, generalized Poisson and Conway-maxwell Poisson regression models, which are widely used in count data analyses. The results show that the clustered discrete Weibull regression model using random intercept model gives the best fit. Simulation study is also held to investigate the performance of the clustered discrete weibull model under various dispersion setting and zero inflated probabilities. In this paper it is shown that using a random effect with discrete Weibull regression can flexibly model count data with various dispersion without the risk of making wrong assumptions about the data dispersion.

Keywords

Acknowledgement

This work was supported by the research grant of the Busan University of Foreign Studies in 2021.

References

  1. Alan B and Nial F (2021). Bayesian inference, model selection and likelihood estimation using fast rejection sampling: The Conway-Maxwell-Poisson distribution, Bayesian Analysis, 16, 905-931.
  2. Albert J (1992). A Bayesian analysis of a Poisson random effects model for home run hitters, The American Statistician, 46, 246-253.
  3. Barbiero A (2019). A bivariate count model with discrete Weibull margins, Mathematics and Computers in Simulation, 156, 91-109. https://doi.org/10.1016/j.matcom.2018.07.003
  4. Dunlop D (1994). Regression for longitudinal data: A bridge from least squares regression, The American Statistician, 48, 299-303.
  5. Hall DB (2000). Zero-inflated Poisson and binomial regression with random effects: A case study, Biometrics, 56, 1030-1039. https://doi.org/10.1111/j.0006-341X.2000.01030.x
  6. Klakattawi HS, Vinciotti V, and Yu K (2018). A simple and adaptive dispersion regression model for count data, Entropy, 20, 142.
  7. Kulasekera K (1994). Approximate MLE's of the parameters of a discrete Weibull distribution with type 1 censored data.Microelectron, Reliab, 34, 1185-1188. https://doi.org/10.1016/0026-2714(94)90502-9
  8. Lawless J (1987). Negative binomial and mixed Poisson regression, Canadian Journal of Statistics, 15, 209-225. https://doi.org/10.2307/3314912
  9. Lee S (2004). Development of a Sustainable Health Management System for Management of Chronic Diseases. Health and Welfare Policy Forum 1, 72-81.
  10. Lee Y and Nelder AJ (1996).Hierarchical generalized linear models, Journal of the Royal Statistical Society. Series B (Methodological), 58, 619-678. https://doi.org/10.1111/j.2517-6161.1996.tb02105.x
  11. Nakagawa T and Osaki S (1975). The discrete Weibull distribution.IEEE Transactions on Reliability , 24, 300-301. https://doi.org/10.1109/TR.1975.5214915
  12. Yau KKW, Wang K, and Lee AH (2003). Zero-inflated negative binomial mixed regression modeling of over-dispersed count data with extra zeros, Biometrics, 45, 437-452. https://doi.org/10.1002/bimj.200390024
  13. Luyts M, Geert M, Geert V, Koen M, Eduardo R, Clarice D, and John H. (2019). A Weibull-count approach for handling under- and overdispersed longitudinal/clustered data structures, Statistical Modelling, 19, 569-589. https://doi.org/10.1177/1471082X18789992
  14. Michas F (2021). Number of doctor visits per capita in selected countries 2019. Statista, Health Professionals Hospitals. Available from: https://www.statista.com/statistics/236589/number-of-doctor-visits-per-capita-by-country/#statisticContainer