Application of discrete Weibull regression model with multiple imputation

  • Yoo, Hanna (Department of Computer Software, Busan University of Foreign Studies)
  • Received : 2019.03.20
  • Accepted : 2019.04.29
  • Published : 2019.05.31

Abstract

In this article we extend the discrete Weibull regression model to settings with missing data. The discrete Weibull regression model can accommodate count data with various types of dispersion; however, it is not widely used. Recently, Yoo (Journal of the Korean Data and Information Science Society, 30, 11-22, 2019) applied the discrete Weibull regression model with single imputation. We extend that study by using multiple imputation under several settings and compare the results. The purpose of this study is to demonstrate the merit of multiple imputation when discrete count data contain missing values. We analyzed the 2016 data from the seventh Korean National Health and Nutrition Examination Survey (KNHANES VII) to assess the factors influencing the variable 1-month hospital stay, and we compared the results of the discrete Weibull regression model with those of the Poisson, negative binomial, and zero-inflated Poisson regression models, which are widely used in count data analyses. The results showed that the discrete Weibull regression model with multiple imputation provided the best fit. We also performed simulation studies to show the accuracy of discrete Weibull regression with multiple imputation under both under- and over-dispersed distributions, as well as under varying missing rates and sample sizes. A sensitivity analysis showed the influence of misspecification and the robustness of the discrete Weibull model. Using imputation with discrete Weibull regression to analyze discrete data can increase explanatory power, and the approach is widely applicable to various types of dispersion data within a unified model.
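
The two ingredients named in the abstract can be illustrated with a minimal sketch. It assumes the type 1 discrete Weibull distribution of Nakagawa and Osaki (1975), with probability mass P(Y = y) = q^(y^β) − q^((y+1)^β) for y = 0, 1, 2, ..., and the standard pooling rules of Rubin (1987); the abstract itself does not state which parameterization or pooling formula the article uses. The function names `dw_pmf`, `dw_mean_var`, and `rubin_pool` are hypothetical and are not taken from the article or from any package.

```python
def dw_pmf(y, q, beta):
    """Type 1 discrete Weibull pmf (assumed parameterization), 0 < q < 1, beta > 0."""
    return q ** (y ** beta) - q ** ((y + 1) ** beta)


def dw_mean_var(q, beta, upper=200):
    """Approximate mean and variance by summing the pmf over 0..upper.

    The cutoff `upper` is an illustrative choice; the neglected tail mass
    is q ** ((upper + 1) ** beta).
    """
    probs = [dw_pmf(y, q, beta) for y in range(upper + 1)]
    mean = sum(y * p for y, p in enumerate(probs))
    second = sum(y * y * p for y, p in enumerate(probs))
    return mean, second - mean ** 2


def rubin_pool(estimates, variances):
    """Pool m point estimates and their variances from m imputed data sets.

    Returns the pooled estimate and total variance W + (1 + 1/m) * B,
    where W is the mean within-imputation variance and B is the
    between-imputation variance.
    """
    m = len(estimates)
    pooled = sum(estimates) / m
    within = sum(variances) / m
    between = sum((e - pooled) ** 2 for e in estimates) / (m - 1)
    return pooled, within + (1 + 1 / m) * between


# beta = 1 reduces to the geometric distribution (over-dispersed);
# a larger beta concentrates the mass and gives under-dispersion.
mean_over, var_over = dw_mean_var(q=0.5, beta=1.0)
mean_under, var_under = dw_mean_var(q=0.5, beta=3.0)

# Hypothetical coefficient estimates from m = 3 imputed data sets.
pooled_est, pooled_var = rubin_pool([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
```

In this sketch a single pair (q, β) produces variance above the mean when β = 1 and below the mean when β = 3, which is the sense in which one model can cover both kinds of dispersion. A regression version would additionally link q (or β) to covariates, which is not shown here.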

References

  1. Barbiero A (2015). DiscreteWeibull: Discrete Weibull Distributions (Type 1 and 3), R package version 1.0.1. Available from: http://CRAN.R-project.org/package=DiscreteWeibull
  2. Brand JJPL (1999). Development, Implementation and Evaluation of Multiple Imputation Strategies for the Statistical Analysis of Incomplete Data Sets, Erasmus University, Rotterdam.
  3. Chanialidis C, Evers L, Neocleous T, and Nobile A (2018). Efficient Bayesian inference for COM-Poisson regression models, Statistics and Computing, 28, 595-608. https://doi.org/10.1007/s11222-017-9750-x
  4. Consul P and Famoye F (1992). Generalized Poisson regression model, Communications in Statistics-Theory and Methods, 21, 89-109. https://doi.org/10.1080/03610929208830766
  5. Englehardt JD and Li R (2011). The discrete Weibull distribution: an application for correlated counts with confirmation for microbial counts in water, Risk Analysis, 31, 370-381. https://doi.org/10.1111/j.1539-6924.2010.01520.x
  6. Khan MA, Khalique A, and Abouammoh A (1989). On estimating parameters in a discrete Weibull distribution, IEEE Transactions on Reliability, 38, 348-350. https://doi.org/10.1109/24.44179
  7. Kim TI, Choi YY, and Lee KH (2008). Analysis on the differences in medical service usage in terms of income levels, Korean Social Security Studies, 24, 53-75.
  8. Klakattawi HS, Vinciotti V, and Yu K (2018). A simple and adaptive dispersion regression model for count data, Entropy, 20, 142. https://doi.org/10.3390/e20020142
  9. Kleinke K and Reinecke J (2013). Multiple imputation of incomplete zero-inflated count data, Statistica Neerlandica, 67, 311-336. https://doi.org/10.1111/stan.12009
  10. Kulasekera K (1994). Approximate MLE's of the parameters of a discrete Weibull distribution with type 1 censored data, Microelectronics Reliability, 34, 1185-1188. https://doi.org/10.1016/0026-2714(94)90502-9
  11. Lee YC, Im BH, and Park YH (2010). The determinants and comparison of health behavior and health service by private medical insurance on National Health-Nutrition Survey, Journal of the Korea Contents Association, 10, 190-204.
  12. Nakagawa T and Osaki S (1975). The discrete Weibull distribution, IEEE Transactions on Reliability, R-24.
  13. Pahel BT, Preisser JS, Stearns SC, and Rozier RG (2011). Multiple imputation of dental caries data using a zero-inflated Poisson regression model, Journal of Public Health Dentistry, 71, 71-78. https://doi.org/10.1111/j.1752-7325.2010.00197.x
  14. Peluso A and Vinciotti V (2018). Discrete Weibull generalised additive model: an application to count fertility data, Journal of the Royal Statistical Society. Series C (Applied Statistics), arXiv:1801.0790.
  15. Rubin DB (1987). Multiple Imputation for Nonresponse in Surveys, John Wiley & Sons, New York.
  16. Saez-Castillo A and Conde-Sanchez A (2013). A hyper-Poisson regression model for overdispersed and underdispersed count data, Computational Statistics & Data Analysis, 61, 148-157. https://doi.org/10.1016/j.csda.2012.12.009
  17. Saffari SE and Adnan R (2010). Zero-inflated Poisson regression models with right censored count data, Mathematika, 27, 21-29.
  18. Sellers KF and Shmueli G (2010). A flexible regression model for count data, Annals of Applied Statistics, 4, 943-961. https://doi.org/10.1214/09-AOAS306
  19. van Buuren S (2007). Multiple imputation of discrete and continuous data by fully conditional specification, Statistical Methods in Medical Research, 16, 219-242. https://doi.org/10.1177/0962280206074463
  20. van Buuren S, Boshuizen HC, and Knook DL (1999). Multiple imputation of missing blood pressure covariates in survival analysis, Statistics in Medicine, 18, 681-694. https://doi.org/10.1002/(SICI)1097-0258(19990330)18:6<681::AID-SIM71>3.0.CO;2-R
  21. Willmot GE (1987). The Poisson-inverse Gaussian distribution as an alternative to the negative Binomial, Scandinavian Actuarial Journal, 113-127.
  22. Yoo H (2019). A study of discrete Weibull regression model with missing data, Journal of the Korean Data and Information Science Society, 30, 11-22. https://doi.org/10.7465/jkdi.2019.30.1.11