Browse > Article
http://dx.doi.org/10.29220/CSAM.2019.26.3.325

Application of discrete Weibull regression model with multiple imputation  

Yoo, Hanna (Department of Computer Software, Busan University of Foreign Studies)
Publication Information
Communications for Statistical Applications and Methods / v.26, no.3, 2019 , pp. 325-336 More about this Journal
Abstract
In this article we extend the discrete Weibull regression model in the presence of missing data. Discrete Weibull regression models can be adapted to various type of dispersion data however, it is not widely used. Recently Yoo (Journal of the Korean Data and Information Science Society, 30, 11-22, 2019) adapted the discrete Weibull regression model using single imputation. We extend their studies by using multiple imputation also with several various settings and compare the results. The purpose of this study is to address the merit of using multiple imputation in the presence of missing data in discrete count data. We analyzed the seventh Korean National Health and Nutrition Examination Survey (KNHANES VII), from 2016 to assess the factors influencing the variable, 1 month hospital stay, and we compared the results using discrete Weibull regression model with those of Poisson, negative Binomial and zero-inflated Poisson regression models, which are widely used in count data analyses. The results showed that the discrete Weibull regression model using multiple imputation provided the best fit. We also performed simulation studies to show the accuracy of the discrete Weibull regression using multiple imputation given both under- and over-dispersed distribution, as well as varying missing rates and sample size. Sensitivity analysis showed the influence of mis-specification and the robustness of the discrete Weibull model. Using imputation with discrete Weibull regression to analyze discrete data will increase explanatory power and is widely applicable to various types of dispersion data with a unified model.
Keywords
discrete count data; discrete Weibull regression model; missing data; multiple imputation; KNHANES;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Barbiero A (2015). Discrete Weibull: Discrete Weibull Distributions (Type 1 and 3). Available from: http://CRAN.R-project.org/package=DiscreteWeibull. R package version 1.0.1
2 Brand JJPL (1999). Development, Implementation and Evaluation of Multiple Imputation Strategies for the Statistical Analysis of Incomplete Data Sets, Erasmus University, Rotterdam.
3 Chanialidis C, Evers L, Neocleous T, and Nobile A (2018). Efficient Bayesian inference for COM-Poisson regression models, Statistics and Computing, 28, 595-608.   DOI
4 Consul P and F Famoye (1992). Generalized Poisson regression model, Communications in Statistics-Theory and Methods, 21, 89-109.   DOI
5 Englehardt JD and R Li (2011). The discrete Weibull distribution: an application for correlated counts with confirmation for microbial counts in water, Risk Analysis, 31, 370-381.   DOI
6 Khan MA, Khalique A, and Abouammoh A (1989). On estimating parameters in a discrete Weibull distribution, IEEE Transactions on Reliability, 38, 348-350.   DOI
7 Kim TI, Choi YY, and Lee KH (2008). Analysis on the differences in medical service usage in terms of income Levels, Korean Social Security Studies, 24, 53-75.
8 Klakattawi HS, Vinciotti V, and Yu K (2018). A simple and adaptive dispersion regression model for count data, Entropy, 20, 142.   DOI
9 Kleinke K and Reinecke J (2013). Multiple imputation of incomplete zero-inflated count data, Statistica Neerlandica, 67, 311-336.   DOI
10 Kulasekera K (1994). Approximate MLE's of the parameters of a discrete Weibull distribution with type 1 censored data. Microelectron, Reliab, 34, 1185-1188.   DOI
11 Lee YC, Im BH, and Park YH (2010). The determinants and comparison of health behavior and health service by private medical insurance on National Health-Nutrition Survey, Journal of the Korea Contents Association, 10, 190-204.
12 Nakagawa T and Osaki S (1975). The discrete Weibull distribution, IEEE Transactions on Reliability, R-24.
13 Pahel BT, Presisser JS, Stearns SC, and Rozier RG (2011). Multiple imputation of dental caries data using a zero inflated Poisson regression model, Journal of Public Health Dental, 71, 71-78.   DOI
14 Peluso A and Vinciotti V (2018). Discrete weibull generalised additive model: an application to count fertility data, Journal of the Royal Statistical Society. Series C (Applied Statistics), arXiv:1801.0790.
15 Rubin DB (1987). Multiple Imputation for Nonresponse in Surveys, John Wiley & Sons, New York.
16 Saez-Castillo A and Conde-Sanchez A (2013). A hyper-Poisson regression model for overdispersed and underdispersed count data, Computational Statistics & Data Analysis, 61, 148-157.   DOI
17 Saffari SE and Adnan R (2010). Zero-inflated Poisson regression models with right censored count data, Mathematika, 27, 21-29.
18 Sellers KF and Shmueli G (2010). A flexible regression model for count data, Annals of Applied Statistics, 4, 943-961.   DOI
19 van Buuren S (2007). Multiple imputation of discrete and continuous data by fully conditional specification, Statistical Methods in Medical Research, 16, 219-242.   DOI
20 van Buuren S, Boshuizen HC, and Knook DL (1999). Multiple imputation of missing blood pressure covariates in survival analysis, Statistics in Medicine, 18, 681-694.   DOI
21 Willmot GE (1987). The Poisson-inverse Gaussian distribution as an alternative to the negative Binomial, Scandinavian Actuarial Journal, 113-127.
22 Yoo H (2019). A study of discrete Weibull regression model with missing data, Journal of the Korean Data and Information Science Society, 30, 11-22.   DOI