• Title/Summary/Keyword: zero inflated Poisson

Search Result 51

Analysis of Food Poisoning via Zero Inflation Models

  • Jung, Hwan-Sik;Kim, Byung-Jip;Cho, Sin-Sup;Yeo, In-Kwon
    • The Korean Journal of Applied Statistics / v.25 no.5 / pp.859-864 / 2012
  • Poisson regression and negative binomial regression are commonly used to analyze count data; however, these models are unsuitable for fitting zero-inflated data that contain more zero-valued observations than expected. In this paper, we review zero-inflated regression, in which a Bernoulli process and a counting process are hierarchically mixed. Zero-inflated regression is known to handle the over-dispersion problem efficiently. The Vuong statistic is employed to compare the performance of the zero-inflated models with that of other standard models.
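
The hierarchical mixture and the Vuong comparison described in this abstract can be sketched as follows. This is a minimal illustration, not the paper's code: the function names, the toy counts, and the hand-picked parameter values are all assumptions, and a real comparison would plug in fitted maximum likelihood estimates.

```python
import math

def poisson_pmf(y, lam):
    """Poisson probability P(Y = y) with mean lam."""
    return math.exp(-lam) * lam ** y / math.factorial(y)

def zip_pmf(y, pi, lam):
    """Zero-inflated Poisson probability: a structural zero with
    probability pi, otherwise a Poisson(lam) count."""
    base = (1.0 - pi) * poisson_pmf(y, lam)
    return pi + base if y == 0 else base

def vuong_statistic(loglik_a, loglik_b):
    """Vuong statistic from per-observation log-likelihoods of two
    non-nested models; large positive values favour model A."""
    m = [a - b for a, b in zip(loglik_a, loglik_b)]
    n = len(m)
    mean = sum(m) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in m) / (n - 1))
    return math.sqrt(n) * mean / sd

# Toy counts with many zeros (made up for illustration only).
counts = [0] * 60 + [1] * 10 + [2] * 15 + [3] * 10 + [4] * 5
# Assumed parameter values, chosen by hand rather than fitted.
ll_zip = [math.log(zip_pmf(y, 0.53, 1.92)) for y in counts]
ll_pois = [math.log(poisson_pmf(y, 0.9)) for y in counts]
v = vuong_statistic(ll_zip, ll_pois)
print(f"Vuong statistic (ZIP vs Poisson): {v:.2f}")
```

Under the usual convention the statistic is compared with standard normal quantiles, so a value well above 1.96 would favour the ZIP model on these toy counts.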

The Reanalysis of the Donation Data Using the Zero-Inflated Poisson Regression (0이 팽창된 포아송 회귀모형을 이용한 기부회수 자료의 재분석)

  • Kim, In-Young;Park, Tae-Kyu;Kim, Byung-Soo
    • The Korean Journal of Applied Statistics / v.22 no.4 / pp.819-827 / 2009
  • Kim et al. (2006) analyzed donation data surveyed by Volunteer 21 in South Korea in 2002, using a Poisson regression based on a mixture of two Poissons, and identified variables that significantly affect the number of donations. However, noting the large deviation between the predicted and the actual frequencies of zero, we develop in this note a Poisson regression model based on a distribution in which a zero-inflation component is added to the mixture of two Poissons. The population distribution is thus a mixture of three Poissons in which one component is a point mass at zero. We used the EM algorithm to estimate the regression parameters and found the same variables as Kim et al. (2006) to significantly affect the response. In addition, we could estimate the proportion of the fixed zero group to be 0.201, a distinguishing feature of this model. We also noted that, of the two significant variables, income and volunteer experience (yes, no), the second could be used as a strategic variable for promoting donations.
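
The EM idea mentioned in this abstract can be illustrated on a much simpler case. The sketch below assumes a plain zero-inflated Poisson with a single Poisson component and no covariates, which is a deliberate simplification of the paper's three-component regression model; the function name `fit_zip_em` and the toy data are hypothetical.

```python
import math

def fit_zip_em(counts, max_iter=10000, tol=1e-12):
    """EM estimates (pi, lam) for a covariate-free ZIP model.

    pi  : probability of a structural ("fixed") zero
    lam : mean of the Poisson component
    """
    n = len(counts)
    n_zero = sum(1 for y in counts if y == 0)
    total = sum(counts)
    pi, lam = 0.5, max(total / n, 1e-8)  # crude starting values
    for _ in range(max_iter):
        # E-step: probability that an observed zero is structural.
        p_struct = pi / (pi + (1.0 - pi) * math.exp(-lam))
        expected_struct = n_zero * p_struct
        # M-step: update the mixing proportion and the Poisson mean.
        new_pi = expected_struct / n
        new_lam = total / (n - expected_struct)
        converged = abs(new_pi - pi) + abs(new_lam - lam) < tol
        pi, lam = new_pi, new_lam
        if converged:
            break
    return pi, lam

# Toy counts (made up for illustration, not the donation data).
toy_counts = [0] * 60 + [1] * 10 + [2] * 15 + [3] * 10 + [4] * 5
pi_hat, lam_hat = fit_zip_em(toy_counts)
print(f"pi = {pi_hat:.3f}, lambda = {lam_hat:.3f}")
```

At convergence the fitted zero probability matches the observed share of zeros, which is the kind of agreement between predicted and actual zero frequencies that motivates adding a zero component.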

Zero-Inflated Poisson Model with a Change-point (변화시점이 있는 영과잉-포아송모형)

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society / v.9 no.1 / pp.1-9 / 1998
  • For the zero-inflated Poisson model with a change-point, the likelihood ratio test statistic was used to test the hypothesis of a change-point. The change-point and several parameters of interest were estimated by the method of moments and by maximum likelihood. The estimators were compared using the empirical mean squared error. Real data were examined under the zero-inflated Poisson model with a change-point and the Poisson model without a change-point.

A GLR Chart for Monitoring a Zero-Inflated Poisson Process (ZIP 공정을 관리하는 GLR 관리도)

  • Choi, Mi Lim;Lee, Jaeheon
    • The Korean Journal of Applied Statistics / v.27 no.2 / pp.345-355 / 2014
  • The number of nonconformities in a unit is commonly modeled by a Poisson distribution. As an extension of the Poisson distribution, a zero-inflated Poisson (ZIP) process can be used to fit count data with an excessive number of zeros. In this paper, we propose a generalized likelihood ratio (GLR) chart to monitor shifts in the two parameters of the ZIP process. We also compare the proposed GLR chart with the combined cumulative sum (CUSUM) chart and the single CUSUM chart. The overall performance of the GLR chart is shown to be comparable to that of the CUSUM charts, and significantly better in some cases where the actual directions of the shifts differ from the directions pre-specified in the CUSUM charts.

A Bayesian joint model for continuous and zero-inflated count data in developmental toxicity studies

  • Hwang, Beom Seuk
    • Communications for Statistical Applications and Methods / v.29 no.2 / pp.239-250 / 2022
  • In many applications, we encounter multiple correlated outcomes measured on the same subject. Joint modeling of such outcomes can improve the efficiency of inference compared with modeling them independently. For instance, in developmental toxicity studies, fetal weight and the number of malformed pups are measured on pregnant dams exposed to different levels of a toxic substance, and the association between these outcomes should be taken into account in the model. The number of malformations may contain many zeros, which should be analyzed with zero-inflated count models. Motivated by applications in developmental toxicity studies, we propose a Bayesian joint modeling framework for continuous and count outcomes with excess zeros. In our model, a zero-inflated Poisson (ZIP) regression model describes the count data, and subject-specific random effects account for the correlation between the two outcomes. We implement a Bayesian approach using an MCMC procedure with data augmentation and adaptive rejection sampling. We apply the proposed model to a dose-response analysis in a developmental toxicity study to estimate the benchmark dose for risk assessment.

Developing the Accident Models of Cheongju Arterial Link Sections Using ZAM Model (ZAM 모형을 이용한 청주시 간선가로 구간의 사고모형 개발)

  • Park, Byung-Ho;Kim, Jun-Yong
    • International Journal of Highway Engineering / v.12 no.2 / pp.43-49 / 2010
  • This study deals with traffic accidents on the arterial link sections of Cheongju. Its purpose is to develop a traffic accident model. To this end, the study focuses on developing zero-altered models (ZAM) using accident data for arterial roads divided into 322 small link sections. The main results of the analysis with the zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB) models, both of which are ZAM methods, are as follows. First, evaluating the developed models by the Vuong statistic and the t statistic for the overdispersion parameter $\alpha$ shows that ZINB is the best among the Poisson, NB, ZIP and ZINB regression models. Second, ZINB is found to be statistically significant in terms of its t, $\rho$ and $\rho^2$ (0.63) values compared with the other models. Finally, the accident factors in the ZINB model are the traffic volume (ADT), the number of entries/exits and the length of the median. The traffic volume (ADT) and the number of entries/exits are positive factors, and the length of the median is a negative factor, for accidents.

Modelling Count Responses with Overdispersion

  • Jeong, Kwang Mo
    • Communications for Statistical Applications and Methods / v.19 no.6 / pp.761-770 / 2012
  • We frequently encounter count outcomes that have extra variation. This paper considers several alternative models for overdispersed count responses, such as a quasi-Poisson model, a zero-inflated Poisson model and a negative binomial model, with a special focus on a generalized linear mixed model. We also explain various goodness-of-fit criteria, discussing when each is applicable and cautioning against misuse depending on the pattern of the response categories. The overdispersion models for count data are illustrated through two examples with different response patterns.
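
Why a zero-inflated Poisson model counts as an overdispersion model can be seen from its first two moments. The derivation below assumes the usual parameterization, with zero-inflation probability $\pi$ and Poisson mean $\lambda$; these symbols are introduced here for illustration and are not taken from the paper.

```latex
\begin{aligned}
P(Y = 0) &= \pi + (1 - \pi)\,e^{-\lambda}, \\
P(Y = y) &= (1 - \pi)\,\frac{e^{-\lambda}\lambda^{y}}{y!}, \qquad y = 1, 2, \dots \\
E[Y] &= (1 - \pi)\,\lambda, \\
E[Y^{2}] &= (1 - \pi)\,(\lambda + \lambda^{2}), \\
\operatorname{Var}(Y) &= E[Y^{2}] - E[Y]^{2} = (1 - \pi)\,\lambda\,(1 + \pi\lambda), \\
\frac{\operatorname{Var}(Y)}{E[Y]} &= 1 + \pi\lambda \;\ge\; 1 .
\end{aligned}
```

The variance-to-mean ratio exceeds 1 whenever $\pi > 0$ and $\lambda > 0$, whereas a plain Poisson model forces it to equal 1.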

A Bayesian zero-inflated Poisson regression model with random effects with application to smoking behavior (랜덤효과를 포함한 영과잉 포아송 회귀모형에 대한 베이지안 추론: 흡연 자료에의 적용)

  • Kim, Yeon Kyoung;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics / v.31 no.2 / pp.287-301 / 2018
  • Count data with excess zeros are common in research fields such as the social sciences, natural sciences, medical science and engineering. Such data have mainly been modeled with the zero-inflated Poisson model and its extensions. Zero-inflated count data are also often correlated or clustered, in which case random effects should be included in the model. Frequentist approaches have commonly been used to fit such data. However, a Bayesian approach has the advantages of incorporating prior information, avoiding asymptotic approximations and allowing practical estimation of functions of the parameters. We consider a Bayesian zero-inflated Poisson regression model with random effects for correlated zero-inflated count data. We conducted simulation studies to check the performance of the proposed model. We also applied the proposed model to smoking behavior data from the Regional Health Survey (2015) of the Korea Centers for Disease Control and Prevention.

Soccer goal distributions in K-league (K-리그에서 축구 골의 분포)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society / v.25 no.6 / pp.1231-1239 / 2014
  • In this paper we analyse the distributions of the number of goals scored by home teams and away teams in K-league soccer matches between 1983 and 2012. The K-league data are modeled using statistical distributions such as the Poisson, negative binomial, extreme value and zero-inflated Poisson distributions. How closely the home and away goals fit the different distributions is tested with chi-square goodness-of-fit tests. According to these tests, the Poisson distribution gives the best fit to the home goals data, whereas the away goals data are best modeled by the zero-inflated Poisson distribution. There is also some weak evidence of dependence between home and away goals.
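
A chi-square goodness-of-fit check of the kind described here can be sketched as follows for the Poisson case. This is a minimal illustration under stated assumptions: the function name and the goal counts are made up, the last cell is treated as an open-ended "k or more" category, and the Poisson mean is estimated from the same table.

```python
import math

def poisson_gof_chi2(observed):
    """Chi-square goodness-of-fit statistic for a Poisson model.

    observed[k] is the number of matches with k goals; the last cell is
    treated as "k or more". Returns (statistic, degrees of freedom).
    """
    n = sum(observed)
    # Sample mean; counts in the last cell are scored as exactly k goals.
    lam = sum(k * o for k, o in enumerate(observed)) / n
    last = len(observed) - 1
    probs = [math.exp(-lam) * lam ** k / math.factorial(k) for k in range(last)]
    probs.append(1.0 - sum(probs))  # pooled upper tail
    expected = [n * p for p in probs]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    dof = len(observed) - 1 - 1  # cells minus 1, minus one estimated parameter
    return chi2, dof

# Hypothetical goal frequencies for 0, 1, 2 and 3+ goals (not K-league data).
toy_observed = [30, 40, 20, 10]
chi2, dof = poisson_gof_chi2(toy_observed)
print(f"chi-square = {chi2:.3f} on {dof} degrees of freedom")
```

The statistic would then be compared with a chi-square distribution with `dof` degrees of freedom; repeating the calculation with zero-inflated Poisson cell probabilities (and one more estimated parameter) gives the competing fit.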

Comparison of probability distributions to analyze the number of occurrence of torrential rainfall events (집중호우사상의 발생횟수 분석을 위한 확률분포의 비교)

  • Kim, Sang Ug;Kim, Hyeung Bae
    • Journal of Korea Water Resources Association / v.49 no.6 / pp.481-493 / 2016
  • A statistical analysis of torrential rainfall, defined as rainfall of more than 80 mm/day, is performed using rainfall data collected in Daegu and Busan over 384 months. The number of torrential rainfall events is usually simulated with the Poisson distribution. However, the Poisson distribution frequently fails to reproduce the statistical characteristics of the observations when the observed data are zero-inflated. Therefore, in this study, the generalized Poisson distribution (GPD), the zero-inflated Poisson distribution (ZIP), the zero-inflated generalized Poisson distribution (ZIGP) and a Bayesian ZIGP model were used to address zero inflation in the torrential rainfall data. In the Bayesian ZIGP model in particular, an informative prior distribution was used to increase accuracy. The results suggest that the Poisson (POI) and GPD models should not be used to fit the frequency of torrential rainfall events, and that the Bayesian ZIGP model with an informative prior gives the most accurate results. Because the Bayesian approach in this study is considerably complex, the ZIP model is recommended as a practical alternative.