• 제목/요약/키워드: overdispersed

검색결과 15건 처리시간 0.021초

Negative Binomial Varying Coefficient Partially Linear Models

  • Kim, Young-Ju
    • Communications for Statistical Applications and Methods
    • /
    • 제19권6호
    • /
    • pp.809-817
    • /
    • 2012
  • We propose a semiparametric inference for a generalized varying coefficient partially linear model(VCPLM) for negative binomial data. The VCPLM is useful to model real data in that varying coefficients are a special type of interaction between explanatory variables and partially linear models fit both parametric and nonparametric terms. The negative binomial distribution often arise in modelling count data which usually are overdispersed. The varying coefficient function estimators and regression parameters in generalized VCPLM are obtained by formulating a penalized likelihood through smoothing splines for negative binomial data when the shape parameter is known. The performance of the proposed method is then evaluated by simulations.

Effects of Overdispersion on Testing for Serial Dependence in the Time Series of Counts Data

  • Kim, Hee-Young;Park, You-Sung
    • Communications for Statistical Applications and Methods
    • /
    • 제17권6호
    • /
    • pp.829-843
    • /
    • 2010
  • To test for the serial dependence in time series of counts data, Jung and Tremayne (2003) evaluated the size and power of several tests under the class of INARMA models based on binomial thinning operations for Poisson marginal distributions. The overdispersion phenomenon(i.e., a variance greater than the expectation) is common in the real world. Overdispersed count data can be modeled by using alternative thinning operations such as random coefficient thinning, iterated thinning, and quasi-binomial thinning. Such thinning operations can lead to time series models of counts with negative binomial or generalized Poisson marginal distributions. This paper examines whether the test statistics used by Jung and Tremayne (2003) on serial dependence in time series of counts data are affected by overdispersion.

Sample size calculations for clustered count data based on zero-inflated discrete Weibull regression models

  • Hanna Yoo
    • Communications for Statistical Applications and Methods
    • /
    • 제31권1호
    • /
    • pp.55-64
    • /
    • 2024
  • In this study, we consider the sample size determination problem for clustered count data with many zeros. In general, zero-inflated Poisson and binomial models are commonly used for zero-inflated data; however, in real data the assumptions that should be satisfied when using each model might be violated. We calculate the required sample size based on a discrete Weibull regression model that can handle both underdispersed and overdispersed data types. We use the Monte Carlo simulation to compute the required sample size. With our proposed method, a unified model with a low failure risk can be used to cope with the dispersed data type and handle data with many zeros, which appear in groups or clusters sharing a common variation source. A simulation study shows that our proposed method provides accurate results, revealing that the sample size is affected by the distribution skewness, covariance structure of covariates, and amount of zeros. We apply our method to the pancreas disorder length of the stay data collected from Western Australia.

Tilted beta regression and beta-binomial regression models: Mean and variance modeling

  • Edilberto Cepeda-Cuervo
    • Communications for Statistical Applications and Methods
    • /
    • 제31권3호
    • /
    • pp.263-277
    • /
    • 2024
  • This paper proposes new parameterizations of the tilted beta binomial distribution, obtained from the combination of the binomial distribution and the tilted beta distribution, where the beta component of the mixture is parameterized as a function of their mean and variance. These new parameterized distributions include as particular cases the beta rectangular binomial and the beta binomial distributions. After that, we propose new linear regression models to deal with overdispersed binomial datasets. These new models are defined from the proposed new parameterization of the tilted beta binomial distribution, and assume regression structures for the mean and variance parameters. These new linear regression models are fitted by applying Bayesian methods and using the OpenBUGS software. The proposed regression models are fitted to a school absenteeism dataset and to the seeds germination rate according to the type seed and root.

가산자료모형(Count Data Model)을 이용한 버스이용횟수추정에 관한 연구 (서울시 통근.통학자를 대상으로) (Count Data Model for The Estimation of Bus Ridership (Focusing on Commuters and Students in Seoul))

  • 문진수;김순관;임강원
    • 대한교통학회지
    • /
    • 제17권5호
    • /
    • pp.123-135
    • /
    • 1999
  • 개인교통수단의 선호로 인한 자가용 승용차의 급증은 서울시의 교통혼잡을 가중시키는 주요한 요인이 되고 있다. 이러한 서울시의 교통혼잡을 완화하기 위해서는 대중교통 중심의 교통체계가 구축되어야 하며 승용차 이용자를 대중교통수단으로 유인할 수 있는 대중교통 활성화정책이 필요하다. 이러한 인식하에 버스를 이용하는 통근 및 통학목적 통행자의 버스이용횟수에 대한 개별행태모형을 통하여 버스 이용에 영향을 미치는 요인을 파악함으로써 승용차 이용자를 대중교통수단으로 유인할 수 있는 정책적인 시사점을 도출하고자 하였다. 본 연구의 목적은 일주일간 버스이용횟수 추정에 적합한 가산자료모형의 적용이다. 국내에서는 가산자료모형을 이용한 연구가 많지 않은 실정이며, 또한 모형의 설정시 과산포(overdispersion)에 대한 검정을 통하여 자료에 적합한 모형을 설정하는 것이 중요함에도 불구하고 적절한 검정없이 일반적으로 사용되고 있는 포와송 회귀모형을 주로 사용하여 왔다. 그러나 본 연구에서는 가산자료모형을 선정하기 전에 과산포에 대한 통계적인 검정을 시행한 결과 음이항 회귀모형이 본 연구의 자료에 적합한 것으로 판정되었으며, 모형설정의 중요성을 살펴보기 위하여 음이항 회귀모형을 이용하여 추정한 결과와 포와송 회귀모형을 이용하여 추정한 결과를 비교하여 보았다.

  • PDF