• Title/Summary/Keyword: binomial data

Search Result 340, Processing Time 0.035 seconds

Effects of Overdispersion on Testing for Serial Dependence in the Time Series of Counts Data

  • Kim, Hee-Young;Park, You-Sung
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.6
    • /
    • pp.829-843
    • /
    • 2010
  • To test for the serial dependence in time series of counts data, Jung and Tremayne (2003) evaluated the size and power of several tests under the class of INARMA models based on binomial thinning operations for Poisson marginal distributions. The overdispersion phenomenon(i.e., a variance greater than the expectation) is common in the real world. Overdispersed count data can be modeled by using alternative thinning operations such as random coefficient thinning, iterated thinning, and quasi-binomial thinning. Such thinning operations can lead to time series models of counts with negative binomial or generalized Poisson marginal distributions. This paper examines whether the test statistics used by Jung and Tremayne (2003) on serial dependence in time series of counts data are affected by overdispersion.

A Binomial Sampling Plans for Aphis gossypii (Hemiptera: Aphididae) in Greenhouse Cultivation of Cucumbers

  • Kang, Taek Jun;Park, Jung-Joon;Cho, Kijong;Lee, Joon-Ho
    • Horticultural Science & Technology
    • /
    • v.30 no.5
    • /
    • pp.596-602
    • /
    • 2012
  • Infestations of Aphis gossypii per leaf in greenhouse cultivation of cucumbers were investigated to develop binomial sampling plans. An empirical $P_T-m$ model, $ln(m)={\alpha}+{\beta}ln[-ln(1-P_T)]$, was used to evaluate relationship between the proportion of infested leaves with ${\leq}$ T aphids per leaf ($P_T$) and mean aphid density (m). Tally thresholds (T) were set to 1, 3, 5, 7, and 9 aphids per leaf to find appropriate T in greenhouse cultivation of cucumbers. Increasing sample size had little effect on the precision of the binomial sampling plan. However, the precision increased with tally threshold. The binomial model with T = 5 provided appropriate predictions of the mean densities of A. gossypii in the greenhouse cultivation of cucumbers. Using a binomial model with T = 5 (sample size = 200), a wide range of densities (1.2 - 222.8 aphids per leaf) could be estimated with precision levels of 0.346 - 0.380 for $P_T$ values between 0.15 and 0.96. Binomial models were validated at T = 5 and 7 using 12 independent data sets. Both binomial models were robust and adequately described aphid densities; most of the independent sampling data fell within 95% confidence intervals around the prediction model.

Mixed Effects Kernel Binomial Regression

  • Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.19 no.4
    • /
    • pp.1327-1334
    • /
    • 2008
  • Mixed effect binomial regression models are widely used for analysis of correlated count data in which the response is the result of a series of one of two possible disjoint outcomes. In this paper, we consider kernel extensions with nonparametric fixed effects and parametric random effects. The estimation is through the penalized likelihood method based on kernel trick, and our focus is on the efficient computation and the effective hyperparameter selection. For the selection of hyperparameters, cross-validation techniques are employed. Examples illustrating usage and features of the proposed method are provided.

  • PDF

A simple zero inflated bivariate negative binomial regression model with different dispersion parameters

  • Kim, Dongseok
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.4
    • /
    • pp.895-900
    • /
    • 2013
  • In this research, we propose a simple bivariate zero inflated negative binomial regression model with different dispersion for bivariate count data with excess zeros. An application to the demand for health services shows that the proposed model is better than existing models in terms of log-likelihood and AIC.

Tests for homogeneity of proportions in clustered binomial data

  • Jeong, Kwang Mo
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.5
    • /
    • pp.433-444
    • /
    • 2016
  • When we observe binary responses in a cluster (such as rat lab-subjects), they are usually correlated to each other. In clustered binomial counts, the independence assumption is violated and we encounter an extra-variation. In the presence of extra-variation, the ordinary statistical analyses of binomial data are inappropriate to apply. In testing the homogeneity of proportions between several treatment groups, the classical Pearson chi-squared test has a severe flaw in the control of Type I error rates. We focus on modifying the chi-squared statistic by incorporating variance inflation factors. We suggest a method to adjust data in terms of dispersion estimate based on a quasi-likelihood model. We explain the testing procedure via an illustrative example as well as compare the performance of a modified chi-squared test with competitive statistics through a Monte Carlo study.

Development of a p Control Chart for Overdispersed Process with Beta-Binomial Model (베타-이항모형을 이용한 과산포 공정용 p 관리도의 개발)

  • Bae, Bong-Soo;Seo, Sun-Keun
    • Journal of Korean Society for Quality Management
    • /
    • v.45 no.2
    • /
    • pp.209-225
    • /
    • 2017
  • Purpose: Since traditional p chart is unable to deal with the variation of attribute data, this paper proposes a new attribute control chart for nonconforming proportions incorporating overdispersion with a beta-binomial model. Methods: Statistical theories for control chart developed under the beta-binomial model and a new approach using this control chart are presented Results: False alarm probabilities of p chart with the beta-binomial model are evaluated and demerits of p chart under overdispersion are discussed from three examples. Hence a concrete procedure for the proposed control chart is provided and illustrated with examples Conclusion: The proposed chart is more useful than traditional p chart, individual chart to treat observed proportions nonconforming as variable data and Laney p' chart.

Coherent Forecasting in Binomial AR(p) Model

  • Kim, Hee-Young;Park, You-Sung
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.1
    • /
    • pp.27-37
    • /
    • 2010
  • This article concerns the forecasting in binomial AR(p) models which is proposed by Wei$\ss$ (2009b) for time series of binomial counts. Our method extends to binomial AR(p) models a recent result by Jung and Tremayne (2006) for integer-valued autoregressive model of second order, INAR(2), with simple Poisson innovations. Forecasts are produced by conditional median which gives 'coherent' forecasts, and we estimate the forecast distributions of future values of binomial AR(p) models by means of a Monte Carlo method allowing for parameter uncertainty. Model parameters are estimated by the method of moments and estimated standard errors are calculated by means of block of block bootstrap. The method is fitted to log data set used in Wei$\ss$ (2009b).

Ex-ante and Ex-post Economic Value Analysis on Ecological River Restoration Project (생태하천복원사업 전후 경제적 가치 비교분석)

  • Lee, Yoon;Chang, Hoon;Yoon, Taeyeon;Chung, Young-Keun;Park, Heeyoung
    • Journal of the Korean Regional Science Association
    • /
    • v.31 no.3
    • /
    • pp.39-54
    • /
    • 2015
  • To assess an economic value of Cheonggyecheon river restoration project, an in-depth exit survey data was collected to apply travel cost method in this study. Poisson model, Negative Binomial, Zero-truncated Poisson, and Zero-truncated Negative Binomial model were executed due to the nature of count data. Empirical results showed that regressors were statistically significant and corresponded to general consumer theory. Since our survey data showed over-dispersion, Zero-truncated Negative Binomial was selected as an optimal one to analyze travel demand of Cheonggyecheon by model goodness of fit test among those aforementioned empirical models. Estimating an economic value of Cheonggyecheon river restoration project, which is known as an ecological river restoration project, we used annual visit of individual traveler and an optimal model. Suffice to say that the annual economic value of Cheonggyecheon river restoration project was estimated as 193.4 billion won in 2013.

Hierarchical Bayesian Inference of Binomial Data with Nonresponse

  • Han, Geunshik;Nandram, Balgobin
    • Journal of the Korean Statistical Society
    • /
    • v.31 no.1
    • /
    • pp.45-61
    • /
    • 2002
  • We consider the problem of estimating binomial proportions in the presence of nonignorable nonresponse using the Bayesian selection approach. Inference is sampling based and Markov chain Monte Carlo (MCMC) methods are used to perform the computations. We apply our method to study doctor visits data from the Korean National Family Income and Expenditure Survey (NFIES). The ignorable and nonignorable models are compared to Stasny's method (1991) by measuring the variability from the Metropolis-Hastings (MH) sampler. The results show that both models work very well.

A binomial CUSUM chart for monitoring type I right-censored Weibull lifetimes (제1형의 우측중도절단된 와이블 수명자료를 관리하는 이항 누적합 관리도)

  • Choi, Min-jae;Lee, Jaeheon
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.5
    • /
    • pp.823-833
    • /
    • 2016
  • The lifetime is a key characteristic of product quality. It is best to obtain the lifetime data of all samples, but they are often censored due to time or expense limitations. In this paper, we propose a binomial cumulative sum (CUSUM) chart to monitor the mean of type I right-censored Weibull lifetime data, for a xed value of the Weibull shape parameter. We compare the performance of the proposed binomial CUSUM chart with CUSUM charts studied previously using the steady-state average run length (ARL). The results show that the performance of the binomial CUSUM chart is better when the censoring rate is high and/or the sample size is small.