• Title/Summary/Keyword: Poisson-gamma mixture

Search Result 4, Processing Time 0.019 seconds

Classification Analysis in Information Retrieval by Using Gauss Patterns

  • Lee, Jung-Jin;Kim, Soo-Kwan
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.1-11
    • /
    • 2002
  • This paper discusses problems of the Poisson Mixture model which Is widely used to decide the effective words in judging relevant document. Gamma Distribution model and Gauss Patterns model as an alternative of the Poisson Mixture model are studied. Classification experiments by using TREC sub-collection, WSJ[1,2] with MGQUERY and AidSearch3.0 system are discussed.

Effects on Regression Estimates under Misspecified Generalized Linear Mixed Models for Counts Data

  • Jeong, Kwang Mo
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.6
    • /
    • pp.1037-1047
    • /
    • 2012
  • The generalized linear mixed model(GLMM) is widely used in fitting categorical responses of clustered data. In the numerical approximation of likelihood function the normality is assumed for the random effects distribution; subsequently, the commercial statistical packages also routinely fit GLMM under this normality assumption. We may also encounter departures from the distributional assumption on the response variable. It would be interesting to investigate the impact on the estimates of parameters under misspecification of distributions; however, there has been limited researche on these topics. We study the sensitivity or robustness of the maximum likelihood estimators(MLEs) of GLMM for counts data when the true underlying distribution is normal, gamma, exponential, and a mixture of two normal distributions. We also consider the effects on the MLEs when we fit Poisson-normal GLMM whereas the outcomes are generated from the negative binomial distribution with overdispersion. Through a small scale Monte Carlo study we check the empirical coverage probabilities of parameters and biases of MLEs of GLMM.

Impact of Heterogeneous Dispersion Parameter on the Expected Crash Frequency (이질적 과분산계수가 기대 교통사고건수 추정에 미치는 영향)

  • Shin, Kangwon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.9
    • /
    • pp.5585-5593
    • /
    • 2014
  • This study tested the hypothesis that the significance of the heterogeneous dispersion parameter in safety performance function (SPF) used to estimate the expected crashes is affected by the endogenous heterogeneous prior distributions, and analyzed the impacts of the mis-specified dispersion parameter on the evaluation results for traffic safety countermeasures. In particular, this study simulated the Poisson means based on the heterogeneous dispersion parameters and estimated the SPFs using both the negative binomial (NB) model and the heterogeneous negative binomial (HNB) model for analyzing the impacts of the model mis-specification on the mean and dispersion functions in SPF. In addition, this study analyzed the characteristics of errors in the crash reduction factors (CRFs) obtained when the two models are used to estimate the posterior means and variances, which are essentially estimated through the estimated hyper-parameters in the heterogeneous prior distributions. The simulation study results showed that a mis-estimation on the heterogeneous dispersion parameters through the NB model does not affect the coefficient of the mean functions, but the variances of the prior distribution are seriously mis-estimated when the NB model is used to develop SPFs without considering the heterogeneity in dispersion. Consequently, when the NB model is used erroneously to estimate the prior distributions with heterogeneous dispersion parameters, the mis-estimated posterior mean can produce large errors in CRFs up to 120%.

Comparative Study on the Estimation Methods of Traffic Crashes: Empirical Bayes Estimate vs. Observed Crash (교통사고 추정방법 비교 연구: 경험적 베이즈 추정치 vs. 관측교통사고건수)

  • Shin, Kangwon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.5D
    • /
    • pp.453-459
    • /
    • 2010
  • In the study of traffic safety, it is utmost important to obtain more reliable estimates of the expected crashes for a site (or a segment). The observed crashes have been mainly used as the estimate of the expected crashes in Korea, while the empirical Bayes (EB) estimates based on the Poisson-gamma mixture model have been used in the USA and several European countries. Although numerous studies have used the EB method for estimating the expected crashes and/or the effectiveness of the safety countermeasures, no past studies examine the difference in the estimation errors between the two estimates. Thus, this study compares the estimation errors of the two estimates using a Monte Carlo simulation study. By analyzing the crash dataset at 3,000,000 simulated sites, this study reveals that the estimation errors of the EB estimates are always less than those of the observed crashes. Hence, it is imperative to incorporate the EB method into the traffic safety research guideline in Korea. However, the results show that the differences in the estimation errors between the two estimates decrease as the uncertainty of the prior distribution increases. Consequently, it is recommended that the EB method be used with reliable hyper-parameter estimates after conducting a comprehensive examination on the estimated negative binomial model.