• 제목/요약/키워드: Binomial regression

검색결과 253건 처리시간 0.025초

Tilted beta regression and beta-binomial regression models: Mean and variance modeling

  • Edilberto Cepeda-Cuervo
    • Communications for Statistical Applications and Methods
    • /
    • 제31권3호
    • /
    • pp.263-277
    • /
    • 2024
  • This paper proposes new parameterizations of the tilted beta binomial distribution, obtained from the combination of the binomial distribution and the tilted beta distribution, where the beta component of the mixture is parameterized as a function of their mean and variance. These new parameterized distributions include as particular cases the beta rectangular binomial and the beta binomial distributions. After that, we propose new linear regression models to deal with overdispersed binomial datasets. These new models are defined from the proposed new parameterization of the tilted beta binomial distribution, and assume regression structures for the mean and variance parameters. These new linear regression models are fitted by applying Bayesian methods and using the OpenBUGS software. The proposed regression models are fitted to a school absenteeism dataset and to the seeds germination rate according to the type seed and root.

고속도로 연결로의 교통사고예측모형 개발 (Traffic Crash Prediction Models for Expressway Ramps)

  • 최윤환;오영태;최기주;이철기;윤일수
    • 한국도로학회논문집
    • /
    • 제14권5호
    • /
    • pp.133-143
    • /
    • 2012
  • PURPOSES: Using the collected data for crash, traffic volume, and design elements on ramps between 2007 and 2009, this research effort was initiated to develop traffic crash prediction models for expressway ramps. METHODS: Three negative binomial regression models and three zero-inflated negative binomial regression models were developed for individual ramp types, including direct, semi-direct and loop, respectively. For validating the developed models, authors compared the estimated crash frequencies with actual crash frequencies of twelve randomly selected interchanges, the ramps of which have not been used for model developing. RESULTS: The results show that the negative binomial regression models for direct, semi-direct and loop ramps showed 60.3%, 63.8% and 48.7% error rates on average whereas the zero-inflated negative binomial regression models showed 82.1%, 120.4% and 57.3%, respectively. CONCLUSIONS: Conclusively, the negative binomial regression models worked better in traffic crash prediction than the zero-inflated negative binomial regression models for estimating the frequency of traffic accidents on expressway ramps.

Penalized Likelihood Regression with Negative Binomial Data with Unknown Shape Parameter

  • Kim, Young-Ju
    • Communications for Statistical Applications and Methods
    • /
    • 제14권1호
    • /
    • pp.23-32
    • /
    • 2007
  • We consider penalized likelihood regression with data from the negative binomial distribution with unknown shape parameter. Smoothing parameter selection and asymptotically efficient low dimensional approximations are employed for negative binomial data along with shape parameter estimation through several different algorithms.

영과잉을 고려한 중심상업지구 교통사고모형 개발에 관한 연구 (Safety Performance Functions for Central Business Districts Using a Zero-Inflated Model)

  • 이상혁;우용한
    • 한국도로학회논문집
    • /
    • 제18권4호
    • /
    • pp.83-92
    • /
    • 2016
  • PURPOSES : The purpose of this study was to develop safety performance functions (SPFs) that use zero-inflated negative binomial regression models for urban intersections in central business districts (CBDs), and to compare the statistical significance of developed models against that of regular negative binomial regression models. METHODS : To develop and analyze the SPFs of intersections in CBDs, data acquisition was conducted for dependent and independent variables in areas of study. We analyzed the SPFs using zero-inflated negative binomial regression model as well as regular negative binomial regression model. We then compared the results by analyzing the statistical significance of the models. RESULTS : SPFs were estimated for all accidents and injury accidents at intersections in CBDs in terms of variables such as AADT, Number of Lanes at Major Roads, Median Barriers, Right Turn with an Exclusive Turn Lane, Turning Guideline, and Front Signal. We also estimated the log-likelihood at convergence and the likelihood ratio of SPFs for comparing the zero-inflated model with the regular model. In he SPFs, estimated log-likelihood at convergence and the likelihood ratio of the zero-inflated model were at -836.736, 0.193 and -836.415, 0.195. Also estimated the log-likelihood at convergence and likelihood ratio of the regular model were at -843.547, 0.187 and -842.631, 0.189, respectively. These figures demonstrate that zero-inflated negative binomial regression models can better explain traffic accidents at intersections in CBDs. CONCLUSIONS : SPFs that use a zero-inflated negative binomial regression model demonstrate better statistical significance compared with those that use a regular negative binomial regression model.

Bayesian Analysis for Random Effects Binomial Regression

  • Kim, Dal-Ho;Kim, Eun-Young
    • Communications for Statistical Applications and Methods
    • /
    • 제7권3호
    • /
    • pp.817-827
    • /
    • 2000
  • In this paper, we investigate the Bayesian approach to random effect binomial regression models with improper prior due to the absence of information on parameter. We also propose a method of estimating the posterior moments and prediction and discuss some general methods for studying model assessment. The methodology is illustrated with Crowder's Seeds Data. Markov Chain Monte Carlo techniques are used to overcome the computational difficulties.

  • PDF

Mixed Effects Kernel Binomial Regression

  • Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권4호
    • /
    • pp.1327-1334
    • /
    • 2008
  • Mixed effect binomial regression models are widely used for analysis of correlated count data in which the response is the result of a series of one of two possible disjoint outcomes. In this paper, we consider kernel extensions with nonparametric fixed effects and parametric random effects. The estimation is through the penalized likelihood method based on kernel trick, and our focus is on the efficient computation and the effective hyperparameter selection. For the selection of hyperparameters, cross-validation techniques are employed. Examples illustrating usage and features of the proposed method are provided.

  • PDF

A simple zero inflated bivariate negative binomial regression model with different dispersion parameters

  • Kim, Dongseok
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권4호
    • /
    • pp.895-900
    • /
    • 2013
  • In this research, we propose a simple bivariate zero inflated negative binomial regression model with different dispersion for bivariate count data with excess zeros. An application to the demand for health services shows that the proposed model is better than existing models in terms of log-likelihood and AIC.

국내 4지 원형교차로 법규위반별 사고모형 개발 (Development of Accident Model by Traffic Violation Type in Korea 4-legged Circular Intersections)

  • 박병호;김경용
    • 한국안전학회지
    • /
    • 제30권2호
    • /
    • pp.70-76
    • /
    • 2015
  • This study deals with the traffic accident of circular intersections. The purpose of the study is to develop the accident models by traffic violation type. In pursuing the above, this study gives particular attention to analyzing various factors that influence traffic accident and developing such the optimal models as Poisson and Negative binomial regression models. The main results are the followings. First, 4 negative binomial models which were statistically significant were developed. This was because the over-dispersion coefficients had a value greater than 1.96. Second, the common variables in these models were not adopted. The specific variables by model were analyzed to be traffic volume, conflicting ratio, number of circulatory lane, width of circulatory lane, number of traffic island by access road, number of reduction facility, feature of central island and crosswalk.

도시 및 지방 회전교차로 사고 발생 모형 (Urban and Rural Roundabout Accident Occurrence Models)

  • 백태헌;임진강;박병호
    • 한국도로학회논문집
    • /
    • 제17권5호
    • /
    • pp.39-46
    • /
    • 2015
  • PURPOSES: The operational characteristics of roundabouts are generally influenced by location as well as traffic volume. The goal of this study is to develop urban and rural roundabout accident models and to discuss safety improvement guidelines based on the model. METHODS : To analyze accidents, count data models are utilized in this study. This study used accident data from 2010 to 2013 for 56 roundabouts collected from the Traffic Accident Analysis System (TASS) of Road Traffic Authority. Poisson and negative binomial regression models were developed for this study using NLOGIT 4.0. RESULTS : The main results are as follows. First, the hypotheses that there are distributional differences in the number of accidents and injuries/fatalities among rural and urban roundabouts were accepted. Second, Poisson and negative binomial regression accident models, which were all statistically significant, were developed. Seven independent variables, which were statistically significant, were adopted. Third, the common variable of models was evaluated to be traffic volume. CONCLUSIONS : This study developed two negative binomial roundabout accident models and suggested some accident reduction strategies. The results are expected to give some implications to the safety improvement of roundabout.

Analysis of Food Poisoning via Zero Inflation Models

  • Jung, Hwan-Sik;Kim, Byung-Jip;Cho, Sin-Sup;Yeo, In-Kwon
    • 응용통계연구
    • /
    • 제25권5호
    • /
    • pp.859-864
    • /
    • 2012
  • Poisson regression and negative binomial regression are usually used to analyze counting data; however, these models are unsuitable for fit zero-inflated data that contain unexpected zero-valued observations. In this paper, we review the zero-inflated regression in which Bernoulli process and the counting process are hierarchically mixed. It is known that zero-inflated regression can efficiently model the over-dispersion problem. Vuong statistic is employed to compare performances of the zero-inflated models with other standard models.