• Title/Summary/Keyword: negative binomial regression model

Search Result 113, Processing Time 0.034 seconds

Development of Time-based Safety Performance Function for Freeways (세부 집계단위별 교통 특성을 반영한 고속도로 안전성능함수 개발)

  • Kang, Kawon;Park, Juneyoung;Lee, Kiyoung;Park, Joonggyu;Song, Changjun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.6
    • /
    • pp.203-213
    • /
    • 2021
  • A vehicle crash occurs due to various factors such as the geometry of the road section, traffic, and driver characteristics. A safety performance function has been used in many studies to estimate the relationship between vehicle crash and road factors statistically. And depends on the purpose of the analysis, various characteristic variables have been used. And various characteristic variables have been used in the studies depending on the purpose of analysis. The existing domestic studies generally reflect the average characteristics of the sections by quantifying the traffic volume in macro aggregate units such as the ADT, but this has a limitation that it cannot reflect the real-time changing traffic characteristics. Therefore, the need for research on effective aggregation units that can flexibly reflect the characteristics of the traffic environment arises. In this paper, we develop a safety performance function that can reflect the traffic characteristics in detail with an aggregate unit for one hour in addition to the daily model used in the previous studies. As part of the present study, we also perform a comparison and evaluation between models. The safety performance function for daily and hourly units is developed using a negative binomial regression model with the number of accidents as a dependent variable. In addition, the optimal negative binomial regression model for each of the hourly and daily models was selected, and their prediction performances were compared. The model and evaluation results presented in this paper can be used to determine the risk factors for accidents in the highway section considering the dynamic characteristics. In addition, the model and evaluation results can also be used as the basis for evaluating the availability and transferability of the hourly model.

Application of discrete Weibull regression model with multiple imputation

  • Yoo, Hanna
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.3
    • /
    • pp.325-336
    • /
    • 2019
  • In this article we extend the discrete Weibull regression model in the presence of missing data. Discrete Weibull regression models can be adapted to various type of dispersion data however, it is not widely used. Recently Yoo (Journal of the Korean Data and Information Science Society, 30, 11-22, 2019) adapted the discrete Weibull regression model using single imputation. We extend their studies by using multiple imputation also with several various settings and compare the results. The purpose of this study is to address the merit of using multiple imputation in the presence of missing data in discrete count data. We analyzed the seventh Korean National Health and Nutrition Examination Survey (KNHANES VII), from 2016 to assess the factors influencing the variable, 1 month hospital stay, and we compared the results using discrete Weibull regression model with those of Poisson, negative Binomial and zero-inflated Poisson regression models, which are widely used in count data analyses. The results showed that the discrete Weibull regression model using multiple imputation provided the best fit. We also performed simulation studies to show the accuracy of the discrete Weibull regression using multiple imputation given both under- and over-dispersed distribution, as well as varying missing rates and sample size. Sensitivity analysis showed the influence of mis-specification and the robustness of the discrete Weibull model. Using imputation with discrete Weibull regression to analyze discrete data will increase explanatory power and is widely applicable to various types of dispersion data with a unified model.

Study on the Development of Truck Traffic Accident Prediction Models and Safety Rating on Expressways (고속도로 화물차 교통사고 건수 예측모형 및 안전등급 개발 연구)

  • Jungeun Yoon;Harim Jeong;Jangho Park;Donghyo Kang;Ilsoo Yun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.1
    • /
    • pp.1-15
    • /
    • 2023
  • In this study, the number of truck traffic accidents was predicted by using Poisson and negative binomial regression analysis to understand what factors affect accidents using expressway data. Significant variables in the truck traffic accident prediction model were continuous driving time, link length, truck traffic volume. number of bridges and number of drowsy shelters. The calculated LOSS rating was expressed on the national expressway network to diagnose the risk of truck accidents. This is expected to be used as basic data for policy establishment to reduce truck accidents on expressways.

Traffic Accident Models of Cheongju Four-Legged Signalized Intersections by Accident Type (사고유형에 따른 청주시 4지 신호교차로 교통사고모형)

  • Park, Byung-Ho;Han, Sang-Wook;Kim, Tae-Young;Kim, Won-Ho
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.5
    • /
    • pp.153-162
    • /
    • 2008
  • This study deals with the traffic accidents at the 4-legged signalized intersections in Cheong-ju. The purpose is to comparatively analyze the characteristics and models by the accident type using the data of 143 intersections. In pursuing the above, this study gives particular emphasis to modeling such the accidents as head on collision, rear end collision, side swipe, side right angle collision, and others. The main results are the followings. First, the overdispersion tests show that the negative binomial regression models are appropriate to the traffic accident data in the above contexts. Second, five accident models are developed, which are all analyzed to be statistically significant. Finally, the models are comparatively evaluated using the common variable(ADT) and type-specific variables.

Macro-Level Accident Prediction Model using Mobile Phone Data (이동통신 자료를 활용한 거시적 교통사고 예측 모형 개발)

  • Kwak, Ho-Chan;Song, Ji Young;Lee, In Mook;Lee, Jun
    • Journal of the Korean Society of Safety
    • /
    • v.33 no.4
    • /
    • pp.98-104
    • /
    • 2018
  • Macroscopic accident analyses have been conducted to incorporate transportation safety into long-term transportation planning. In macro-level accident prediction model, exposure variable(e.g. a settled population) have been used as fundamental explanatory variable under the concept that each trip will be subjected to a probable risk of accident. However, a settled population may be embedded error by exclusion of active population concept. The objective of this research study is to develop macro-level accident prediction model using floating population variable(concept of including a settled population and active population) collected from mobile phone data. The concept of accident prediction models is introduced utilizing exposure variable as explanatory variable in a generalized linear regression with assumption of a negative binomial error structure. The goodness of fit of model using floating population variable is compared with that of the each models using population and the number of household variables. Also, log transformation models are additionally developed to improve the goodness of fit. The results show that the log transformation model using floating population variable is useful for capturing the relationships between accident and exposure variable and generally perform better than the models using other existing exposure variables. The developed model using floating population variable can be used to guide transportation safety policy decision makers to allocate resources more efficiently for the regions(or zones) with higher risk and improve urban transportation safety in transportation planning step.

Accident Models of Rotary by Vehicle Type (차량유형별 로터리 사고모형)

  • Han, Su-San;Park, Byeong-Ho
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.6
    • /
    • pp.67-74
    • /
    • 2011
  • This study deals with the traffic accidents data from the Korean rotaries (circular intersections) to verify their characteristics affected by different vehicle types. This paper categorized the data into three groups based on vehicle types, and developed a set of accident models. The paper proposed two ZIP models and one negative binomial model through a statistical analysis for three vehicle types: automobile, truck and van, and others. The differences among those models were then statistically compared.

A Ppoisson Regression Aanlysis of Physician Visits (외래이용빈도 분석의 모형과 기법)

  • 이영조;한달선;배상수
    • Health Policy and Management
    • /
    • v.3 no.2
    • /
    • pp.159-176
    • /
    • 1993
  • The utilization of outpatient care services involves two steps of sequential decisions. The first step decision is about whether to initiate the utilization and the second one is about how many more visits to make after the initiation. Presumably, the initiation decision is largely made by the patient and his or her family, while the number of additional visits is decided under a strong influence of the physician. Implication is that the analysis of the outpatient care utilization requires to specify each of the two decisions underlying the utilization as a distinct stochastic process. This paper is concerned with the number of physician visits, which is, by definition, a discrete variable that can take only non-negative integer values. Since the initial visit is considered in the analysis of whether or not having made any physician visit, the focus on the number of visits made in addition to the initial one must be enough. The number of additional visits, being a kind of count data, could be assumed to exhibit a Poisson distribution. However, it is likely that the distribution is over dispersed since the number of physician visits tends to cluster around a few values but still vary widely. A recently reported study of outpatient care utilization employed an analysis based upon the assumption of a negative binomial distribution which is a type of overdispersed Poisson distribution. But there is an indication that the use of Poisson distribution making adjustments for over-dispersion results in less loss of efficiency in parameter estimation compared to the use of a certain type of distribution like a negative binomial distribution. An analysis of the data for outpatient care utilization was performed focusing on an assessment of appropriateness of available techniques. The data used in the analysis were collected by a community survey in Hwachon Gun, Kangwon Do in 1990. It was observed that a Poisson regression with adjustments for over-dispersion is superior to either an ordinary regression or a Poisson regression without adjustments oor over-dispersion. In conclusion, it seems the most approprite to assume that the number of physician visits made in addition to the initial visist exhibits an overdispersed Poisson distribution when outpatient care utilization is studied based upon a model which embodies the two-part character of the decision process uderlying the utilization.

  • PDF

Comparative Analysis on the Characteristics and Models of Traffic Accidents by Day and Nighttime in the Case of Cheongju 4-legged ignalized Intersections (주·야간 교통사고의 특성 및 사고모형 비교분석 -청주시 4지 신호교차로를 중심으로 -)

  • Yoo, Doo Seon;Oh, Sang Jin;Kim, Tae Young;Park, Byung Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.2D
    • /
    • pp.181-189
    • /
    • 2008
  • The purpose of this study is to comparatively analyze the characteristics and models of traffic accidents by day and nighttime. In pursuing the above, this study gives particular attentions to testing the differences and developing the models (multiple linear and non-linear and Poisson and negative binomial regression) using the data of Cheongju 4-legged signalized intersections. The main results analyzed are as follows. First, the differences between day and nighttime accidents were defined. Second, 12 accident models which are all statistically significant were developed. Finally, the differences between day and nighttime models were comparatively analyzed using the common and specific variables.

Tests for Equality of Dispersions in the Generalized Bivariate Negative Binomial Regression Model with Heterogeneous Dispersions (서로 다른 산포를 갖는 이변량 음이항 회귀모형에서 산포의 동일성에 대한 검정)

  • Han, Sang-Moon;Jung, Byoung-Cheol
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.2
    • /
    • pp.219-227
    • /
    • 2011
  • In this paper, we proposed a generalized bivariate negative binomial distribution allowing heterogeneous dispersions on two dependent variables based on a trivariate reduction technique. In this model, we propose the score and LR tests for testing the equality of dispersions and compare the efficiencies of the proposed tests using a Monte Carlo study. The Monte Carlo study shows that the proposed score and LR tests prove to be an efficient test for the equality of dispersions in the view of the significance level and power. However, the score test is easier to compute than the LR test and it shows a slightly better performance than the LR test from the Monte Carlo study, we suggest the use of score tests for testing the equality of dispersions on two dependent variables. In addition, an empirical example is provided to illustrate the results.

Analysis of Accident Characteristics and Improvement Strategies of Flash Signal-operated Intersection in Seoul (서울시 점멸신호 운영에 따른 교통사고 분석 및 개선방안에 관한 연구)

  • Kim, Seung-Jun;Park, Byung-Jung;Lee, Jin-Hak;Kim, Ok-Sun
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.6
    • /
    • pp.54-63
    • /
    • 2014
  • Traffic accident frequency and severity level in Korea are known to be very serious. Especially the number of pedestrian fatalities was much worse and 1.6 time higher than the OECD average. According to the National Police Agency, the flash signals are reported to have many safety benefits as well as travel time reduction, which is opposed to the foreign studies. With this background of expanding the flash signal, this research aims to investigate the overall impact of the flash signal operation on safety, investigating and comparing the accident occurrence on the flash signal and the full signal intersections. For doing this accident prediction models for both flash and full signal intersections were estimated using independent variables (geometric features and traffic volume) and 3-year (2011-2013) accident data collected in Seoul. Considering the rare and random nature of accident occurrence and overdispersion (variance > mean) of the data, the negative binomial regression model was applied. As a result, installing wider crosswalk and increasing the number of pedestrian push buttons seemed to increase the safety of the flash signal intersections. In addition, the result showed that the average accident occurrence at the flash signal intersections was higher than at the full signal-operated intersections, 9% higher with everything else the same.