• Title/Summary/Keyword: Poisson and Negative Binomial Regression Models

Search Result 52, Processing Time 0.026 seconds

Analyzing the Characteristics of Traffic Accidents and Developing the Models by Day and Night in the Case of the Cheongju Arterial Link Sections (청주시 간선가로 구간의 주.야간 사고특성 및 모형개발)

  • Kim, Tae-Young;Lim, Jin-Kang;Park, Byung-Ho
    • International Journal of Highway Engineering
    • /
    • v.13 no.1
    • /
    • pp.13-19
    • /
    • 2011
  • The purpose of this study is to analyze the characteristics of traffic accidents and to develop the models by day and night-time in the case of the arterial link sections. In pursuing the above, this study uses the 224 accident data occurred at the 24 arterial link sections in Cheongju. The main results analyzed are as follows. First, it was analyzed that the number of accidents during day was more than night, but the accidents rate during night was higher than day. Second, four models which were all statistically significant were developed. Finally, the differences between the day and night models were comparatively analyzed using independent variables.

Forecasting of the COVID-19 pandemic situation of Korea

  • Goo, Taewan;Apio, Catherine;Heo, Gyujin;Lee, Doeun;Lee, Jong Hyeok;Lim, Jisun;Han, Kyulhee;Park, Taesung
    • Genomics & Informatics
    • /
    • v.19 no.1
    • /
    • pp.11.1-11.8
    • /
    • 2021
  • For the novel coronavirus disease 2019 (COVID-19), predictive modeling, in the literature, uses broadly susceptible exposed infected recoverd (SEIR)/SIR, agent-based, curve-fitting models. Governments and legislative bodies rely on insights from prediction models to suggest new policies and to assess the effectiveness of enforced policies. Therefore, access to accurate outbreak prediction models is essential to obtain insights into the likely spread and consequences of infectious diseases. The objective of this study is to predict the future COVID-19 situation of Korea. Here, we employed 5 models for this analysis; SEIR, local linear regression (LLR), negative binomial (NB) regression, segment Poisson, deep-learning based long short-term memory models (LSTM) and tree based gradient boosting machine (GBM). After prediction, model performance comparison was evelauated using relative mean squared errors (RMSE) for two sets of train (January 20, 2020-December 31, 2020 and January 20, 2020-January 31, 2021) and testing data (January 1, 2021-February 28, 2021 and February 1, 2021-February 28, 2021) . Except for segmented Poisson model, the other models predicted a decline in the daily confirmed cases in the country for the coming future. RMSE values' comparison showed that LLR, GBM, SEIR, NB, and LSTM respectively, performed well in the forecasting of the pandemic situation of the country. A good understanding of the epidemic dynamics would greatly enhance the control and prevention of COVID-19 and other infectious diseases. Therefore, with increasing daily confirmed cases since this year, these results could help in the pandemic response by informing decisions about planning, resource allocation, and decision concerning social distancing policies.

Comparing the performance of likelihood ratio test and F-test for gamma generalized linear models (감마 일반화 선형 모형에서의 가능도비 검정과 F-검정 비교연구)

  • Jo, Seongil;Han, Jeongseop;Lee, Woojoo
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.4
    • /
    • pp.475-484
    • /
    • 2018
  • Gamma generalized linear models are useful for non-negative and skewed responses. However, these models have received less attention than Poisson and binomial generalized linear models. In particular, hypothesis testing for the significance of regression coefficients has not been thoroughly studied. In this paper we assess the performance of various test statistics for gamma generalized linear models based on numerical studies. Our results show that the likelihood ratio test and F-type test are generally recommended and that the partial deviance test should be avoided in practice.

Traffic Accident Models of Urban Circular Intersections by Operational Type (운영유형별 도시부 원형교차로 사고모형)

  • Kim, Kyung-Hwan;Park, Kil-Soo;Park, Byung-Ho
    • Journal of the Korean Society of Safety
    • /
    • v.27 no.2
    • /
    • pp.111-116
    • /
    • 2012
  • This study deals with the traffic accidents of circular intersections in Korea. The purposes are to comparatively analyze the characteristics by operational type, and to develop the models using the data of 82 intersections. In pursuing the above, this study gives particular emphasis to modeling such the accidents as the roundabout and rotary in urban area. The main results analyzed are as follows. First, the null hypotheses that the number of accidents are the same in both the urban and rural intersections, and roundabout and rotary in urban area, were analyzed to be rejected. Second, 3 accident models were developed, which were all statistically significant. The independent variables used in the above models are the ADT, number of approach lane, bus stop, parking facilities, and others. This study could be expected to give some implications to the traffic safety policy decision-making.

Accident Models of Circular Intersection by Cause Using ZAM (ZAM을 이용한 원형교차로 원인별 사고모형 개발)

  • Na, Hee;Park, Byung-Ho
    • International Journal of Highway Engineering
    • /
    • v.14 no.2
    • /
    • pp.101-108
    • /
    • 2012
  • This study deals with the traffic accidents of circular intersections in Korea. The goal of this study is to develop the traffic accident models using ZAM. The main results are as follows. First, in the case of 'violating the operational method of intersection', ZINB(zero-inflatednegative binomial) models were analyzed to be the best fit to the data. Second, in the case of' no maintaining the safe distance', ZINB models were also analyzed to be the best fit to the data. Finally, such the common variables as traffic volume and width of circular roadway were selected as the independent variables. The more traffic volume and the less width of circulatory roadway were evaluated to make the more accidents. Such the specific variables as the number of approach lanes and speed reduction facilities were selected as the explanatory variables. The more approach lanes and the less speed reduction facilities were evaluated to give the more accidents. This study might be expected to give some implications to the accident research on the circular intersections.

Analysis of Traffic Accident by Circular Intersection Type in Korea Using Count Data Model (가산자료 모형을 이용한 국내 원형교차로 유형별 교통사고 분석)

  • Kim, Tae Yang;Lee, Min Yeong;Park, Byung Ho
    • Journal of the Korean Society of Safety
    • /
    • v.32 no.5
    • /
    • pp.129-134
    • /
    • 2017
  • This study aims to develop the traffic accident models by circular intersection type using count data model. The number of accident, the number of fatal and injured persons(FSI), and EPDO are calculated from the traffic accident data of TAAS. The circular intersection accident models are developed through Poisson and negative binomial regression analysis. The main results of this study are as follows. First, the null hypotheses that there are differences in the number of traffic accidents, FSI and EPDO by type of circular intersections are rejected. Second, the scale of intersection(median, large), number of approach road, mean width and length of exit road, area of the circulating roadway and central island are selected as factors influencing the number of traffic accidents, FSI and EPDO in rotary. Third, the scale of intersection(median), guide signs(limited speed, direction, roundabout), number of approach road, entry angle, area of the intersection and central island are adopted as factors influencing the number of traffic accidents, FSI and EPDO in roundabout. Finally, transferring from rotary to roundabout could be expected to make the accident decrease.

Heat-Wave Data Analysis based on the Zero-Inflated Regression Models (영-과잉 회귀모형을 활용한 폭염자료분석)

  • Kim, Seong Tae;Park, Man Sik
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2829-2840
    • /
    • 2018
  • The random variable with an arbitrary value or more is called semi-continuous variable or zero-inflated one in case that its boundary value is more frequently observed than expected. This means the boundary value is likely to be practically observed more than it should be theoretically under certain probability distribution. When the distribution considered is continuous, the variable is defined as semi-continuous and when one of discrete distribution is assumed for the variable, we regard it as zero-inflated. In this study, we introduce the two-part model, which consists of one part for modelling the binary response and the other part for modelling the variable greater than the boundary value. Especially, the zero-inflated regression models are explained by using Poisson distribution and negative binomial distribution. In real data analysis, we employ the zero-inflated regression models to estimate the number of days under extreme heat-wave circumstances during the last 10 years in South Korea. Based on the estimation results, we create prediction maps for the estimated number of days under heat-wave advisory and heat-wave warning by using the universal kriging, which is one of the spatial prediction methods.

Characteristics of Geometric Conditions Affecting Freeway Traffic Safety at Nighttime, Sunrise, and Sunset (야간 및 일출몰 시간대 교통안전에 영향을 미치는 고속도로 기하구조 특성분석)

  • Hong, Sung-Min;Kim, Joon-Ki;Oh, Cheol
    • Journal of Korean Society of Transportation
    • /
    • v.30 no.4
    • /
    • pp.95-106
    • /
    • 2012
  • Driver's capability of identifying the change in freeway alignments and environments is one of important factors associated with traffic safety on freeways. In particular, driver's visibility and recognition capability are highly dependent on the altitude of the sun by sunset, sunrise, and nighttime. The purpose of this study is to identify the characteristics of geometric conditions affecting crash occurrences at sunset, sunrise, and nighttime. Poisson and negative binomial regressions were adopted to predict freeway crash frequency in this study. Freeway crash data during 2007~2010 were used for developing the crash frequency models. A set of variables representing the characteristics of geometric conditions were identified as significant ones affecting crash occurrences. The results of this study would be useful in deriving effective countermeasures for preventing traffic crashes that mainly occur at sunset, sunrise, and nighttime on freeways.

Analysis of Total Crime Count Data Based on Spatial Association Structure (공간적 연관구조를 고려한 총범죄 자료 분석)

  • Choi, Jung-Soon;Park, Man-Sik;Won, Yu-Bok;Kim, Hag-Yeol;Heo, Tae-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.2
    • /
    • pp.335-344
    • /
    • 2010
  • Reliability of the estimation is usually damaged in the situation where a linear regression model without spatial dependencies is employed to the spatial data analysis. In this study, we considered the conditional autoregressive model in order to construct spatial association structures and estimate the parameters via the Bayesian approaches. Finally, we compared the performances of the models with spatial effects and the ones without spatial effects. We analyzed the yearly total crime count data measured from each of 25 districts in Seoul, South Korea in 2007.

Analysis of Elderly Drivers' Accident Models Considering Operations and Physical Characteristics (고령운전자 운전 및 신체특성을 반영한 교통사고 분석 연구)

  • Lim, Sam Jin;Park, Jun Tae;Kim, Young Il;Kim, Tae Ho
    • Journal of Korean Society of Transportation
    • /
    • v.30 no.6
    • /
    • pp.37-46
    • /
    • 2012
  • The number of traffic accidents caused by elderly drivers over the age of 65 has surged over the past ten years from 37,000 to 274,000 cases. The proportion of elderly drivers' accidents has jumped 3.1 times from 1.2% to 3.7% out of all traffic accidents, and traffic safety organizations are pursuing diverse measures to address the situation. Above all, connecting safety measures with an in-depth research on behavioral and physical characteristics of elderly drivers will prove vital. This study conducted an empirical research linking the driving characteristics and traffic accidents by elderly drivers based on the Driving Aptitude Test items and traffic accident data, which enabled the measurement of behavioral characteristics of elderly drivers. In developing the Influence Model, we applied the zero-inflated Poisson (ZIP) regression model and selected an accident prediction model based on the Bayesian Influence in regards to the ZIP regression model and the zero-inflated negative binomial (ZINB) regression model. According to the results of the AAE analysis, the ZIP regression model was more appropriate and it was found that three variables? prediction of velocity, diversion, and cognitive ability? had a relation of influence with traffic accidents caused by elderly drivers.