• Title/Summary/Keyword: Poisson regression model

Search Result 153, Processing Time 0.024 seconds

A Ppoisson Regression Aanlysis of Physician Visits (외래이용빈도 분석의 모형과 기법)

  • 이영조;한달선;배상수
    • Health Policy and Management
    • /
    • v.3 no.2
    • /
    • pp.159-176
    • /
    • 1993
  • The utilization of outpatient care services involves two steps of sequential decisions. The first step decision is about whether to initiate the utilization and the second one is about how many more visits to make after the initiation. Presumably, the initiation decision is largely made by the patient and his or her family, while the number of additional visits is decided under a strong influence of the physician. Implication is that the analysis of the outpatient care utilization requires to specify each of the two decisions underlying the utilization as a distinct stochastic process. This paper is concerned with the number of physician visits, which is, by definition, a discrete variable that can take only non-negative integer values. Since the initial visit is considered in the analysis of whether or not having made any physician visit, the focus on the number of visits made in addition to the initial one must be enough. The number of additional visits, being a kind of count data, could be assumed to exhibit a Poisson distribution. However, it is likely that the distribution is over dispersed since the number of physician visits tends to cluster around a few values but still vary widely. A recently reported study of outpatient care utilization employed an analysis based upon the assumption of a negative binomial distribution which is a type of overdispersed Poisson distribution. But there is an indication that the use of Poisson distribution making adjustments for over-dispersion results in less loss of efficiency in parameter estimation compared to the use of a certain type of distribution like a negative binomial distribution. An analysis of the data for outpatient care utilization was performed focusing on an assessment of appropriateness of available techniques. The data used in the analysis were collected by a community survey in Hwachon Gun, Kangwon Do in 1990. It was observed that a Poisson regression with adjustments for over-dispersion is superior to either an ordinary regression or a Poisson regression without adjustments oor over-dispersion. In conclusion, it seems the most approprite to assume that the number of physician visits made in addition to the initial visist exhibits an overdispersed Poisson distribution when outpatient care utilization is studied based upon a model which embodies the two-part character of the decision process uderlying the utilization.

  • PDF

Claims Reserving via Kernel Machine

  • Kim, Mal-Suk;Park, He-Jung;Hwang, Chang-Ha;Shim, Joo-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.19 no.4
    • /
    • pp.1419-1427
    • /
    • 2008
  • This paper shows the kernel Poisson regression which can be applied in the claims reserving, where the row effect is assumed to be a nonlinear function of the row index. The paper concentrates on the chain-ladder technique, within the framework of the chain-ladder linear model. It is shown that the proposed method can provide better reserve estimates than the Poisson model. The cross validation function is introduced to choose optimal hyper-parameters in the procedure. Experimental results are then presented which indicate the performance of the proposed model.

  • PDF

Analysis of Food Poisoning via Zero Inflation Models

  • Jung, Hwan-Sik;Kim, Byung-Jip;Cho, Sin-Sup;Yeo, In-Kwon
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.5
    • /
    • pp.859-864
    • /
    • 2012
  • Poisson regression and negative binomial regression are usually used to analyze counting data; however, these models are unsuitable for fit zero-inflated data that contain unexpected zero-valued observations. In this paper, we review the zero-inflated regression in which Bernoulli process and the counting process are hierarchically mixed. It is known that zero-inflated regression can efficiently model the over-dispersion problem. Vuong statistic is employed to compare performances of the zero-inflated models with other standard models.

Assessment Model for Industrial Accidents Prevention Policy (산업재해 예방정책에 대한 평가모형)

  • Kim, Youngsun;Jo, Jinnam;Baik, Jaiwook
    • Journal of Applied Reliability
    • /
    • v.17 no.1
    • /
    • pp.38-49
    • /
    • 2017
  • Purpose: The purpose of this study is to introduce the assessment model for industrial accidents prevention policy. Methods: 10 years of industrial accidents data are explored through EDA approach. Case control study is tried in order to assess the effectiveness of the measures taken by Korea Safety and Health Agency, Civilian, and 'Ministry of Employment and Labor'. Propensity score matching is used to match the characteristics of the two groups compared, and then case control study is again conducted. Next, logistic and Poisson regressions are used to assess the risk factors. Results: According to case control study involvement of 'Korea Safety and Health Agency' and 'Ministry of Employment and Labor' were not effective, but Civilian was. Propensity score matching leads to the same conclusion. Poisson regression reveals the impact of the risk factors on the industrial accidents. Industrial accidents occur more often as the number of employees grows. Mining, farming, fishing, 'transportation storage and telecommunication' and forestry have a higher level of industrial accidents but service industry has a lower level. It is odd that more involvement of Korea Safety and Health Agency, Civilian, and Ministry of Employment and Labor means more industrial accidents. Conclusion: 'Korea Safety and Health Agency', Civilian, and 'Ministry of Employment and Labor' seem to visit those industries with more industrial accidents.

Urban and Rural Roundabout Accident Occurrence Models (도시 및 지방 회전교차로 사고 발생 모형)

  • Beck, Tea Hun;Lim, Jin Kang;Park, Byung Ho
    • International Journal of Highway Engineering
    • /
    • v.17 no.5
    • /
    • pp.39-46
    • /
    • 2015
  • PURPOSES: The operational characteristics of roundabouts are generally influenced by location as well as traffic volume. The goal of this study is to develop urban and rural roundabout accident models and to discuss safety improvement guidelines based on the model. METHODS : To analyze accidents, count data models are utilized in this study. This study used accident data from 2010 to 2013 for 56 roundabouts collected from the Traffic Accident Analysis System (TASS) of Road Traffic Authority. Poisson and negative binomial regression models were developed for this study using NLOGIT 4.0. RESULTS : The main results are as follows. First, the hypotheses that there are distributional differences in the number of accidents and injuries/fatalities among rural and urban roundabouts were accepted. Second, Poisson and negative binomial regression accident models, which were all statistically significant, were developed. Seven independent variables, which were statistically significant, were adopted. Third, the common variable of models was evaluated to be traffic volume. CONCLUSIONS : This study developed two negative binomial roundabout accident models and suggested some accident reduction strategies. The results are expected to give some implications to the safety improvement of roundabout.

Developing Rear-End Collision Models of Roundabouts in Korea (국내 회전교차로의 추돌사고 모형 개발)

  • Park, Byung Ho;Beak, Tae Hun
    • Journal of the Korean Society of Safety
    • /
    • v.29 no.6
    • /
    • pp.151-157
    • /
    • 2014
  • This study deals with the rear-end collision at roundabouts. The purpose of this study is to develop the accident models of rear-end collision in Korea. In pursuing the above, this study gives particular attention to developing the appropriate models using Poisson, negative binomial model, ZAM, multiple linear and nonlinear regression models, and statistical analysis tools. The main results are as follows. First, the Vuong statistics and overdispersion parameters indicate that ZIP is the most appropriate model among count data models. Second, RMSE, MPB, MAD and correlation coefficient tests show that the multiple nonlinear model is the most suitable to the rear-end collision data. Finally, such the independent variables as traffic volume, ratio of heavy vehicle, number of circulatory roadway lane, number of crosswalk and stop line are adopted in the optimal model.

The study on the determinants of the number of job changes (중소기업 청년인턴 이직횟수 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.387-397
    • /
    • 2015
  • In this paper, the determinants of the number of job changes in the SMEs (small and medium enterprises) youth-intern project is analysed, utilizing SMEs youth-intern DB and employment insurance DB. Since the number of job changes are count data which take integer values other than negative values, general linear regression analysis becomes inappropriate. Therefore, four models such as Poisson regression model, zero inflated Poisson regression model, negative binomial regression model and zero inflated negative binomial regression model are tried to fit count data. A zero inflated negative binomial regression model is selected to be the best model. Major results are the followings. First, the number of job changes is shown to be significantly smaller in the treatment group than in the control group. Second, the number of job changes turns out to be significantly smaller in the young-age group than in the old-age group. Third, it is also shown that the number of job changes of man is significantly greater than that of woman. Lastly, the number of job changes in the bigger firm is shown to be significantly less than that of the smaller firm.

Rear-end Accident Models of Rural Area Signalized Intersections in the Cases of Cheongju and Cheongwon (청주.청원 지방부 신호교차로의 후미추돌 사고모형)

  • Park, Byoung-Ho;In, Byung-Chul
    • International Journal of Highway Engineering
    • /
    • v.11 no.2
    • /
    • pp.151-158
    • /
    • 2009
  • This study deals with the rear-end collisions in the rural aiea. The objectives of this study are 1) to analyze the characteristics of rear-end accidents of signalized intersections, and 2) to develop the accident models for Cheongju-Cheongwon. In pursing the above, this study gives the particular attentions to comparing the characters of urban and rural area. In this study, the dependent variables are the number of accidents and value of EPDO(equivalent property damage only), and independent variables are the traffic volumes and geometric elements. The main results analyzed are the followings. First, the statistical analyses show that the Poisson accident model using the number of accident as a dependant variable are statistically significant and the negative binomial accident model using the value of EPDO are statistically significant. Second, the independent variables of Poisson model are analyzed to be the ratio of high-occupancy vehicles, total traffic volume and the sum of exit/entry, and those of negative binomial regression are the main road width, total traffic volume and the ratio of high-occupancy vehicles. Finally, the specific independent variables to the rural area are the main road width, the ratio of high occupancy vehicle, and the sum exit/entry.

  • PDF

Analysis of Old Driver's Accident Influencing Factors Considering Human Factors (인적특성을 고려한 고령 운전자 교통사고 영향요인 분석)

  • Kim, Tae-Ho;Kim, Eun-Kyung;Rho, Jeong-Hyun
    • Journal of the Korean Society of Safety
    • /
    • v.24 no.1
    • /
    • pp.69-77
    • /
    • 2009
  • This paper reports the aging driver traffic accident severity modeling results. For the modeling, Poisson regression approach is applied using the data set obtained from the Korea Transportation Safety Authority's simulator-based driver aptitude test results. The test items include the estimations of moving objects' speed and stopping distance, drivers' multi-task capability, and kinetic depth perception and so on. The resulting model with the response variable of equivalent property damage only(EPDO) indicated that EPDO is significantly influenced by moving objects' speed estimation and drivers' multi-task capabilities. More interestingly, a comparison with the younger driver model revealed that the degradation of such capabilities may result in severer crashes for older drivers as suggested by the higher estimated parameters for the older driver model.

Bayesian Analysis for the Zero-inflated Regression Models (영과잉 회귀모형에 대한 베이지안 분석)

  • Jang, Hak-Jin;Kang, Yun-Hee;Lee, S.;Kim, Seong-W.
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.4
    • /
    • pp.603-613
    • /
    • 2008
  • We often encounter the situation that discrete count data have a large portion of zeros. In this case, it is not appropriate to analyze the data based on standard regression models such as the poisson or negative binomial regression models. In this article, we consider Bayesian analysis for two commonly used models. They are zero-inflated poisson and negative binomial regression models. We use the Bayes factor as a model selection tool and computation is proceeded via Markov chain Monte Carlo methods. Crash count data are analyzed to support theoretical results.