• Title/Summary/Keyword: binomial data

A Study on Factors Influencing Floating Population using Mobile Phone Data in Urban Area (이동통신 자료를 활용한 대도시 유동인구 영향요인 분석)

  • Kwak, Ho-Chan;Song, Ji Young;Eom, Jin Ki;Kim, Kyoung Tae
    • Journal of The Korean Society For Urban Railway / v.6 no.4 / pp.373-381 / 2018
  • The floating population, an index of dynamic activity in urban areas, is important for urban railway planning, but its usefulness is limited because it can only be collected after the fact. This study investigates the factors that influence floating population. Floating population data collected in Seoul during December 2013 serve as the dependent variable, and negative binomial regression is used for modelling. The number of households, number of employees, number of subway stations, and number of bus lines are statistically significant predictors of floating population.

A Poisson Regression Analysis of Physician Visits (외래이용빈도 분석의 모형과 기법)

  • 이영조;한달선;배상수
    • Health Policy and Management / v.3 no.2 / pp.159-176 / 1993
  • The utilization of outpatient care services involves two sequential decisions. The first is whether to initiate utilization, and the second is how many more visits to make after initiation. Presumably, the initiation decision is made largely by the patient and his or her family, while the number of additional visits is decided under the strong influence of the physician. The implication is that an analysis of outpatient care utilization must specify each of the two underlying decisions as a distinct stochastic process. This paper is concerned with the number of physician visits, which is, by definition, a discrete variable that can take only non-negative integer values. Since the initial visit is accounted for in the analysis of whether any physician visit was made, it suffices to focus on the number of visits made in addition to the initial one. The number of additional visits, being count data, could be assumed to follow a Poisson distribution. However, the distribution is likely to be overdispersed, since the number of physician visits tends to cluster around a few values but still varies widely. A recently reported study of outpatient care utilization based its analysis on the assumption of a negative binomial distribution, which is a type of overdispersed Poisson distribution. But there is an indication that using a Poisson distribution with adjustments for overdispersion results in less loss of efficiency in parameter estimation than using a specific distribution such as the negative binomial. The data for outpatient care utilization were analyzed with a focus on assessing the appropriateness of the available techniques. The data were collected by a community survey in Hwachon Gun, Kangwon Do in 1990. It was observed that a Poisson regression with adjustments for overdispersion is superior to either an ordinary regression or a Poisson regression without adjustments for overdispersion. In conclusion, it seems most appropriate to assume that the number of physician visits made in addition to the initial visit follows an overdispersed Poisson distribution when outpatient care utilization is studied with a model that embodies the two-part character of the decision process underlying the utilization.
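
The overdispersion discussed in this abstract can be checked with a simple variance-to-mean ratio. The sketch below is only an illustration: the visit counts are hypothetical (not the Hwachon Gun survey data), and the abstract does not say which adjustment the authors used. For an intercept-only Poisson model, this ratio equals the Pearson dispersion estimate, and a common quasi-Poisson adjustment scales the Poisson standard errors by its square root.

```python
from math import sqrt
from statistics import mean, variance


def dispersion_ratio(counts):
    """Sample variance divided by sample mean; about 1 for Poisson-like counts."""
    m = mean(counts)
    if m == 0:
        raise ValueError("mean is zero, so the ratio is undefined")
    return variance(counts) / m


# Hypothetical numbers of additional physician visits (assumed for illustration).
visits = [0, 0, 0, 1, 1, 1, 2, 2, 3, 10]

ratio = dispersion_ratio(visits)
# Factor by which quasi-Poisson standard errors would exceed plain Poisson ones.
se_inflation = sqrt(ratio)
print(f"variance/mean = {ratio:.2f}, SE inflation = {se_inflation:.2f}")
```

A ratio well above 1, as in this made-up sample, is the situation in which an unadjusted Poisson regression would understate uncertainty.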

RESPONSES OF DAMPED HARMONIC OSCILLATORS TO EXCITATIONS OBEYING POISSON DISTRIBUTIONS

  • Lee, Hyoung-In;Mok, Jinsik
    • Journal of applied mathematics & informatics / v.31 no.1_2 / pp.111-118 / 2013
  • External excitations are employed to investigate properties of optical media, with measurement data often analyzed via linear response theory. Here, external forcing is modeled by the well-known Poisson and negative binomial distributions. The ensuing dynamics are examined with special attention to the relative decay rates of damped harmonic oscillators under such external forcing, along with their relationship to other physical phenomena.

Empirical Bayes Problems with Dependent and Nonidentical Components

  • Inha Jung;Jee-Chang Hong;Kang Sup Lee
    • Communications for Statistical Applications and Methods / v.2 no.1 / pp.145-154 / 1995
  • An empirical Bayes approach is applied to the estimation of the binomial parameter when there is a cost for observations. Both the sample size and the decision rule for estimating the parameter are determined stochastically by the data, making the result more useful in applications. Our empirical Bayes problems with non-iid components are compared to the usual empirical Bayes problems with iid components. The asymptotically optimal procedure is given, together with a computer simulation.

A Study on the Power Comparison between Logistic Regression and Offset Poisson Regression for Binary Data

  • Kim, Dae-Youb;Park, Heung-Sun
    • Communications for Statistical Applications and Methods / v.19 no.4 / pp.537-546 / 2012
  • In this paper, Poisson regression with an offset and logistic regression are compared via simulation with respect to power in the analysis of binary data. The Poisson distribution can be used as an approximation to the binomial distribution when n is large and p is small; however, we investigate whether the same conditions hold for the power of significance tests under logistic regression and offset Poisson regression. The result is that when the offset size is large, offset Poisson regression has power similar to logistic regression for rare events, and it retains acceptable power even at a moderate prevalence rate. However, with a small offset size (< 10), offset Poisson regression should be used with caution for both rare and common events. These results provide guidelines for users who want to apply offset Poisson regression models to binary data.
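
The approximation this abstract relies on (binomial with large n and small p is close to Poisson with mean np) can be checked numerically. The following is a minimal sketch, not the paper's simulation: the two (n, p) pairs are assumed values chosen only to contrast a small-n case with a large-n, rare-event case, and the helper names are hypothetical.

```python
from math import exp, lgamma, log


def binomial_pmf(k, n, p):
    """Binomial(n, p) probability of k successes, computed in log space (0 < p < 1)."""
    log_pmf = (lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
               + k * log(p) + (n - k) * log(1 - p))
    return exp(log_pmf)


def poisson_pmf(k, lam):
    """Poisson(lam) probability of k events, computed in log space (lam > 0)."""
    return exp(-lam + k * log(lam) - lgamma(k + 1))


def max_abs_gap(n, p):
    """Largest pointwise difference between Binomial(n, p) and Poisson(n * p) over k = 0..n."""
    lam = n * p
    return max(abs(binomial_pmf(k, n, p) - poisson_pmf(k, lam)) for k in range(n + 1))


# Assumed illustrative settings: few trials with a common event vs. many trials with a rare event.
gap_small_n = max_abs_gap(5, 0.3)
gap_large_n = max_abs_gap(1000, 0.003)
print(f"n=5, p=0.3:      max gap = {gap_small_n:.4f}")
print(f"n=1000, p=0.003: max gap = {gap_large_n:.6f}")
```

The gap shrinks markedly in the large-n, small-p case. Whether test power behaves the same way is the separate question the paper studies by simulation.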

Bayesian Methods for Generalized Linear Models

  • Paul E. Green;Kim, Dae-Hak
    • Communications for Statistical Applications and Methods / v.6 no.2 / pp.523-532 / 1999
  • Generalized linear models have various applications for data arising from many kinds of statistical studies. Although the response variable is generally assumed to be generated from a wide class of probability distributions, we focus on count data, which are most often analyzed using binomial models for proportions or Poisson models for rates. The methods and results presented here also apply to many other categorical data models because of the relationship between multinomial and Poisson sampling. The novelty of the suggested approach is that all conditional distributions can be specified directly, so that straightforward Gibbs sampling is possible. The prior distribution consists of two stages: a normal nonconjugate prior at the first stage and a vague prior for the hyperparameters at the second stage. The methods are demonstrated with an illustrative example using data collected by Rosenkranz and Raftery (1994) concerning the number of hospital admissions due to back pain in Washington state.

Threshold-asymmetric volatility models for integer-valued time series

  • Kim, Deok Ryun;Yoon, Jae Eun;Hwang, Sun Young
    • Communications for Statistical Applications and Methods / v.26 no.3 / pp.295-304 / 2019
  • This article deals with threshold-asymmetric volatility models for over-dispersed and zero-inflated time series of count data. We introduce various threshold integer-valued autoregressive conditional heteroscedasticity (ARCH) models that incorporate over-dispersion and zero-inflation via conditional Poisson and negative binomial distributions. The EM algorithm is used to estimate the parameters. Cholera data from Kolkata, India, from 2006 to 2011 are analyzed as a real application. To construct the threshold variable, both a time-varying local constant mean and the grand mean are adopted. The data application shows that the threshold model, as an asymmetric version, is useful in modelling count time series volatility.

Analysis of Accident Characteristics and Improvement Strategies of Flash Signal-operated Intersection in Seoul (서울시 점멸신호 운영에 따른 교통사고 분석 및 개선방안에 관한 연구)

  • Kim, Seung-Jun;Park, Byung-Jung;Lee, Jin-Hak;Kim, Ok-Sun
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.13 no.6 / pp.54-63 / 2014
  • Traffic accident frequency and severity in Korea are known to be very serious. In particular, the number of pedestrian fatalities is 1.6 times higher than the OECD average. According to the National Police Agency, flash signals are reported to have many safety benefits as well as travel time reductions, which contradicts foreign studies. Against the background of expanding flash signal operation, this research investigates the overall impact of flash signal operation on safety by comparing accident occurrence at flash signal and full signal intersections. To do so, accident prediction models for both flash and full signal intersections were estimated using independent variables (geometric features and traffic volume) and 3-year (2011-2013) accident data collected in Seoul. Considering the rare and random nature of accident occurrence and the overdispersion (variance > mean) of the data, the negative binomial regression model was applied. The results suggest that installing wider crosswalks and increasing the number of pedestrian push buttons increase the safety of flash signal intersections. In addition, average accident occurrence at flash signal intersections was 9% higher than at full signal intersections, all else being equal.
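
As a reading aid for the overdispersion condition and the 9% figure, the relations below sketch one common form of the negative binomial model. The NB2 variance function, the log link, and the single flash-signal indicator $x_{i,\text{flash}}$ are assumptions made for illustration; the abstract does not report the parameterization or any coefficients.

```latex
% Assumed NB2 parameterization: variance exceeds the mean whenever alpha > 0.
\[
  \operatorname{E}(Y_i) = \mu_i, \qquad
  \operatorname{Var}(Y_i) = \mu_i + \alpha \mu_i^{2} > \mu_i
  \quad (\alpha > 0).
\]
% Assumed log link with a hypothetical flash-signal indicator.
\[
  \log \mu_i = \beta_0 + \beta_{\text{flash}}\, x_{i,\text{flash}} + \cdots,
  \qquad
  e^{\beta_{\text{flash}}} = 1.09
  \;\Longrightarrow\;
  \beta_{\text{flash}} = \ln 1.09 \approx 0.086 .
\]
```

Under these assumptions, a 9% higher expected accident count with all other variables held fixed corresponds to a rate ratio of 1.09.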

Estimating Travel Frequency of Public Bikes in Seoul Considering Intermediate Stops (경유지를 고려한 서울시 공공자전거 통행발생량 추정 모형 개발)

  • Jonghan Park;Joonho Ko
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.22 no.3 / pp.1-19 / 2023
  • Bikes have recently emerged as an alternative mode for achieving carbon neutrality. To understand the demand for public bikes, we estimated the travel frequency of public bikes while considering intermediate stops. Using GPS trajectory data from 'Ttareungyi', a public bike service in Seoul, we identified stay points and estimated travel frequency as a function of population, land use, and physical characteristics. Applying map matching and a stay point detection algorithm revealed that stay points appeared in about 12.1% of all trips. Compared to trips without a stay point, trips with a stay point have a longer average travel distance and travel time and occur at a higher rate during off-peak hours. According to visualization analysis, stay points are mainly found in parks, leisure facilities, and business facilities. To account for stay points, the unit of analysis was set to a hexagonal grid rather than the existing rental station base. Travel frequency considering stay points was analyzed using the Zero-Inflated Negative Binomial (ZINB) model. The results revealed that travel frequency was higher on bike infrastructure where the safety of bike users was secured, such as 'Bikepath' and 'Bike and pedestrian path'. Public bikes also serve as a first- and last-mile means of access to public transportation. Travel frequency was also observed to increase in life and employment centers. Considering these results, securing safety facilities and space for users should be given priority when planning any additional expansion of bike infrastructure. Moreover, a plan is needed to supply bike infrastructure facilities linked to public transportation, especially the subway.
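
For readers unfamiliar with the ZINB model named in this abstract, the sketch below writes out its probability mass function as a mixture of structural zeros and a negative binomial count. It is an illustration only: the NB2 parameterization, the function names, and the parameter values are assumptions, and nothing here reproduces the paper's fitted model.

```python
from math import exp, lgamma, log


def nb_pmf(k, mu, alpha):
    """Negative binomial (NB2) pmf with mean mu and variance mu + alpha * mu**2."""
    r = 1.0 / alpha  # shape parameter
    log_pmf = (lgamma(k + r) - lgamma(r) - lgamma(k + 1)
               + r * log(r / (r + mu)) + k * log(mu / (r + mu)))
    return exp(log_pmf)


def zinb_pmf(k, mu, alpha, pi_zero):
    """Zero-inflated NB pmf: a structural zero with probability pi_zero, otherwise an NB2 count."""
    count_part = (1.0 - pi_zero) * nb_pmf(k, mu, alpha)
    return pi_zero + count_part if k == 0 else count_part


# Assumed illustrative parameters, e.g. trips per grid cell with 30% structurally empty cells.
mu, alpha, pi_zero = 3.0, 0.5, 0.3
probs = [zinb_pmf(k, mu, alpha, pi_zero) for k in range(501)]
total = sum(probs)
zinb_mean = sum(k * p for k, p in enumerate(probs))
print(f"P(0) under NB = {nb_pmf(0, mu, alpha):.3f}, under ZINB = {probs[0]:.3f}")
```

The extra mass at zero is what lets the model accommodate grid cells that generate no trips at all, while the mean of the mixture shrinks to (1 - pi_zero) * mu.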

Evaluation for usefulness of Chukwookee Data in Rainfall Frequency Analysis (강우빈도해석에서의 측우기자료의 유용성 평가)

  • Kim, Kee-Wook;Yoo, Chul-Sang;Park, Min-Kyu;Kim, Hyeon-Jun
    • Journal of Korea Water Resources Association / v.40 no.11 / pp.851-859 / 2007
  • In this study, the chukwookee data were evaluated by applying them to historical rainfall frequency analysis. To derive a two-parameter log-normal distribution using historical and modern data, censored-data MLE and binomial censored-data MLE were applied. As a result, we found that both the mean and the standard deviation were estimated to be smaller with the chukwookee data than with modern data only. This indicates that rather big events happened more rarely during the period of the chukwookee data than during the modern period. The frequency analysis results using the estimated parameters were also similar to those expected. Notably, the rainfall quantiles estimated by both methods were similar. This result indicates that historical document records such as the annals of the Chosun dynasty could be valuable and effective for frequency analysis. It also means that the data available for frequency analysis can be extended.