• Title/Summary/Keyword: Poisson count data

Search Result 83, Processing Time 0.029 seconds

Analysis of counts in the one-way layout (일원배열 가산자료에서의 처리효과 비교)

  • 이선호
    • The Korean Journal of Applied Statistics
    • /
    • v.10 no.1
    • /
    • pp.105-119
    • /
    • 1997
  • Barnwal and Paul(1988) derived the likelihood ratio statistic and $C(\alpha)$ statistic for testing the equality of the means of several groups of count data in the presence of a common dispersion parameter. These tests are generalized to be applicable without the restriction of a common dispersion parameter. And the assumed model of data is also extended from negative binomial to double exponential Poisson model. Monte Carlo simulations show the superiority of $C(\alpha)$ statistic based on the double exponential Poisson family which has a very simple form and requires estimates of the parameters only under the null hypothesis.

  • PDF

The Prefetching Method in Mobile Environments

  • Yoo, Jin-Ah;Koh, Tea-Wook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1261-1270
    • /
    • 2006
  • This paper proposes a mobile computing prefetching method providing the effective information about location change of mobile user or mobile computing in mobile information services. For mobile computing environments, there exist restrictions as like low bandwidth, latency and traffic. To solve those problems, a variety of techniques have been developed including caching and prefetching. In this paper we present a Statistical Poisson Prefetching Scheme using the reference count to provide a mobile user information that will be likely referenced in the near future. Comparing to existing methods in numerical results, the proposed method improves the prefetching performance to give the maximum effectiveness and reduces the failure rate of information searching.

  • PDF

The study on the determinants of the number of job changes (중소기업 청년인턴 이직횟수 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.387-397
    • /
    • 2015
  • In this paper, the determinants of the number of job changes in the SMEs (small and medium enterprises) youth-intern project is analysed, utilizing SMEs youth-intern DB and employment insurance DB. Since the number of job changes are count data which take integer values other than negative values, general linear regression analysis becomes inappropriate. Therefore, four models such as Poisson regression model, zero inflated Poisson regression model, negative binomial regression model and zero inflated negative binomial regression model are tried to fit count data. A zero inflated negative binomial regression model is selected to be the best model. Major results are the followings. First, the number of job changes is shown to be significantly smaller in the treatment group than in the control group. Second, the number of job changes turns out to be significantly smaller in the young-age group than in the old-age group. Third, it is also shown that the number of job changes of man is significantly greater than that of woman. Lastly, the number of job changes in the bigger firm is shown to be significantly less than that of the smaller firm.

Analysis of Traffic Accident by Circular Intersection Type in Korea Using Count Data Model (가산자료 모형을 이용한 국내 원형교차로 유형별 교통사고 분석)

  • Kim, Tae Yang;Lee, Min Yeong;Park, Byung Ho
    • Journal of the Korean Society of Safety
    • /
    • v.32 no.5
    • /
    • pp.129-134
    • /
    • 2017
  • This study aims to develop the traffic accident models by circular intersection type using count data model. The number of accident, the number of fatal and injured persons(FSI), and EPDO are calculated from the traffic accident data of TAAS. The circular intersection accident models are developed through Poisson and negative binomial regression analysis. The main results of this study are as follows. First, the null hypotheses that there are differences in the number of traffic accidents, FSI and EPDO by type of circular intersections are rejected. Second, the scale of intersection(median, large), number of approach road, mean width and length of exit road, area of the circulating roadway and central island are selected as factors influencing the number of traffic accidents, FSI and EPDO in rotary. Third, the scale of intersection(median), guide signs(limited speed, direction, roundabout), number of approach road, entry angle, area of the intersection and central island are adopted as factors influencing the number of traffic accidents, FSI and EPDO in roundabout. Finally, transferring from rotary to roundabout could be expected to make the accident decrease.

The Effects of Dispersion Parameters and Test for Equality of Dispersion Parameters in Zero-Truncated Bivariate Generalized Poisson Models (제로절단된 이변량 일반화 포아송 분포에서 산포모수의 효과 및 산포의 동일성에 대한 검정)

  • Lee, Dong-Hee;Jung, Byoung-Cheol
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.3
    • /
    • pp.585-594
    • /
    • 2010
  • This study, investigates the effects of dispersion parameters between two response variables in zero-truncated bivariate generalized Poisson distributions. A Monte Carlo study shows that the zero-truncated bivariate Poisson and negative binomial models fit poorly wherein the zero-truncated bivariate count data has heterogeneous dispersion parameters on dependent variables. In addition, we derive the score test for testing the equality of the dispersion parameters and compare its efficiency with the likelihood ratio test.

Estimating the Economic Value of Recreational Fishing in the Jeonnam Marine Ranching Area (여행비용모형을 이용한 전남 바다목장 해역 유어활동의 경제적 가치 추정)

  • Seo, Ju-Nam;Kim, Do-Hoon;Kang, Sung-Kyung
    • The Journal of Fisheries Business Administration
    • /
    • v.43 no.2
    • /
    • pp.41-49
    • /
    • 2012
  • This study aimed to estimate the economic value of the recreational fishing in the Jeonnam marine ranching area as a part of the total socioeconomic evaluation of the Jeonnam marine ranching program. A travel cost method was applied to the estimation of economic value of the recreational fishing in the Jeonnam marine ranching area and input variables included annual fishing trip days, average travel cost per trip, average catch amount, monthly income, marriage, age, and personal perception on the marine ranching program. In the analysis, due to its characteristic of count data, both poisson model and negative binomial model were used. Model results indicated that a negative binomial model was statistically more suitable than the poisson model as the overdispersion problem occurred in the poisson model. All signs of the estimated parameters were estimated as previous studies showed. Based on the results, the economic value per trip of the recreational fishing in the Jeonnam marine ranching area was estimated to be 145,000 won and the annual total economic value of the recreational fishing in the Jeonnam marine ranching area was analyzed to be 2,514,000 won. In addition, the change of total value by catch rate showed that the economic value could be increased by 180,900 won as the catch increased by one kilogram.

Modeling of The Learning-Curve Effects on Count Responses (개수형 자료에 대한 학습곡선효과의 모형화)

  • Choi, Minji;Park, Man Sik
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.3
    • /
    • pp.445-459
    • /
    • 2014
  • As a certain job is repeatedly done by a worker, the outcome comparative to the effort to complete the job gets more remarkable. The outcome may be the time required and fraction defective. This phenomenon is referred to a learning-curve effect. We focus on the parametric modeling of the learning-curve effects on count data using a logistic cumulative distribution function and some probability mass functions such as a Poisson and negative binomial. We conduct various simulation scenarios to clarify the characteristics of the proposed model. We also consider a real application to compare the two discrete-type distribution functions.

Estimating Heterogeneous Customer Arrivals to a Large Retail store : A Bayesian Poisson model perspective (대형할인매점의 요일별 고객 방문 수 분석 및 예측 : 베이지언 포아송 모델 응용을 중심으로)

  • Kim, Bumsoo;Lee, Joonkyum
    • Korean Management Science Review
    • /
    • v.32 no.2
    • /
    • pp.69-78
    • /
    • 2015
  • This paper considers a Bayesian Poisson model for multivariate count data using multiplicative rates. More specifically we compose the parameter for overall arrival rates by the product of two parameters, a common effect and an individual effect. The common effect is composed of autoregressive evolution of the parameter, which allows for analysis on seasonal effects on all multivariate time series. In addition, analysis on individual effects allows the researcher to differentiate the time series by whatevercharacterization of their choice. This type of model allows the researcher to specifically analyze two different forms of effects separately and produce a more robust result. We illustrate a simple MCMC generation combined with a Gibbs sampler step in estimating the posterior joint distribution of all parameters in the model. On the whole, the model presented in this study is an intuitive model which may handle complicated problems, and we highlight the properties and possible applications of the model with an example, analyzing real time series data involving customer arrivals to a large retail store.

Urban and Rural Roundabout Accident Occurrence Models (도시 및 지방 회전교차로 사고 발생 모형)

  • Beck, Tea Hun;Lim, Jin Kang;Park, Byung Ho
    • International Journal of Highway Engineering
    • /
    • v.17 no.5
    • /
    • pp.39-46
    • /
    • 2015
  • PURPOSES: The operational characteristics of roundabouts are generally influenced by location as well as traffic volume. The goal of this study is to develop urban and rural roundabout accident models and to discuss safety improvement guidelines based on the model. METHODS : To analyze accidents, count data models are utilized in this study. This study used accident data from 2010 to 2013 for 56 roundabouts collected from the Traffic Accident Analysis System (TASS) of Road Traffic Authority. Poisson and negative binomial regression models were developed for this study using NLOGIT 4.0. RESULTS : The main results are as follows. First, the hypotheses that there are distributional differences in the number of accidents and injuries/fatalities among rural and urban roundabouts were accepted. Second, Poisson and negative binomial regression accident models, which were all statistically significant, were developed. Seven independent variables, which were statistically significant, were adopted. Third, the common variable of models was evaluated to be traffic volume. CONCLUSIONS : This study developed two negative binomial roundabout accident models and suggested some accident reduction strategies. The results are expected to give some implications to the safety improvement of roundabout.

The Effects of Sentiment and Readability on Useful Votes for Customer Reviews with Count Type Review Usefulness Index (온라인 리뷰의 감성과 독해 용이성이 리뷰 유용성에 미치는 영향: 가산형 리뷰 유용성 정보 활용)

  • Cruz, Ruth Angelie;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.43-61
    • /
    • 2016
  • Customer reviews help potential customers make purchasing decisions. However, the prevalence of reviews on websites push the customer to sift through them and change the focus from a mere search to identifying which of the available reviews are valuable and useful for the purchasing decision at hand. To identify useful reviews, websites have developed different mechanisms to give customers options when evaluating existing reviews. Websites allow users to rate the usefulness of a customer review as helpful or not. Amazon.com uses a ratio-type helpfulness, while Yelp.com uses a count-type usefulness index. This usefulness index provides helpful reviews to future potential purchasers. This study investigated the effects of sentiment and readability on useful votes for customer reviews. Similar studies on the relationship between sentiment and readability have focused on the ratio-type usefulness index utilized by websites such as Amazon.com. In this study, Yelp.com's count-type usefulness index for restaurant reviews was used to investigate the relationship between sentiment/readability and usefulness votes. Yelp.com's online customer reviews for stores in the beverage and food categories were used for the analysis. In total, 170,294 reviews containing information on a store's reputation and popularity were used. The control variables were the review length, store reputation, and popularity; the independent variables were the sentiment and readability, while the dependent variable was the number of helpful votes. The review rating is the moderating variable for the review sentiment and readability. The length is the number of characters in a review. The popularity is the number of reviews for a store, and the reputation is the general average rating of all reviews for a store. The readability of a review was calculated with the Coleman-Liau index. The sentiment is a positivity score for the review as calculated by SentiWordNet. The review rating is a preference score selected from 1 to 5 (stars) by the review author. The dependent variable (i.e., usefulness votes) used in this study is a count variable. Therefore, the Poisson regression model, which is commonly used to account for the discrete and nonnegative nature of count data, was applied in the analyses. The increase in helpful votes was assumed to follow a Poisson distribution. Because the Poisson model assumes an equal mean and variance and the data were over-dispersed, a negative binomial distribution model that allows for over-dispersion of the count variable was used for the estimation. Zero-inflated negative binomial regression was used to model count variables with excessive zeros and over-dispersed count outcome variables. With this model, the excess zeros were assumed to be generated through a separate process from the count values and therefore should be modeled as independently as possible. The results showed that positive sentiment had a negative effect on gaining useful votes for positive reviews but no significant effect on negative reviews. Poor readability had a negative effect on gaining useful votes and was not moderated by the review star ratings. These findings yield considerable managerial implications. The results are helpful for online websites when analyzing their review guidelines and identifying useful reviews for their business. Based on this study, positive reviews are not necessarily helpful; therefore, restaurants should consider which type of positive review is helpful for their business. Second, this study is beneficial for businesses and website designers in creating review mechanisms to know which type of reviews to highlight on their websites and which type of reviews can be beneficial to the business. Moreover, this study highlights the review systems employed by websites to allow their customers to post rating reviews.