• Title/Summary/Keyword: 영과잉모형

Search Result 32, Processing Time 0.019 seconds

An Analysis on the Determinants of Employed Labour Quantity in the Fishing Industry (어가의 고용량 결정요인 분석)

  • Kim, Tae-Hyun;Park, Cheol-Hyung;Nam, Jongoh
    • Environmental and Resource Economics Review
    • /
    • v.27 no.3
    • /
    • pp.545-567
    • /
    • 2018
  • This study applied and compared Poisson model, negative binomial model, zero inflated Poisson model, and zero inflated negative binomial model to estimate determinants of employed labour quantity. To estimate each of models, this study used fisheries census data which were obtained at microdata integrated service running by Statistics Korea. The study selected zero inflated negative binomial model according to the Vuong test and Likelihood-ratio test. In addition, the study estimated fishing village's practical changes on employed labour quantity as analyzing changes from 2010 to 2015. The results showed that the household with fishing vessels and high selling price had a significant effect on decrease of the labour quantities. Meanwhile, the longer work experience of the household, the more significant the increase in the labour quantities. In conclusion, this study presented that capitalized fishing household and the acceleration of aging had a significant impact on the change in the labour quantities.

Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data (영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용)

  • Lim, Ah-Kyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.505-519
    • /
    • 2006
  • We consider zero-inflated count data, which is discrete count data but has too many zeroes compared to the Poisson distribution. Zero-inflated data can be found in various areas. Despite its increasing importance in practice, appropriate statistical inference on zero-inflated data is limited. Classical inference based on a large number theory does not fit unless the sample size is very large. And regular Poisson model shows lack of St due to many zeroes. To handle the difficulties, a mixture of distributions are considered for the zero-inflated data. Specifically, a mixture of a point mass at zero and a Poisson distribution is employed for the data. In addition, when there exist meaningful covariates selected to the response variable, loglinear link is used between the mean of the response and the covariates in the Poisson distribution part. We propose a Bayesian inference for the zero-inflated Poisson regression model by using a Markov Chain Monte Carlo method. We applied the proposed method to a Korean oral hygienic data and compared the inference results with other models. We found that the proposed method is superior in that it gives small parameter estimation error and more accurate predictions.

Likelihood Ratio Test for the Epidemic Alternatives on the Zero-Inflated Poisson Model (변화시점이 있는 영과잉-포아송모형에서 돌출대립가설에 대한 우도비검정)

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.247-253
    • /
    • 1998
  • In ease of the epidemic Zero-Inflated Poisson model, likelihood ratio test was used for testing epidemic alternatives. Epidemic changepoints were estimated by the method of least squares. It were used for starting points to estimate the maximum likelihood estimators. And several parameters were compared through the Monte Carlo simulations. As a result, maximum likelihood estimators for the epidemic chaagepoints and several parameters are better than the least squares and moment estimators.

  • PDF

Prediction of K-league soccer scores using bivariate Poisson distributions (이변량 포아송분포를 이용한 K-리그 골 점수의 예측)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1221-1229
    • /
    • 2014
  • In this paper we choose the best model among several bivariate Poisson models on Korean soccer data. The models considered allow for correlation between the number of goals of two competing teams. We use an R package called bivpois for bivariate Poisson regression models and the data of K-league for season 1983-2012. Finally we conclude that the best fitted model supported by the AIC and BIC is the bivariate Poisson model with constant covariance. The zero and diagonal inflated models did not improve the model fit. The model can be used to examine home-away effect, goodness of fit, attack and defense parameters.

An application to Multivariate Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.177-186
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the correlated response variables are intrested, we have to extend the univariate zero-inflated regression model to multivariate model. In this paper, we study and simulate the multivariate zero-inflated regression model. A real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of multivariate zero-inflated Poisson regression model with the decision tree model.

  • PDF

An application to Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.1
    • /
    • pp.45-53
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the reponse variables have excess zeros, it is not easy to apply the Poisson regression model. In this paper, we study and simulate the zero-inflated Poisson regression model. An real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of zero-inflated Poisson model with the Poisson regression and decision tree model.

  • PDF

Integer-Valued GARCH Models for Count Time Series: Case Study (계수 시계열을 위한 정수값 GARCH 모델링: 사례분석)

  • Yoon, J.E.;Hwang, S.Y.
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.1
    • /
    • pp.115-122
    • /
    • 2015
  • This article is concerned with count time series taking values in non-negative integers. Along with the first order mean of the count time series, conditional variance (volatility) has recently been paid attention to and therefore various integer-valued GARCH(generalized autoregressive conditional heteroscedasticity) models have been suggested in the last decade. We introduce diverse integer-valued GARCH(INGARCH, for short) processes to count time series and a real data application is illustrated as a case study. In addition, zero inflated INGARCH models are discussed to accommodate zero-inflated count time series.

Estimation of Advertising Exposure Distribution by Zero-inflation Regression Models (영과잉 회귀모형을 이용한 광고노출분포 추정)

  • Lee, Dong-Hee
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2841-2852
    • /
    • 2018
  • This study examines regression modeling method using zero-inflated distribution in relation to estimation of exposure distribution required in advertisement media planning. Exposure distribution is the percentage of audiences that are exposed each time the ad is repeated. Such an exposure distribution plays a very important role in providing basic information necessary for calculating various indicators for quantitatively measuring the advertising effect. Especially, due to the decrease of advertising price and the spread of various media, the frequency of the advertisement or the broadcasting of specific advertisements has been greatly increased compared to the past. As a result, the frequency of exposure is relatively decreasing. In this situation, the number of individuals who are not exposed to the media, that is, are not exposed to advertising structurally is increasing. This research proposes advertising exposure distribution models using a zero-inflated regression model, and conducts a comparative study using actual cases.

A joint modeling of longitudinal zero-inflated count data and time to event data (경시적 영과잉 가산자료와 생존자료의 결합모형)

  • Kim, Donguk;Chun, Jihun
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1459-1473
    • /
    • 2016
  • Both longitudinal data and survival data are collected simultaneously in longitudinal data which are observed throughout the passage of time. In this case, the effect of the independent variable becomes biased (provided that sole use of longitudinal data analysis does not consider the relation between both data used) if the missing that occurred in the longitudinal data is non-ignorable because it is caused by a correlation with the survival data. A joint model of longitudinal data and survival data was studied as a solution for such problem in order to obtain an unbiased result by considering the survival model for the cause of missing. In this paper, a joint model of the longitudinal zero-inflated count data and survival data is studied by replacing the longitudinal part with zero-inflated count data. A hurdle model and proportional hazards model were used for each longitudinal zero inflated count data and survival data; in addition, both sub-models were linked based on the assumption that the random effect of sub-models follow the multivariate normal distribution. We used the EM algorithm for the maximum likelihood estimator of parameters and estimated standard errors of parameters were calculated using the profile likelihood method. In simulation, we observed a better performance of the joint model in bias and coverage probability compared to the separate model.

The study on the determinants of the number of job changes (중소기업 청년인턴 이직횟수 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.387-397
    • /
    • 2015
  • In this paper, the determinants of the number of job changes in the SMEs (small and medium enterprises) youth-intern project is analysed, utilizing SMEs youth-intern DB and employment insurance DB. Since the number of job changes are count data which take integer values other than negative values, general linear regression analysis becomes inappropriate. Therefore, four models such as Poisson regression model, zero inflated Poisson regression model, negative binomial regression model and zero inflated negative binomial regression model are tried to fit count data. A zero inflated negative binomial regression model is selected to be the best model. Major results are the followings. First, the number of job changes is shown to be significantly smaller in the treatment group than in the control group. Second, the number of job changes turns out to be significantly smaller in the young-age group than in the old-age group. Third, it is also shown that the number of job changes of man is significantly greater than that of woman. Lastly, the number of job changes in the bigger firm is shown to be significantly less than that of the smaller firm.