• 제목/요약/키워드: Count model

검색결과 514건 처리시간 0.021초

Model Checking for Time-Series Count Data

  • Lee, Sung-Im
    • Communications for Statistical Applications and Methods
    • /
    • 제12권2호
    • /
    • pp.359-364
    • /
    • 2005
  • This paper considers a specification test of conditional Poisson regression model for time series count data. Although conditional models for count data have received attention and proposed in several ways, few studies focused on checking its adequacy. Motivated by the test of martingale difference assumption, a specification test via Ljung-Box statistic is proposed in the conditional model of the time series count data. In order to illustrate the performance of Ljung- Box test, simulation results will be provided.

Modeling clustered count data with discrete weibull regression model

  • Yoo, Hanna
    • Communications for Statistical Applications and Methods
    • /
    • 제29권4호
    • /
    • pp.413-420
    • /
    • 2022
  • In this study we adapt discrete weibull regression model for clustered count data. Discrete weibull regression model has an attractive feature that it can handle both under and over dispersion data. We analyzed the eighth Korean National Health and Nutrition Examination Survey (KNHANES VIII) from 2019 to assess the factors influencing the 1 month outpatient stay in 17 different regions. We compared the results using clustered discrete Weibull regression model with those of Poisson, negative binomial, generalized Poisson and Conway-maxwell Poisson regression models, which are widely used in count data analyses. The results show that the clustered discrete Weibull regression model using random intercept model gives the best fit. Simulation study is also held to investigate the performance of the clustered discrete weibull model under various dispersion setting and zero inflated probabilities. In this paper it is shown that using a random effect with discrete Weibull regression can flexibly model count data with various dispersion without the risk of making wrong assumptions about the data dispersion.

토빗모형을 이용한 가로구간 보행자 사고모형 개발 (Developing the Pedestrian Accident Models Using Tobit Model)

  • 이승주;김윤환;박병호
    • 한국도로학회논문집
    • /
    • 제16권3호
    • /
    • pp.101-107
    • /
    • 2014
  • PURPOSES : This study deals with the pedestrian accidents in case of Cheongju. The goals are to develop the pedestrian accident model. METHODS : To analyze the accident, count data models, truncated count data models and Tobit regression models are utilized in this study. The dependent variable is the number of accident. Independent variables are traffic volume, intersection geometric structure and the transportation facility. RESULTS : The main results are as follows. First, Tobit model was judged to be more appropriate model than other models. Also, these models were analyzed to be statistically significant. Second, such the main variables related to accidents as traffic volume, pedestrian volume, number of Entry/exit, number of crosswalk and bus stop were adopted in the above model. CONCLUSIONS : The optimal model for pedestrian accidents is evaluated to be Tobit model.

가산자료모형을 이용한 송정 해수욕장의 경제적 가치추정: - 비수기 해수욕장의 가치추정 - (Estimating the Economic Value of the Songieong Beach Using A Count Data Model: - Off-season Estimating Value of the Beach -)

  • 허윤정;이승래
    • 수산경영론집
    • /
    • 제38권2호
    • /
    • pp.79-101
    • /
    • 2007
  • The purpose of this study is to estimate the economic value of the Songieong Beach in Off-season, using a Individual Travel Cost Model(ITCM). Songieong Beach is located in Busan but far away from city. These days, however, the increased rate of traffic inflow to the Songieong beach and the five-day working week are reflected in the trend analysis. Moreover, people have changed psychological value. For that reason, visitors are on the increase on the beach in off-season. The ITCM is applied to estimate non-market value or environmental Good like a Contingent Valuation Method and Hedonic Price Model etc. The ITCM was derived from the Count Data Model(i.e. Poisson and Negative Binomial model). So this paper compares Poisson and negative binomial count data models to measure the tourism demands. The data for the study were collected from the Songjeong Beach on visitors over the a week from November 1 through November 23, 2006. Interviewers were instructed to interview only individuals. So the sample was taken in 113. A dependent variable that is defined on the non-negative integers and subject to sampling truncation is the result of a truncated count data process. This paper analyzes the effects of determinants on visitors' demand for exhibition using a class of maximum-likelihood regression estimators for count data from truncated samples, The count data and truncated models are used primarily to explain non-negative integer and truncation properties of tourist trips as suggested by the economic valuation literature. The results suggest that the truncated negative binomial model is improved overdispersion problem and more preferred than the other models in the study. This paper is not the same as the others. One thing is that Estimating Value of the Beach in off-season. The other thing is this study emphasizes in particular 'travel cost' that is not only monetary cost but also including opportunity cost of 'travel time'. According to the truncated negative binomial model, estimates the Consumer Surplus(CS) values per trip of about 199,754 Korean won and the total economic value was estimated to be 1,288,680 Korean won.

  • PDF

Modelling Count Responses with Overdispersion

  • Jeong, Kwang Mo
    • Communications for Statistical Applications and Methods
    • /
    • 제19권6호
    • /
    • pp.761-770
    • /
    • 2012
  • We frequently encounter outcomes of count that have extra variation. This paper considers several alternative models for overdispersed count responses such as a quasi-Poisson model, zero-inflated Poisson model and a negative binomial model with a special focus on a generalized linear mixed model. We also explain various goodness-of-fit criteria by discussing their appropriateness of applicability and cautions on misuses according to the patterns of response categories. The overdispersion models for counts data have been explained through two examples with different response patterns.

제로팽창 모형을 이용한 보험데이터 분석 (A Zero-Inated Model for Insurance Data)

  • 최종후;고인미;전수영
    • 응용통계연구
    • /
    • 제24권3호
    • /
    • pp.485-494
    • /
    • 2011
  • 계수(Count) 데이터는 반응변수가 음이 아닌 계수로, 자동차 사고건수나 지진이 일어난 횟수, 보험처리 발생건수 등을 말한다. 이런 경우에는 주로 포아송 회귀모형을 사용하지만, 평균과 분산이 동일한 경우만 이용될 수 있다는 제약이 따른다. 실증적 자료에서는 그룹 간 이질성으로 인해 분산이 매우 큰 과대산포(Overdispersion) 현상을 볼 수 있는데, 이를 무시할 경우 회귀계수나 표준오차가 편의되는 현상이 발생한다. 보험은 보장성 개념이 강하기 때문에 실제로 보험처리가 발생하지 않는 경우가 많아, 보험처리 건수에 '0'값이 있을 수 있다. 본 논문에서는 '0'값이 많은 자료의 분석을 위해 제로팽창 모형(Zero-Inflated Model)을 고려하고, 여러 모형들의 효율성을 실증자료를 통하여 비교하였다. 실증 자료 분석 결과, 과대산포와 제로팽창 현상이 존재하는 자료에서 제로팽창 음이항 모형(Zero-Inflated Negative Binomial Regression Model)이 가장 효율적인 모형임을 보여 주었다.

A Bayesian joint model for continuous and zero-inflated count data in developmental toxicity studies

  • Hwang, Beom Seuk
    • Communications for Statistical Applications and Methods
    • /
    • 제29권2호
    • /
    • pp.239-250
    • /
    • 2022
  • In many applications, we frequently encounter correlated multiple outcomes measured on the same subject. Joint modeling of such multiple outcomes can improve efficiency of inference compared to independent modeling. For instance, in developmental toxicity studies, fetal weight and number of malformed pups are measured on the pregnant dams exposed to different levels of a toxic substance, in which the association between such outcomes should be taken into account in the model. The number of malformations may possibly have many zeros, which should be analyzed via zero-inflated count models. Motivated by applications in developmental toxicity studies, we propose a Bayesian joint modeling framework for continuous and count outcomes with excess zeros. In our model, zero-inflated Poisson (ZIP) regression model would be used to describe count data, and a subject-specific random effects would account for the correlation across the two outcomes. We implement a Bayesian approach using MCMC procedure with data augmentation method and adaptive rejection sampling. We apply our proposed model to dose-response analysis in a developmental toxicity study to estimate the benchmark dose in a risk assessment.

Count Data Model을 이용한 중소기업의 정보화 효과 분석 (Analysis on the Effects of the Informatization Level on SMEs through Count Data Model)

  • 황순환
    • 한국IT서비스학회지
    • /
    • 제3권1호
    • /
    • pp.5-20
    • /
    • 2004
  • It has been known generally that investment in the extending ability to use the IT applications have further enhanced the productivity of effects of IT on firms by reducing costs, increasing returns, and increasing the speed of operations, etc. Notwithstanding this fact, it was very complex and difficult to evaluate concretely the effect of informatization of firm. SMEs(Small- & Medium-sized Enterprises) in particular. In this study, I point out the weakness of SMEs and analyze the effects of informatization through the count data model. For this analysis, I separate the effects into two part, such as organizational effect and personal effect. It comes to conclusion that organizational effect is larger than personal effect and the ability to practical use of IT systems is most efficient item related with informatization level. Since it will be important to cencentrate on raising this ability for heightening the competitiveness of SMEs.

영과잉 경시적 가산자료 분석을 위한 허들모형 (Hurdle Model for Longitudinal Zero-Inflated Count Data Analysis)

  • 진익태;이근백
    • 응용통계연구
    • /
    • 제27권6호
    • /
    • pp.923-932
    • /
    • 2014
  • 허들모형은 영이 과잉 가산자료를 분석하기 위해서 사용되어 왔다. 이 모형은 이산부분을 위한 로짓모형과 절삭된 가산부분을 위한 절삭된 포아송모형의 혼합모형이다. 이 논문에서 우리는 경시적 영과잉 가산자료를 분석하기 위해서 수정된 콜레스키 분해을 이용하여 일반적인 이분산성을 가지는 변량효과 공분산행렬을 제안한다. 수정된 콜레스키 분해는 변량효과 공분산행렬을 일반화자기상관 모수와 혁신분산모수로 분리되면, 이러한 모수들은 베이지안 일반화 선형모형을 통해 추정된다. 그리고 실제 자료분석을 통하여 설명한다.

타코펄스 불균일성으로 인한 펄스개수측정방법 영향성 (Tacho Pulse Non-uniformity Effects on Pulse Count Method)

  • 손준원
    • 한국항공우주학회지
    • /
    • 제49권4호
    • /
    • pp.301-309
    • /
    • 2021
  • 펄스개수측정방법은 반작용휠의 속도를 측정하는 고전적인 방법이다. 본 연구에서는 펄스개수측정방법을 수식으로 표현하였다. 반작용휠의 회전을 속도가 아니라 샘플링 시간 동안의 회전각도로 모델링하였다. 제안된 모델의 유효성은 모델에서 얻어진 펄스개수 변화와 이동평균의 효과가 기존 연구결과와 동일함을 확인하는 방법으로 검증하였다. 이렇게 검증된 모델에 타코펄스 불균일성을 추가하고 펄스개수측정방법의 오차에 대해서 살펴보았다. 불균일성으로 인해서 증가하는 측정오차의 크기를 수식으로 표현하였으며, 측정오차를 상쇄하기 위해서 취해야 하는 이동평균의 개수 조건을 제시하였다.