• Title/Summary/Keyword: 포아송 분포모형

Search Result 105, Processing Time 0.026 seconds

An application to Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.1
    • /
    • pp.45-53
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the reponse variables have excess zeros, it is not easy to apply the Poisson regression model. In this paper, we study and simulate the zero-inflated Poisson regression model. An real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of zero-inflated Poisson model with the Poisson regression and decision tree model.

  • PDF

Comparative Study of Model Selection Using Bayes Factor through Simulation : Poisson vs. Negative Binomial Model Selection and Normal, Double Exponential vs. Cauchy Model Selection (시뮬레이션을 통한 베이즈요인에 의한 모형선택의 비교연구 : 포아송, 음이항모형의 선택과 정규, 이중지수, 코쉬모형의 선택)

  • 오미라;윤소영;심정욱;손영숙
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.2
    • /
    • pp.335-349
    • /
    • 2003
  • In this paper, we use Bayesian method for model selection of poisson vs. negative binomial distribution, and normal, double exponential vs. cauchy distribution. The fractional Bayes factor of O'Hagan (1995) was applied to Bayesian model selection under the assumption of noninformative improper priors for all parameters in the models. Through the analyses of real data and simulation data, we examine the usefulness of the fractional Bayes factor in comparison with intrinsic Bayes factors of Berger and Pericchi (1996, 1998).

Analysis of Drought Spatial Distribution Using Poisson Process (포아송과정을 이용한 가뭄의 공간분포 분석)

  • Yoo, Chul-Sang;Ahn, Jae-Hyun;Ryoo, So-Ra
    • Journal of Korea Water Resources Association
    • /
    • v.37 no.10
    • /
    • pp.813-822
    • /
    • 2004
  • This study quantifies and compares the drought return and duration characteristics by applying the Poisson process as well as based on by analyzing the observed data directly. The drought spatial distributions derived for the Gyunggi province are also compared. The monthly rainfall data are used to construct the SPI as a drought index. Especially, this study focuses on the evaluation of the Poisson process model when applying it to various data lengths such as in the spatial analysis 'of drought. Summarizing the results are as follows. (1) The Poisson process is found to be effective for the quantification of drought, especially when the data length is short. When applying the Poisson process, two neighboring sites are found insensitive to the data length to show similar drought characteristics, so the overall drought pattern becomes smoother than that derived directly from the observed data. (2) When the data length is very different site by site, the spatial analysis of drought based on a model application seems better than that based on the direct data analysis. This study also found more obvious spatial pattern of drought occurrence and duration when applying the Poisson process.

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

The Effects of Dispersion Parameters and Test for Equality of Dispersion Parameters in Zero-Truncated Bivariate Generalized Poisson Models (제로절단된 이변량 일반화 포아송 분포에서 산포모수의 효과 및 산포의 동일성에 대한 검정)

  • Lee, Dong-Hee;Jung, Byoung-Cheol
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.3
    • /
    • pp.585-594
    • /
    • 2010
  • This study, investigates the effects of dispersion parameters between two response variables in zero-truncated bivariate generalized Poisson distributions. A Monte Carlo study shows that the zero-truncated bivariate Poisson and negative binomial models fit poorly wherein the zero-truncated bivariate count data has heterogeneous dispersion parameters on dependent variables. In addition, we derive the score test for testing the equality of the dispersion parameters and compare its efficiency with the likelihood ratio test.

The Study for NHPP Software Reliability Growth Model based on Exponentiated Exponential Distribution (지수화 지수 분포에 의존한 NHPP 소프트웨어 신뢰성장 모형에 관한 연구)

  • Kim, Hee-Cheul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.5 s.43
    • /
    • pp.9-18
    • /
    • 2006
  • Finite failure NHPP models presented in the literature exhibit either constant, monotonic increasing or monotonic decreasing failure occurrence rates per fault. In this paper, Goel-Okumoto and Yamada-Ohba-Osaki model was reviewed, proposes the exponentiated exponential distribution reliability model, which maked out efficiency substituted for gamma and Weibull model(2 parameter shape illustrated by Gupta and Kundu(2001) Algorithm to estimate the parameters used to maximum likelihood estimator and bisection method, model selection based on SSE, AIC statistics and Kolmogorov distance, for the sake of efficient model, was employed. Analysis of failure using NTDS data set for the sake of proposing shape parameter of the exponentiated exponential distribution was employed. This analysis of failure data compared with the exponentiated exponential distribution model and the existing model (using arithmetic and Laplace trend tests, bias tests) is presented.

  • PDF

The Comparative Study of Software Optimal Release Time Based on Log property Distribution (로그형 특성분포에 근거한 소프트웨어 최적 방출시기에 관한 비교 연구)

  • Kim, Hee-Cheul;Park, Hyoung-Keun
    • Proceedings of the KAIS Fall Conference
    • /
    • 2010.05a
    • /
    • pp.149-152
    • /
    • 2010
  • 본 연구에서는 소프트웨어 제품을 개발하여 테스팅을 거친 후 사용자에게 인도하는 시기를 결정하는 방출문제에 대하여 연구되었다. 인도시기에 관한 모형은 무한 고장 수에 의존하는 비동질적인 포아송 과정을 적용하였다. 이러한 포아송 과정은 소프트웨어의 결함을 제거하거나 수정 작업 중에도 새로운 결함이 발생될 가능성을 반영하는 모형이다. 적용모형은 여러 수명 분포들을 적합시키는데 효율적인 특성을 가진 콤페르쯔, 파레토, 로그-로지스틱 모형과 같은 로그형 특성분포를 이용하였다. 따라서 소프트웨어 요구 신뢰도를 만족시키고 소프트웨어 개발 및 유지 총비용을 최소화 시키는 방출시간이 최적 소프트웨어 방출 정책이 된다. 본 논문의 수치적인 예에서는 고장 간격 시간 자료를 적용하고 모수추정 방법은 최우추정법을 이용하여 최적 방출시기를 추정하였다.

  • PDF

The Comparative Study for Software Reliability Models Based on NHPP (NHPP에 기초한 소프트웨어 신뢰도 모형에 대한 비교연구)

  • Gan, Gwang-Hyeon;Kim, Hui-Cheol;Lee, Byeong-Su
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.393-400
    • /
    • 2001
  • This paper presents a stochastic model for the software failure phenomenon based on a nonhomogeneous Poisson process (NHPP). The failure process is analyzed to develop a suitable mean value function for the NHPP ; expressions are given for several performance measure. Actual software failure data are compared with generalized model by Goel dependent on the constant reflecting the quality of testing. The performance measures and parametric inferences of the new models, Rayleigh and Gumbel distributions, are discussed. The results of the new models are applied to real software failure data and compared with Goel-Okumoto and Yamada, Ohba and Osaki models. Tools of parameter inference was used method of the maximun likelihood estimate and the bisection algorithm for the computing nonlinear root. In this paper, using the sum of the squared errors, model selection was employed. The numerical example by NTDS data was illustrated.

  • PDF

An application to Multivariate Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.177-186
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the correlated response variables are intrested, we have to extend the univariate zero-inflated regression model to multivariate model. In this paper, we study and simulate the multivariate zero-inflated regression model. A real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of multivariate zero-inflated Poisson regression model with the decision tree model.

  • PDF

The Study of Software Optimal Release Time Based on Log-Logistic Distribution (로그로지스틱 분포특성에 근거한 소프트웨어 최적 방출시기에 관한 연구)

  • Kim, Hee-Cheul;Park, Hyoung-Keun
    • Proceedings of the KAIS Fall Conference
    • /
    • 2011.05a
    • /
    • pp.176-178
    • /
    • 2011
  • 본 연구에서는 소프트웨어 제품을 개발하여 테스팅을 거친 후 사용자에게 인도하는 시기를 결정하는 방출문제에 대하여 연구되었다. 인도시기에 관한 모형은 무한 고장수에 의존하는 비동질적인 포아송 과정을 적용하였다. 이러한 포아송 과정은 소프트웨어의 결함을 제거하거나 수정 작업 중에도 새로운 결함이 발생될 가능성을 반영하는 모형이다. 강도함수는 로그-로지스틱 패턴을 이용하였다. 따라서 소프트웨어 요구 신뢰도를 만족시키고 소프트웨어 개발 및 유지 총비용을 최소화 시키는 방출시간이 최적 소프트웨어 방출 정책이 된다.

  • PDF