• Title/Summary/Keyword: Poisson 분포

Search Result 341, Processing Time 0.024 seconds

Moments of the Bivariate Zero-Inflated Poisson Distributions (이변량 영과잉-포아송 분포의 적률)

  • Kim, Kyung-Moo;Lee, Sung-Ho;Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.1
    • /
    • pp.47-56
    • /
    • 1998
  • Zero-Inflated Poisson models are mixed models of the Poisson and Bernoulli models. Recently Zero-Inflated Poisson distributions have been used frequently rather than previous Poisson distributions because the developement of industrial technology make few defects in manufacturing process. It is important that univariate Zero-Inflated Poisson distributions are extended to bivariate distributions to generalize the multivariate distributions. In this paper we proposed three types of the bivariate Zero-Inflated Poisson distributions and obtained these moments. We compared the three types of distributions by using the moments.

  • PDF

Soccer goal distributions in K-league (K-리그에서 축구 골의 분포)

  • Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1231-1239
    • /
    • 2014
  • In this paper we analyse the distributions of the number of goals scored by home teams and away teams in K-league soccer outcomes between 1983 and 2012. Real soccer data is explained in K-league using statistical distributions such that Poisson, negative binomial, extreme value and zero inflated Poisson. How close the goals of home and away fits the different distributions are tested by performing chi-square goodness of fit tests. According to these tests, the Poisson distribution gives the best fit to the home goals data. But it is best to model the away goals data on zero inflated Poisson distribution. Also, there is some weak evidence of the dependence for home and away goals.

A GLR Chart for Monitoring a Zero-Inflated Poisson Process (ZIP 공정을 관리하는 GLR 관리도)

  • Choi, Mi Lim;Lee, Jaeheon
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.2
    • /
    • pp.345-355
    • /
    • 2014
  • The number of nonconformities in a unit is commonly modeled by a Poisson distribution. As an extension of a Poisson distribution, a zero-inflated Poisson(ZIP) process can be used to fit count data with an excessive number of zeroes. In this paper, we propose a generalized likelihood ratio(GLR) chart to monitor shifts in the two parameters of the ZIP process. We also compare the proposed GLR chart with the combined cumulative sum(CUSUM) chart and the single CUSUM chart. It is shown that the overall performance of the GLR chart is comparable with CUSUM charts and is significantly better in some cases where the actual directions of the shifts are different from the pre-specified directions in CUSUM charts.

Development of probability distribution for simulation of monthly characteristics of torrential rainfall events (집중호우사상의 월별 발생특성 모의를 위한 확률분포 개발)

  • Kim, Sang Ug;Kim, Hyeong Bae
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2016.05a
    • /
    • pp.246-246
    • /
    • 2016
  • 본 연구에서는 최근 기후변화로 인한 집중호우의 발생횟수의 경향을 확률적으로 분석함에 있어 1개월 동안 80 mm/day 이상의 강우사상을 집중호우로 정의하여, 대구 및 부산 강우관측소로부터 수집된 384개월 동안의 집중호우를 분석하였다. 집중호우 월별 발생횟수와 같은 형식의 자료의 확률적 분석은 대개 Poisson 분포 (POI)가 사용되나 자료에 포함된 0자료의 과잉은 확률분포를 왜곡시키는 문제를 발생시킨다. 본 연구에서는 이 문제를 개선하기 위하여 개발된 일반화 Poisson 확률분포 (GPD), 0-과잉 Poisson 확률분포 (ZIP), 0-과잉 일반화 Poisson 확률분포 (ZIGP), Bayesian 0-과잉 일반화 Poisson 확률분포 (Bayesian ZIGP)를 집중호우 자료에 적용하고, 5개 모형의 특성을 비교분석하였으며, Bayesian ZIGP 모형의 구축에 있어서는 정보적 사전분포를 사용함으로써 모형의 정확도를 개선하였다. 분석결과 분석하고자 하는 자료에 0이 과다하게 포함되어 있는 경우 POI 및 GPD 분포는 관측결과와는 다른 결과를 제시하여 적절한 모형으로 고려되지 못함을 알 수 있었다. 5가지 모형 중 정보적 사전분포를 탑재한 Bayesian ZIGP 모형이 가장 관측 자료와 유사한 결과를 도출하였으나 모형의 구축에 수반되는 실용적인 측면을 고려하면 ZIP 모형도 충분히 사용될 수 있는 모형으로 추천되었다.

  • PDF

Analysis of torrential rainfall characteristics using 'zero-inflated models' ('0-과잉 모형'을 이용한 집중호우의 발생특성 분석)

  • Kim, Sang Ug
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.453-453
    • /
    • 2017
  • 본 연구에서는 최근 기후변화로 인한 집중호우의 발생횟수의 경향을 확률적으로 분석함에 있어 1개월 동안 80 mm/day 이상의 강우사상을 집중호우로 정의하여, 대구 및 부산 강우관측소로부터 수집된 384개월 동안의 집중호우를 분석하였다. 집중호우 월별 발생횟수와 같은 형식의 자료의 확률적 분석은 대개 Poisson 분포 (POI)가 사용되나 자료에 포함된 0자료의 과잉은 확률분포를 왜곡시키는 문제를 발생시킨다. 본 연구에서는 이 문제를 개선하기 위하여 개발된 일반화 Poisson 확률분포 (GPD), 0-과잉 Poisson 확률분포 (ZIP), 0-과잉 일반화 Poisson 확률분포 (ZIGP), Bayesian 0-과잉 일반화 Poisson 확률분포 (Bayesian ZIGP)를 집중호우 자료에 적용하고, 5개 모형의 특성을 비교분석하였으며, Bayesian ZIGP 모형의 구축에 있어서는 정보적 사전분포를 사용함으로써 모형의 정확도를 개선하였다. 분석결과 분석하고자 하는 자료에 0이 과다하게 포함되어 있는 경우 POI 및 GPD 분포는 관측결과와는 다른 결과를 제시하여 적절한 모형으로 고려되지 못함을 알 수 있었다. 5가지 모형 중 정보적 사전분포를 탑재한 Bayesian ZIGP 모형이 가장 관측 자료와 유사한 결과를 도출하였으나 모형의 구축에 수반되는 실용적인 측면을 고려하면 ZIP 모형도 충분히 사용될 수 있는 모형으로 추천되었다.

  • PDF

Comparison of probability distributions to analyze the number of occurrence of torrential rainfall events (집중호우사상의 발생횟수 분석을 위한 확률분포의 비교)

  • Kim, Sang Ug;Kim, Hyeung Bae
    • Journal of Korea Water Resources Association
    • /
    • v.49 no.6
    • /
    • pp.481-493
    • /
    • 2016
  • The statistical analysis to the torrential rainfall data that is defined as a rainfall amount more than 80 mm/day is performed with Daegu and Busan rainfall data which is collected during 384 months. The number of occurrence of the torrential rainfall events can be simulated usually using Poisson distribution. However, the Poisson distribution can be frequently failed to simulate the statistical characteristics of the observed value when the observed data is zero-inflated. Therefore, in this study, Generalized Poisson distribution (GPD), Zero-Inflated Poisson distribution (ZIP), Zero-Inflated Generalized Poisson distribution (ZIGP), and Bayesian ZIGP model were used to resolve the zero-inflated problem in the torrential rainfall data. Especially, in Bayesian ZIGP model, a informative prior distribution was used to increase the accuracy of that model. Finally, it was suggested that POI and GPD model should be discouraged to fit the frequency of the torrential rainfall data. Also, Bayesian ZIGP model using informative prior provided the most accurate results. Additionally, it was recommended that ZIP model could be alternative choice on the practical aspect since the Bayesian approach of this study was considerably complex.

On the Multivariate Poisson Distribution with Specific Covariance Matrix

  • Kim, Dae-Hak;Jeong, Heong-Chul;Jung, Byoung-Cheol
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.161-171
    • /
    • 2006
  • In this paper, we consider the random number generation method for multivariate Poisson distribution with specific covariance matrix. Random number generating method for the multivariate Poisson distribution is considered into two part, by first solving the linear equation to determine the univariate Poisson parameter, then convoluting independent univariate Poisson variates with appropriate expectations. We propose a numerical algorithm to solve the linear equation given the specific covariance matrix.

  • PDF

Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data (영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용)

  • Lim, Ah-Kyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.505-519
    • /
    • 2006
  • We consider zero-inflated count data, which is discrete count data but has too many zeroes compared to the Poisson distribution. Zero-inflated data can be found in various areas. Despite its increasing importance in practice, appropriate statistical inference on zero-inflated data is limited. Classical inference based on a large number theory does not fit unless the sample size is very large. And regular Poisson model shows lack of St due to many zeroes. To handle the difficulties, a mixture of distributions are considered for the zero-inflated data. Specifically, a mixture of a point mass at zero and a Poisson distribution is employed for the data. In addition, when there exist meaningful covariates selected to the response variable, loglinear link is used between the mean of the response and the covariates in the Poisson distribution part. We propose a Bayesian inference for the zero-inflated Poisson regression model by using a Markov Chain Monte Carlo method. We applied the proposed method to a Korean oral hygienic data and compared the inference results with other models. We found that the proposed method is superior in that it gives small parameter estimation error and more accurate predictions.

Bayesian Multiple Change-Point Estimation for Single Quantum Dot Luminescence Intensity Data (단일 양자점으로부터 발생한 발광세기 변화에 대한 베이지안 다중 변화점 추정)

  • Kima, Jaehee;Kimb, Hahkjoon
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.4
    • /
    • pp.569-579
    • /
    • 2013
  • In the field of single-molecule spectroscopy, it is essential to analyze luminescence Intensity changes that result from a single molecule. With the CdSe/ZnS core-shell structured quantum dot photon emission data Bayesian multiple change-point estimation is done with the gamma prior for Poisson parameters and truncated Poisson distribution for the number of change-points.

Analysis of Drought Spatial Distribution Using Poisson Process (포아송과정을 이용한 가뭄의 공간분포 분석)

  • Yoo, Chul-Sang;Ahn, Jae-Hyun;Ryoo, So-Ra
    • Journal of Korea Water Resources Association
    • /
    • v.37 no.10
    • /
    • pp.813-822
    • /
    • 2004
  • This study quantifies and compares the drought return and duration characteristics by applying the Poisson process as well as based on by analyzing the observed data directly. The drought spatial distributions derived for the Gyunggi province are also compared. The monthly rainfall data are used to construct the SPI as a drought index. Especially, this study focuses on the evaluation of the Poisson process model when applying it to various data lengths such as in the spatial analysis 'of drought. Summarizing the results are as follows. (1) The Poisson process is found to be effective for the quantification of drought, especially when the data length is short. When applying the Poisson process, two neighboring sites are found insensitive to the data length to show similar drought characteristics, so the overall drought pattern becomes smoother than that derived directly from the observed data. (2) When the data length is very different site by site, the spatial analysis of drought based on a model application seems better than that based on the direct data analysis. This study also found more obvious spatial pattern of drought occurrence and duration when applying the Poisson process.