• Title/Summary/Keyword: Poisson Mixture Model

Search Result 31, Processing Time 0.021 seconds

Classification Analysis in Information Retrieval by Using Gauss Patterns

  • Lee, Jung-Jin;Kim, Soo-Kwan
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.1-11
    • /
    • 2002
  • This paper discusses problems of the Poisson Mixture model which Is widely used to decide the effective words in judging relevant document. Gamma Distribution model and Gauss Patterns model as an alternative of the Poisson Mixture model are studied. Classification experiments by using TREC sub-collection, WSJ[1,2] with MGQUERY and AidSearch3.0 system are discussed.

Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data (영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용)

  • Lim, Ah-Kyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.505-519
    • /
    • 2006
  • We consider zero-inflated count data, which is discrete count data but has too many zeroes compared to the Poisson distribution. Zero-inflated data can be found in various areas. Despite its increasing importance in practice, appropriate statistical inference on zero-inflated data is limited. Classical inference based on a large number theory does not fit unless the sample size is very large. And regular Poisson model shows lack of St due to many zeroes. To handle the difficulties, a mixture of distributions are considered for the zero-inflated data. Specifically, a mixture of a point mass at zero and a Poisson distribution is employed for the data. In addition, when there exist meaningful covariates selected to the response variable, loglinear link is used between the mean of the response and the covariates in the Poisson distribution part. We propose a Bayesian inference for the zero-inflated Poisson regression model by using a Markov Chain Monte Carlo method. We applied the proposed method to a Korean oral hygienic data and compared the inference results with other models. We found that the proposed method is superior in that it gives small parameter estimation error and more accurate predictions.

Development of Modified Effective Crack Model to Take into Account for variation of Poisson's ratio and Low-Temperature Properties of Asphalt Concrete (포아슨 비의 변화를 고려한 수정 ECM 모델 개발 및 아스팔트 콘크리트의 저온 특성 연구)

  • Keon, Seung-Zun;Doh, Young-Soo;Kim, Kwang-Woo
    • International Journal of Highway Engineering
    • /
    • v.3 no.1 s.7
    • /
    • pp.185-197
    • /
    • 2001
  • This paper dealt with modification of effective crack length model (ECM) by adding Poisson's ratio term to evaluate fracture toughness of asphalt concrete which varies its material property by temperature. The original ECM model was developed for solid materials, such as cement concrete, and Poisson's ratio of materials was not considered. However, since asphalt concrete is sensitive to temperature variation and changes its Poisson's ratio by temperature, it should be taken into consideration to know exact fracture property under various temperatures. Four binders, including 3 polymer-modified asphalt (PMA) binders, were used to make a dense-grade asphalt mixture and 3-point bending test was peformed on notched beam at low temperatures, from -5oC to 35oC. Elastic modulus, flexural strength and fracture toughness were obtained from the test. The results showed that, since Poisson's ratio was considered, the more accurate test values could be obtained using modified ECM equation than original ECM. PMA mixture showed higher stiffness and fracture toughness than normal asphalt mixture under very low temperatures.

  • PDF

The Reanalysis of the Donation Data Using the Zero-Inflated Possion Regression (0이 팽창된 포아송 회귀모형을 이용한 기부회수 자료의 재분석)

  • Kim, In-Young;Park, Tae-Kyu;Kim, Byung-Soo
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.4
    • /
    • pp.819-827
    • /
    • 2009
  • Kim et al. (2006) analyzed the donation data surveyed by Voluneteer 21 in year 2002 at South Korea using a Poisson regression based on the mixture of two Poissons and detected significant variables for affecting the number of donations. However, noting the large deviation between the predicted and the actual frequencies of zero, we developed in this note a Poisson regression model based on a distribution in which zero inflated Poisson was added to the mixture of two Poissons. Thus the population distribution is now a mixture of three Poissons in which one component is concentrated on zero mass. We used the EM algorithm for estimating the regression parameters and detected the same variables with Kim et al's for significantly affecting the response. However, we could estimate the proportion of the fixed zero group to be 0.201, which was the characteristic of this model. We also noted that among two significant variables, the income and the volunteer experience(yes, no), the second variable could be utilized as a strategric variable for promoting the donation.

A Study on Optimal Release Time for Software Systems based on Mixture Weibull NHPP Model (혼합 와이블 NHPP 모형에 근거한 소프트웨어 최적방출시기에 관한 연구)

  • Lee, Sang Sik;Kim, Hee Cheul
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.6 no.2
    • /
    • pp.183-191
    • /
    • 2010
  • Decision problem called an optimal release policies, after testing a software system in development phase and transfer it to the user, is studied. The applied model of release time exploited infinite non-homogeneous Poisson process. This infinite non-homogeneous Poisson process is a model which reflects the possibility of introducing new faults when correcting or modifying the software. The failure life-cycle distribution used mixture which has various intensity, if the system is complicated. Thus, software release policies which minimize a total average software cost of development and maintenance under the constraint of satisfying a software reliability requirement becomes an optimal release policies. In a numerical example, after trend test applied and estimated the parameters using maximum likelihood estimation of inter-failure time data, estimated software optimal release time.

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

A Stochastic Model for Precipitation Occurrence Process of Hourly Precipitation Series (시간강수계열의 강수발생과정에 대한 추계학적 모형)

  • Lee, Jae-Jun;Lee, Jeong-Sik
    • Journal of Korea Water Resources Association
    • /
    • v.35 no.1
    • /
    • pp.109-124
    • /
    • 2002
  • This study is an effort to develop a stochastic model of precipitation series that preserves the pattern of occurrence of precipitation events throughout the year as well as several characteristics of the duration, amount, and intensity of precipitation events. In this study an event cluster model is used to describe the occurrence of precipitation events. A logarithmic negative mixture distribution is used to describe event duration and separation. The number of events within each cluster is also described by the Poisson cluster process. The duration of each event within a cluster and the separation of events within a single cluster are described by a logarithmic negative mixture distribution. The stochastic model for hourly precipitation occurrence process is fitted to historical precipitation data by estimating the model parameters. To allow for seasonal variations in the precipitation process, the model parameters are estimated separately for each month. an analysis of thirty-four years of historical and simulated hourly precipitation data for Seoul indicates that the stochastic model preserves many features of historical precipitation. The seasonal variations in number of precipitation events in each month for the historical and simulated data are also approximately identical. The marginal distributions for event characteristics for the historical and simulated data were similar. The conditional distributions for event characteristics for the historical and simulated data showed in general good agreement with each other.

Compressive Creep Properties of Reinforced Soil Mixture (보강혼합토의 압축 크리프 특성)

  • 이상호;차현주;김철영
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.44 no.6
    • /
    • pp.115-123
    • /
    • 2002
  • This study was performed to provide basic data for development and construction of reinforced soil wall that mixed with reinforcements such as calcium carbonate, monofilament fiber. In order to determine proper moisture content and mixing ratio by weight of reinforcement, Poisson's ratio and compressive strength tests for sandy soil had been conducted. Model tests for long-term behavior of reinforced soil wall were carried out to investigate the effect of reinforcement during loads and under static loads. The results of creep and model tests for sandy soil compared with clayey soil. Reinforced sandy soil mixed with calcium carbonate and cement showed brittle rupture by shear but that of mixed with monofilament fiber showed ductile rupture due to the tension force of fiber. It was shown that when age increased, creep strain of reinforced soil under sustained load approached constant values.

On NBUL class at specific age

  • Mahmoud, M.A.W.;Moshref, M.E.;Gadallah, A.M.
    • International Journal of Reliability and Applications
    • /
    • v.15 no.1
    • /
    • pp.11-22
    • /
    • 2014
  • New classes of life distributions called new better (worse) than used at age $t_0$ in Laplace transform order, NBUL- $t_0$(NWUL - $t_0$) are introduced. For the classes NBUL - $t_0$(NWUL - $t_0$), preservation under convolution, mixture, mixing and the homogeneous Poisson shock model are studied. In the sequel, we obtain a test for $H_0$ : F is exponential versus $H_1$ : F is NBUL - $t_0$ and not exponential. The critical values and the powers of this test are calculated to assess the performance of the test. It is shown that the proposed test has high efficiencies for some commonly used distributions in reliability. Sets of real data are used as examples to elucidate the use of the proposed test for practical problems.

  • PDF

On NBUmgf class at specific age

  • Gadallah, A.M.
    • International Journal of Reliability and Applications
    • /
    • v.17 no.2
    • /
    • pp.107-119
    • /
    • 2016
  • A new concept of aging classes namely new better (worse) than used at age $t_0$ in moment generating function order, $NBU_{mgf}-t_0$ ($NWU_{mgf}-t_0$) is introduced. For the classes $NBU_{mgf}-t_0$ ($NWU_{mgf}-t_0$), preservation under convolution, mixture, mixing and the homogeneous Poisson shock model are studied. In the sequel, nonparametric test is proposed, the asymptotic normality of the class is established and the asymptotic null variance is estimated. The percentiles and powers of this test are tabulated. The asymptotic efficiencies for some alternatives distributions are derived. Finally sets of real data are used as examples to elucidate the use of the proposed test in practical application.