• Title/Summary/Keyword: 2-포아송 분포모형

Search Result 33, Processing Time 0.025 seconds

Application of the 2-Poisson Model to Full-Text Information Retrieval System (2-포아송 모형의 전문검색시스템 응용에 관한 연구)

  • 문성빈
    • Journal of the Korean Society for information Management
    • /
    • v.16 no.3
    • /
    • pp.49-63
    • /
    • 1999
  • The purpose of this study is to investigate whether the terms in queries are distributed according to the 2-Poisson model in the documents represented by abstract/title or full-text. In this study, retrieval experiments using Binary independence and 2-Poisson independence model, which are based on the probabilistic theory, were conducted to see if the 2-Poisson distribution of the query terms has an influence on the retrieval effectiveness, particularly of full-text information retrieval system.

  • PDF

A Study on the Applicability of 2-Poisson Model for Selecting Korean Subject Words (2-포아송 모형을 이용한 한글 주제어 선정에 관한 연구)

  • 정영미;최대식
    • Journal of the Korean Society for information Management
    • /
    • v.17 no.1
    • /
    • pp.129-148
    • /
    • 2000
  • Experiments were performed on three subsets of a Korean test collection in order to determine whether 2-Poisson model's Z value is a good measure for selecting subject words from a document to be indexed. It was found that subject word selection based on the Z value was effective for only one subset with short texts, i.e., the Science and Technology subset. Correlation analyses between 2-Poisson model's Z and TF.IDF weight for the three subsets showed that the correlation was relatively high for two test subsets with short texts, i.e., the Science and Technology subset and the Newspaper subset.

  • PDF

Inferences for the Changepoint in Bivariate Zero-Inflated Poisson Model (이변량 영과잉-포아송모형에서 변화시점에 관한 추론)

  • Kim, Kyung-Moon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.10 no.2
    • /
    • pp.319-327
    • /
    • 1999
  • Zero-Inflated Poisson distributions have been widely used for defect-free products in manufacturing processes. It is very interesting to check the shift after the unknown changepoint. If the detectives are caused by the two different types of factor, we should use bivariate zero-inflated model. In this paper, likelihood ratio tests were used to detect the shift of changes after the changepoint. Some inferences for the parameters in this model were made.

  • PDF

Subset Selection in the Poisson Models - A Normal Predictors case - (포아송 모형에서의 설명변수 선택문제 - 정규분포 설명변수하에서 -)

  • 박종선
    • The Korean Journal of Applied Statistics
    • /
    • v.11 no.2
    • /
    • pp.247-255
    • /
    • 1998
  • In this paper, a new subset selection problem in the Poisson model is considered under the normal predictors. It turns out that the subset model has bigger valiance than that of the Poisson model with random predictors and this has been used to derive new subset selection method similar to Mallows'$C_p$.

  • PDF

Comparative Study of Model Selection Using Bayes Factor through Simulation : Poisson vs. Negative Binomial Model Selection and Normal, Double Exponential vs. Cauchy Model Selection (시뮬레이션을 통한 베이즈요인에 의한 모형선택의 비교연구 : 포아송, 음이항모형의 선택과 정규, 이중지수, 코쉬모형의 선택)

  • 오미라;윤소영;심정욱;손영숙
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.2
    • /
    • pp.335-349
    • /
    • 2003
  • In this paper, we use Bayesian method for model selection of poisson vs. negative binomial distribution, and normal, double exponential vs. cauchy distribution. The fractional Bayes factor of O'Hagan (1995) was applied to Bayesian model selection under the assumption of noninformative improper priors for all parameters in the models. Through the analyses of real data and simulation data, we examine the usefulness of the fractional Bayes factor in comparison with intrinsic Bayes factors of Berger and Pericchi (1996, 1998).

Analysis of Drought Spatial Distribution Using Poisson Process (포아송과정을 이용한 가뭄의 공간분포 분석)

  • Yoo, Chul-Sang;Ahn, Jae-Hyun;Ryoo, So-Ra
    • Journal of Korea Water Resources Association
    • /
    • v.37 no.10
    • /
    • pp.813-822
    • /
    • 2004
  • This study quantifies and compares the drought return and duration characteristics by applying the Poisson process as well as based on by analyzing the observed data directly. The drought spatial distributions derived for the Gyunggi province are also compared. The monthly rainfall data are used to construct the SPI as a drought index. Especially, this study focuses on the evaluation of the Poisson process model when applying it to various data lengths such as in the spatial analysis 'of drought. Summarizing the results are as follows. (1) The Poisson process is found to be effective for the quantification of drought, especially when the data length is short. When applying the Poisson process, two neighboring sites are found insensitive to the data length to show similar drought characteristics, so the overall drought pattern becomes smoother than that derived directly from the observed data. (2) When the data length is very different site by site, the spatial analysis of drought based on a model application seems better than that based on the direct data analysis. This study also found more obvious spatial pattern of drought occurrence and duration when applying the Poisson process.

The Study for NHPP Software Reliability Growth Model based on Exponentiated Exponential Distribution (지수화 지수 분포에 의존한 NHPP 소프트웨어 신뢰성장 모형에 관한 연구)

  • Kim, Hee-Cheul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.5 s.43
    • /
    • pp.9-18
    • /
    • 2006
  • Finite failure NHPP models presented in the literature exhibit either constant, monotonic increasing or monotonic decreasing failure occurrence rates per fault. In this paper, Goel-Okumoto and Yamada-Ohba-Osaki model was reviewed, proposes the exponentiated exponential distribution reliability model, which maked out efficiency substituted for gamma and Weibull model(2 parameter shape illustrated by Gupta and Kundu(2001) Algorithm to estimate the parameters used to maximum likelihood estimator and bisection method, model selection based on SSE, AIC statistics and Kolmogorov distance, for the sake of efficient model, was employed. Analysis of failure using NTDS data set for the sake of proposing shape parameter of the exponentiated exponential distribution was employed. This analysis of failure data compared with the exponentiated exponential distribution model and the existing model (using arithmetic and Laplace trend tests, bias tests) is presented.

  • PDF

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

An application to Multivariate Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.177-186
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the correlated response variables are intrested, we have to extend the univariate zero-inflated regression model to multivariate model. In this paper, we study and simulate the multivariate zero-inflated regression model. A real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of multivariate zero-inflated Poisson regression model with the decision tree model.

  • PDF

The Comparative Study for Software Reliability Models Based on NHPP (NHPP에 기초한 소프트웨어 신뢰도 모형에 대한 비교연구)

  • Gan, Gwang-Hyeon;Kim, Hui-Cheol;Lee, Byeong-Su
    • The KIPS Transactions:PartD
    • /
    • v.8D no.4
    • /
    • pp.393-400
    • /
    • 2001
  • This paper presents a stochastic model for the software failure phenomenon based on a nonhomogeneous Poisson process (NHPP). The failure process is analyzed to develop a suitable mean value function for the NHPP ; expressions are given for several performance measure. Actual software failure data are compared with generalized model by Goel dependent on the constant reflecting the quality of testing. The performance measures and parametric inferences of the new models, Rayleigh and Gumbel distributions, are discussed. The results of the new models are applied to real software failure data and compared with Goel-Okumoto and Yamada, Ohba and Osaki models. Tools of parameter inference was used method of the maximun likelihood estimate and the bisection algorithm for the computing nonlinear root. In this paper, using the sum of the squared errors, model selection was employed. The numerical example by NTDS data was illustrated.

  • PDF