• 제목/요약/키워드: Binomial random variable

검색결과 12건 처리시간 0.022초

RECURRENCE RELATIONS FOR HIGHER ORDER MOMENTS OF A COMPOUND BINOMIAL RANDOM VARIABLE

  • Kim, Donghyun;Kim, Yoora
    • East Asian mathematical journal
    • /
    • 제34권1호
    • /
    • pp.59-67
    • /
    • 2018
  • We present new recurrence formulas for the raw and central moments of a compound binomial random variable. Our approach involves relating two compound binomial random variables that have parameters with a difference of 1 for the number of trials, but which have the same parameters for the success probability for each trial. As a consequence of our recursions, the raw and central moments of a binomial random variable are obtained in a recursive manner without the use of Stirling numbers.

Estimating reliability in discrete distributions

  • Moon, Yeung-Gil;Lee, Chang-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권4호
    • /
    • pp.811-817
    • /
    • 2011
  • We shall introduce a general probability mass function which includes several discrete probability mass functions. Especially, when the random variable X is Poisson, binomial, and negative binomial random variables as some special cases of the introduced distribution, the maximum likelihood estimator (MLE) and the uniformly minimum variance unbiased estimator (UMVUE) of the probability P(X ${\leq}$ t) are considered. And the efficiencies of the MLE and the UMVUE of the reliability ar compared each other.

On Some Distributions Generated by Riff-Shuffle Sampling

  • Son M.S.;Hamdy H.I.
    • International Journal of Contents
    • /
    • 제2권2호
    • /
    • pp.17-24
    • /
    • 2006
  • The work presented in this paper is divided into two parts. The first part presents finite urn problems which generate truncated negative binomial random variables. Some combinatorial identities that arose from the negative binomial sampling and truncated negative binomial sampling are established. These identities are constructed and serve important roles when we deal with these distributions and their characteristics. Other important results including cumulants and moments of the distributions are given in somewhat simple forms. Second, the distributions of the maximum of two chi-square variables and the distributions of the maximum correlated F-variables are then derived within the negative binomial sampling scheme. Although multinomial theory applied to order statistics and standard transformation techniques can be used to derive these distributions, the negative binomial sampling approach provides more information and deeper insight regarding the nature of the relationship between the sampling vehicle and the probability distributions of these functions of chi-square variables. We also provide an algorithm to compute the percentage points of these distributions. We supplement our findings with exact simple computational methods where no interpolations are involved.

  • PDF

Effects on Regression Estimates under Misspecified Generalized Linear Mixed Models for Counts Data

  • Jeong, Kwang Mo
    • 응용통계연구
    • /
    • 제25권6호
    • /
    • pp.1037-1047
    • /
    • 2012
  • The generalized linear mixed model(GLMM) is widely used in fitting categorical responses of clustered data. In the numerical approximation of likelihood function the normality is assumed for the random effects distribution; subsequently, the commercial statistical packages also routinely fit GLMM under this normality assumption. We may also encounter departures from the distributional assumption on the response variable. It would be interesting to investigate the impact on the estimates of parameters under misspecification of distributions; however, there has been limited researche on these topics. We study the sensitivity or robustness of the maximum likelihood estimators(MLEs) of GLMM for counts data when the true underlying distribution is normal, gamma, exponential, and a mixture of two normal distributions. We also consider the effects on the MLEs when we fit Poisson-normal GLMM whereas the outcomes are generated from the negative binomial distribution with overdispersion. Through a small scale Monte Carlo study we check the empirical coverage probabilities of parameters and biases of MLEs of GLMM.

Weighted zero-inflated Poisson mixed model with an application to Medicaid utilization data

  • Lee, Sang Mee;Karrison, Theodore;Nocon, Robert S.;Huang, Elbert
    • Communications for Statistical Applications and Methods
    • /
    • 제25권2호
    • /
    • pp.173-184
    • /
    • 2018
  • In medical or public health research, it is common to encounter clustered or longitudinal count data that exhibit excess zeros. For example, health care utilization data often have a multi-modal distribution with excess zeroes as well as a multilevel structure where patients are nested within physicians and hospitals. To analyze this type of data, zero-inflated count models with mixed effects have been developed where a count response variable is assumed to be distributed as a mixture of a Poisson or negative binomial and a distribution with a point mass of zeros that include random effects. However, no study has considered a situation where data are also censored due to the finite nature of the observation period or follow-up. In this paper, we present a weighted version of zero-inflated Poisson model with random effects accounting for variable individual follow-up times. We suggested two different types of weight function. The performance of the proposed model is evaluated and compared to a standard zero-inflated mixed model through simulation studies. This approach is then applied to Medicaid data analysis.

계층 이항 로지스틱모형에 의한 고속도로 교통사고 심각도 분석 (Analysis of Traffic Crash Severity on Freeway Using Hierarchical Binomial Logistic Model)

  • 문승라;이영인
    • 한국도로학회논문집
    • /
    • 제13권4호
    • /
    • pp.199-209
    • /
    • 2011
  • 교통사고발생시 사고 심각도에 영향을 미치는 요인과 그 관계를 이해하는 것은 기하구조나 환경 측면에서 교통사고 발생을 예방하고 운전자와 사고 차량의 특성을 이해하는데 도움을 준다. 본 연구에서는 계층 이항 로지스틱모형에 의해 고속도로 교통사고 심각도에 영향을 미치는 요인을 파악하고 영향변수 간 차이를 나타내는 비교위험도(odds ratio)를 도출하였다. 사고 심각도는 인명피해와 차량피해로 구분하여 사망사고모형과 차량완파사고모형을 구축하였다, 종속변수는 사망자 발생과 완파차량 발생 여부이며, 각각 사고-탑승자, 사고-차량의 2수준 계층구조를 적용하였다. 추정 결과 설명변수의 고정효과는 두 모형이 유사한 결과를 보이나 종속변수의 속성에 따라 차별화된 결과를 나타내기도 하였다. 본선과 진출입부에서의 사고가 가장 위험하며, 중앙선 침범과 통행위반, 과속 사고의 상해나 차량 파손 위험도가 높고, 충돌사고와 추돌사고, 화재 사고의 피해가 크다. 사고 심각도는 노면 상태나 시야 조건 등 외부환경에 영향을 받으나 기하구조 조건은 관련이 없다.

유어낚시인구의 사회경제학적 특성과 출조빈도함수의 추정에 관한 연구 (A Study on the Socio-economic Characteristics of the Angler Population and the Estimation of A Fishing Frequency Function)

  • 박철형
    • 수산경영론집
    • /
    • 제36권1호
    • /
    • pp.81-101
    • /
    • 2005
  • This article is to estimate the fishing frequency function in Korean recreational fishery with respect to socio-economic characteristics of anglers. First, the study described the characteristics of the entire angler population on the view points of 9 socio-economic variables. And then, the study divided the total angler population into three groups of in-land, sea, and mixed angler populations in order to investigate the differences in their characteristics. The study could confirm the existence of differences in regions, size of regions, and educational levels between the in - land and the sea angler populations by testing heterogeneity in the frequency table. The fishing frequency function is estimated using Poisson regression model in order to accomodate the count data(non-negative discrete random variable) aspects of the fishing frequency. However, the model specification error is found due to overdispersion of data. The model exhibits the lack of goodness of fit. The negative binomial regression model is adopted to cure the overdispersion of the data as an alternative estimation methodology. Finally, the study can confirm overdispersion does not exist in the model any more and the goodness of fit improved significantly to the reasonable level. The results of estimation of fishing frequency population modeled by the negative binomial regression models are following. The three variables of region, sex, and education have effects on the decision making process of fishing frequency in the case of in-land recreation fishery. On the other hand, the three variables of sex, age, and marriage status do the same job in the case of sea angler population. Among the left-over variables, both income and use of Internet variables now affect on the process in mixed angler population. Finally, the results of whole angler population show that all of the previous variables are proven to be statistically significant due to the summation of data with all three sub-groups of angler population.

  • PDF

불완전 디버깅 환경에서의 이항 반응 계수 초기하분포 소프트웨어 신뢰성 성장 모델 (The Binomial Sensitivity Factor Hyper-Geometric Distribution Software Reliability Growth Model for Imperfect Debugging Environment)

  • 김성희;박중양;박재흥
    • 한국정보처리학회논문지
    • /
    • 제7권4호
    • /
    • pp.1103-1111
    • /
    • 2000
  • The hyper-geometric distribution software reliability growth model (HGDM) usually assumes that all the software faults detected are perfectly removed without introducing new faults. However, since new faults can be introduced during the test-and-debug phase, the perfect debugging assumption should be relaxed. In this context, Hou, Kuo and Chang [7] developed a modified HGDM for imperfect debugging environment, assuming tat the learning factor is constant. In this paper we extend the existing imperfect debugging HGDM for tow respects: introduction of random sensitivity factor and allowance of variable learning factor. Then the statistical characteristics of he suggested model are studied and its applications to two real data sets are demonstrated.

  • PDF

초기하분포 소프트웨어 신뢰성 성장 모델 : 일반화, 추정과 예측 (Hyper-Geometric Distribution Software Reliability Growth Model : Generalizatio, Estimation and Prediction)

  • 박중양;유창열;박재홍
    • 한국정보처리학회논문지
    • /
    • 제6권9호
    • /
    • pp.2343-2349
    • /
    • 1999
  • 최근에 개발되어 성공적으로 적용되고 있는 초기하분포 소프트웨어 신뢰성 성장 모델은 이 모델에서 중요한 역할을 하는 반응계수(sensitivity factor)를 추정 대상인 모수로 가정하고 있다. 본 논문은 먼저 디버깅과정의 무작위성을 반영하기 위해 반응계수를 이항분로를 하는 확률변수로 가정하여 초기하분포 신뢰성 성장 모델을 일반화한다. 이러한 일반화는 초기하분포 소프트웨어 신뢰성 성장 모델의 통계적 특성을 쉽게 파악할 수 있게 한다. 특히 일반화 된 모델의 모수를 최소자승법으로 추정하면 기존 모델에 최소자승법을 적용한 것과 같은 결과를 얻을 수 있음을 보이고, 더불어 최우추정치를 최소자승법으로 구하는 방법과 예측방법도 제시한다.

  • PDF