• Title/Summary/Keyword: 상관된 이항자료

Search Result 25, Processing Time 0.021 seconds

Fitting Bivariate Generalized Binomial Models of the Sarmanov Type (Sarmanov형 이변량 일반화이항모형의 적합)

  • Lee, Joo-Yong;Kim, Kee-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.271-280
    • /
    • 2009
  • For bivariate binomial data with both intra and inter-class correlation, Danaher and Hardie (2005) proposed a bivariate beta-binomial model. However, the model is limited to the situation where the intra-class correlation is strictly positive. Thus it might be seriously inadequate for data with a negative intra-class correlation. Several authors have considered generalized binomial distributions covering a wider range of intra-class correlation which could relax the possible model restrictions imposed. Among others there are the additive/multiplicative and the beta/extended beta binomial model. In this study, bivariate models of the Sarmanov (1966) type are formed by combining each of those univariate models to take care of the inter-class correlation, and are evaluated in terms of the goodness-of-fit. As a result, B-mB and B-ebB are fitted, successfully, to real data and that B-mB, which has a wider permissible range than B-ebB for the intra-class correlation is relatively preferred.

Comparative Simulation Studies on Generalized Binomial Models (일반화 이항모형의 적합도 평가)

  • Baik, E.J.;Kim, K.Y.
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.4
    • /
    • pp.507-516
    • /
    • 2011
  • Comparative studies on generalized binomial models (Moon, 2003; Ng, 1989; Paul, 1985; Kupper and Haseman, 1978; Griffiths, 1973) are restrictive in that the models compared are rather limited and MSE of the estimates is the only measure considered for the model adequacy. This paper is aimed to report simulation results which provide possible guidelines for selecting a proper model. We examine Pearson type of goodness-of-fit statistic to its degrees of freedom and AIC for the overall model quality. MSE and Bias of the individual estimates are also considered as the component fit measures. Performance of some models varies widely for a certain range of the parameter space while most of the models are quite competent. Our evaluation shows that the Extended Beta-Binomial model (Prentice, 1986) turns out to be particularly favorable in the point that it provides consistently excellent fit almost all over the values of the intra-class correlation coefficient and the probability of success.

On the Extension of Test Statistics for Detecting Negative Binomial Departures from the Poisson Assumption (포아송으로부터 부의 이항분포로의 이탈에 대한 검정통계량의 확장)

  • 이선호
    • Journal of the Korean Statistical Society
    • /
    • v.22 no.2
    • /
    • pp.171-190
    • /
    • 1993
  • 포아송분포로부터 부의 이항분포로의 이탈을 검색하는 통계량들이 자료의 형태에 따라 여러가지 제시되었다. 그런데 대립가설인 부의 이항분포의 모수화 방법에 따라 분산과 평균의 구조가 변하고 국소 최적 검정 통계량도 달라진다는 것이 알려졌다. 본 논문에서는 대립가설을 일반적인 포아송 혼합분포로까지 확장시키고, 일반적인 형태의 분산과 평균의 구조에도 검정 가능한 새로운 통계량 L을 소개하고 있다. 또한 L 통계량은 포아송 분포로부터 부의 이항분포로의 이탈을 다루는 기존의 여러 통계량들의 일반화된 형태임을 보였다. 점근적 상대효율과 모의 실험을 통하여 L 통계량과 기존의 통계량들을 비교한 결과 분산과 평균사이의 구조에 상관없이 L 통계량이 우수한 것임을 입증하였다.

  • PDF

Generalized Linear Mixed Model for Multivariate Multilevel Binomial Data (다변량 다수준 이항자료에 대한 일반화선형혼합모형)

  • Lim, Hwa-Kyung;Song, Seuck-Heun;Song, Ju-Won;Cheon, Soo-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.6
    • /
    • pp.923-932
    • /
    • 2008
  • We are likely to face complex multivariate data which can be characterized by having a non-trivial correlation structure. For instance, omitted covariates may simultaneously affect more than one count in clustered data; hence, the modeling of the correlation structure is important for the efficiency of the estimator and the computation of correct standard errors, i.e., valid inference. A standard way to insert dependence among counts is to assume that they share some common unobservable variables. For this assumption, we fitted correlated random effect models considering multilevel model. Estimation was carried out by adopting the semiparametric approach through a finite mixture EM algorithm without parametric assumptions upon the random coefficients distribution.

Traffic Accident Models of Cheongju Four-Legged Signalized Intersections by Accident Type (사고유형에 따른 청주시 4지 신호교차로 교통사고모형)

  • Park, Byung-Ho;Han, Sang-Wook;Kim, Tae-Young;Kim, Won-Ho
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.5
    • /
    • pp.153-162
    • /
    • 2008
  • This study deals with the traffic accidents at the 4-legged signalized intersections in Cheong-ju. The purpose is to comparatively analyze the characteristics and models by the accident type using the data of 143 intersections. In pursuing the above, this study gives particular emphasis to modeling such the accidents as head on collision, rear end collision, side swipe, side right angle collision, and others. The main results are the followings. First, the overdispersion tests show that the negative binomial regression models are appropriate to the traffic accident data in the above contexts. Second, five accident models are developed, which are all analyzed to be statistically significant. Finally, the models are comparatively evaluated using the common variable(ADT) and type-specific variables.

Heterogeneity Analysis of the Male Birth Ratio Data (남아 출생률 자료에 대한 이질성 분석)

  • Lim, Hwa-Kyung;Song, Seuck-Heun;Song, Ju-Won
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.365-373
    • /
    • 2009
  • Since 1990, identifying the sex of fetus and illegal abortion has brought the sex ratio imbalance at birth in Korea due to a notion of preferring a son to a daughter, socio-economic development, population policy, and so forth. Although there have been many researches such as time series analysis and region difference analysis to monitor this sex ratio imbalance, they have a defect that time and space could not be included in the analysis simultaneously. This study analyzes the sex ratio imbalance at birth, taking into account time and region at the same time. The analysis considered the numbers of male and female babies, who were born as the third or latter in their families, in 2000 and 2001 at 234 Gu / Si / Goon administrative districts. Here, we suggest a mixture model of binomial distributions, assuming heterogeneous populations. The estimation of the location parameters, weights and correlation coefficient of the mixture model is conducted by the EM algorithm, and the heterogeneity of the regions is expressed as a picture using ArcView GIS.

Accident Models of Rotary by Vehicle Type (차량유형별 로터리 사고모형)

  • Han, Su-San;Park, Byeong-Ho
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.6
    • /
    • pp.67-74
    • /
    • 2011
  • This study deals with the traffic accidents data from the Korean rotaries (circular intersections) to verify their characteristics affected by different vehicle types. This paper categorized the data into three groups based on vehicle types, and developed a set of accident models. The paper proposed two ZIP models and one negative binomial model through a statistical analysis for three vehicle types: automobile, truck and van, and others. The differences among those models were then statistically compared.

Zero In ated Poisson Model for Spatial Data (영과잉 공간자료의 분석)

  • Han, Junhee;Kim, Changhoon
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.2
    • /
    • pp.231-239
    • /
    • 2015
  • A Poisson model is the first choice for counts data. Quasi Poisson or negative binomial models are usually used in cases of over (or under) dispersed data. However, these models might be unsuitable if the data consist of excessive number of zeros (zero inflated data). For zero inflated counts data, Zero Inflated Poisson (ZIP) or Zero Inflated Negative Binomial (ZINB) models are recommended to address the issue. In this paper, we further considered a situation where zero inflated data are spatially correlated. A mixed effect model with random effects that account for spatial autocorrelation is used to fit the data.

Accident Models of Circular Intersections by Type in Korea (사고유형에 따른 원형교차로 사고모형)

  • Han, Su-San;Kim, Kyung-Hwan;Park, Byung-Ho
    • International Journal of Highway Engineering
    • /
    • v.13 no.3
    • /
    • pp.103-110
    • /
    • 2011
  • This study deals with the traffic accidents by type. The objectives are to analyze the characteristics of 2 accident types, and to develop the models by type. In pursuing the above, this paper gives particular attentions to testing the differences between by type two groups, and developing the models (Poisson and negative binomial regressions) using the data of domestic circular intersections. The main results are as follows. First, the number of accidents in vehicle vehicle was analyzed to account for about 73.41% of total and to be higher than vehicle people. Second, two Poisson models and two negative binomial models which were all statistically significant were developed using vehicle people accidents and vehicle vehicle accidents as dependant variables. Finally, the traffic volume as common variable was selected in the models, and right-turn slip lane, speed hump, the number of driveways, the number of pedestrian crossings as specific variables of the models were selected.

Exploring interaction using 3-D residual plots in logistic regression model (3차원 잔차산점도를 이용한 로지스틱회귀모형에서 교호작용의 탐색)

  • Kahng, Myung-Wook
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.177-185
    • /
    • 2014
  • Under bivariate normal distribution assumptions, the interaction and quadratic terms are needed in the logistic regression model with two predictors. However, depending on the correlation coefficient and the variances of two conditional distributions, the interaction and quadratic terms may not be necessary. Although the need for these terms can be determined by comparing the two scatter plots, it is not as useful for interaction terms. We explore the structure and usefulness of the 3-D residual plot as a tool for dealing with interaction in logistic regression models. If predictors have an interaction effect, a 3-D residual plot can show the effect. This is illustrated by simulated and real data.