• Title/Summary/Keyword: Correlated binary data

Search Result 58, Processing Time 0.023 seconds

Comparison of Three Binomial-related Models in the Estimation of Correlations

  • Moon, Myung-Sang
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.585-594
    • /
    • 2003
  • It has been generally recognized that conventional binomial or Poisson model provides poor fits to the actual correlated binary data due to the extra-binomial variation. A number of generalized statistical models have been proposed to account for this additional variation. Among them, beta-binomial, correlated-binomial, and modified-binomial models are binomial-related models which are frequently used in modeling the sum of n correlated binary data. In many situations, it is reasonable to assume that n correlated binary data are exchangeable, which is a special case of correlated binary data. The sum of n exchangeable correlated binary data is modeled relatively well when the above three binomial-related models are applied. But the estimation results of correlation coefficient turn to be quite different. Hence, it is important to identify which model provides better estimates of model parameters(success probability, correlation coefficient). For this purpose, a small-scale simulation study is performed to compare the behavior of above three models.

Bayesian Analysis of a New Skewed Multivariate Probit for Correlated Binary Response Data

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.4
    • /
    • pp.613-635
    • /
    • 2001
  • This paper proposes a skewed multivariate probit model for analyzing a correlated binary response data with covariates. The proposed model is formulated by introducing an asymmetric link based upon a skewed multivariate normal distribution. The model connected to the asymmetric multivariate link, allows for flexible modeling of the correlation structure among binary responses and straightforward interpretation of the parameters. However, complex likelihood function of the model prevents us from fitting and analyzing the model analytically. Simulation-based Bayesian inference methodologies are provided to overcome the problem. We examine the suggested methods through two data sets in order to demonstrate their performances.

  • PDF

Investigation of Biases for Variance Components on Multiple Traits with Varying Number of Categories in Threshold Models Using Bayesian Inferences

  • Lee, D.H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.15 no.7
    • /
    • pp.925-931
    • /
    • 2002
  • Gibbs sampling algorithms were implemented to the multi-trait threshold animal models with any combinations of multiple binary, ordered categorical, and linear traits and investigate the amount of bias on these models with two kinds of parameterization and algorithms for generating underlying liabilities. Statistical models which included additive genetic and residual effects as random and contemporary group effects as fixed were considered on the models using simulated data. The fully conditional posterior means of heritabilities and genetic (residual) correlations were calculated from 1,000 samples retained every 10th samples after 15,000 samples discarded as "burn-in" period. Under the models considered, several combinations of three traits with binary, multiple ordered categories, and continuous were analyzed. Five replicates were carried out. Estimates for heritabilities and genetic (residual) correlations as the posterior means were unbiased when underlying liabilities for a categorical trait were generated given by underlying liabilities of the other traits and threshold estimates were rescaled. Otherwise, when parameterizing threshold of zero and residual variance of one for binary traits, heritability estimates were inflated 7-10% upward. Genetic correlation estimates were biased upward if positively correlated and downward if negatively correlated when underling liabilities were generated without accounting for correlated traits on prior information. Residual correlation estimates were, consequently, much biased downward if positively correlated and upward if negatively correlated in that case. The more categorical trait had categories, the better mixing rate was shown.

Correlation of Liquid-Liquid Equilibrium of Four Binary Hydrocarbon-Water Systems, Using an Improved Artificial Neural Network Model

  • Lv, Hui-Chao;Shen, Yan-Hong
    • Journal of the Korean Chemical Society
    • /
    • v.57 no.3
    • /
    • pp.370-376
    • /
    • 2013
  • A back propagation artificial neural network model with one hidden layer is established to correlate the liquid-liquid equilibrium data of hydrocarbon-water systems. The model has four inputs and two outputs. The network is systematically trained with 48 data points in the range of 283.15 to 405.37K. Statistical analyses show that the optimised neural network model can yield excellent agreement with experimental data(the average absolute deviations equal to 0.037% and 0.0012% for the correlated mole fractions of hydrocarbon in two coexisting liquid phases respectively). The comparison in terms of average absolute deviation between the correlated mole fractions for each binary system and literature results indicates that the artificial neural network model gives far better results. This study also shows that artificial neural network model could be developed for the phase equilibria for a family of hydrocarbon-water binaries.

Comparison of Lasso Type Estimators for High-Dimensional Data

  • Kim, Jaehee
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.4
    • /
    • pp.349-361
    • /
    • 2014
  • This paper compares of lasso type estimators in various high-dimensional data situations with sparse parameters. Lasso, adaptive lasso, fused lasso and elastic net as lasso type estimators and ridge estimator are compared via simulation in linear models with correlated and uncorrelated covariates and binary regression models with correlated covariates and discrete covariates. Each method is shown to have advantages with different penalty conditions according to sparsity patterns of regression parameters. We applied the lasso type methods to Arabidopsis microarray gene expression data to find the strongly significant genes to distinguish two groups.

Binary Vapor-Liquid Equilibria and Ternary Liquid-Liquid Equilibria for NMF Contained Systems (NMF를 포함하는 이성분계의 등온 기-액 평형과 삼성분계 액-액 평형)

  • Park, So-Jin;Han, Kyu-Jin;Won, Dong-Bok;Oh, Jong-Hyeok;Choi, Young-Yoon
    • Korean Chemical Engineering Research
    • /
    • v.43 no.2
    • /
    • pp.259-265
    • /
    • 2005
  • Binary isothermal vapor-liquid equilibrium(VLE) data were measured for water+n-methylformamide(NMF), benzene+NMF and toluene+NMF systems by using headspace gas chromatography(HSGC) at 353.15K. Additionally, the ternary liquid-liquid Equilibrium(LLE) data were determined by measuring of tie-line for the systems of NMF+benzene+n-heptane and NMF+toluene+n-heptane at 298.15 K. The measured isothermal binary VLE data have no azeotropes and were correlated well with $g^E$ model equations of Margules, van Laar, Wilson, NRTL and UNIQUAC. The experimental ternary tie line data were also correlated well with NRTL and UNIQUAC models. Besides their accuracy was analyzed by Hirata-Fujita and Maior-Swenson equations.

Phase Equilibrium of Binary Mixture for the (Carbon Dioxide + 1-Phenyl-2-Pyrrolidone) System at High Pressure

  • Lee, Ho;Jeong, Jong-Dae;Byun, Hun-Soo
    • Korean Chemical Engineering Research
    • /
    • v.56 no.5
    • /
    • pp.732-737
    • /
    • 2018
  • Experimental data of phase equilibria are reported for the binary mixture of 1-phenyl-2-pyrrolidone in supercritical carbon dioxide. Phase behavior data was measured in a synthetic method at a temperature ranging from 333.2 to 393.2 K and at pressures up to 97.14 MPa. The solubility of 1-phenyl-2-pyrrolidone in the carbon dioxide + 1-phenyl-2-pyrrolidone system increased as temperature increased at a constant pressure and it exhibited the type-I phase behavior. The experimental data for the binary mixture were correlated with the Peng-Robinson equation of state using mixing rule and the critical properties of 1-phenyl-2-pyrrolidone were predicted with the Joback and Lyderson method.

Interval prediction on the sum of binary random variables indexed by a graph

  • Park, Seongoh;Hahn, Kyu S.;Lim, Johan;Son, Won
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.3
    • /
    • pp.261-272
    • /
    • 2019
  • In this paper, we propose a procedure to build a prediction interval of the sum of dependent binary random variables over a graph to account for the dependence among binary variables. Our main interest is to find a prediction interval of the weighted sum of dependent binary random variables indexed by a graph. This problem is motivated by the prediction problem of various elections including Korean National Assembly and US presidential election. Traditional and popular approaches to construct the prediction interval of the seats won by major parties are normal approximation by the CLT and Monte Carlo method by generating many independent Bernoulli random variables assuming that those binary random variables are independent and the success probabilities are known constants. However, in practice, the survey results (also the exit polls) on the election are random and hardly independent to each other. They are more often spatially correlated random variables. To take this into account, we suggest a spatial auto-regressive (AR) model for the surveyed success probabilities, and propose a residual based bootstrap procedure to construct the prediction interval of the sum of the binary outcomes. Finally, we apply the procedure to building the prediction intervals of the number of legislative seats won by each party from the exit poll data in the $19^{th}$ and $20^{th}$ Korea National Assembly elections.

Densities, Viscosities and Excess Properties of 2-Bromopropane - Methanol Binary Mixtures at Temperature from (298.15 to 318.15) K (298.15~318.15 K 에서 2-브로모프로판-메탄올 이성분 혼합물의 밀도, 점성도, 여분 성질)

  • Li, Hua;Zhang, Zhen;Zhao, Lei
    • Journal of the Korean Chemical Society
    • /
    • v.54 no.1
    • /
    • pp.71-76
    • /
    • 2010
  • The densities and viscosities of 2-bromopropane-methanol binary mixtures had been determined using an digital vibrating U-tube densimeter and Ubbelohde capillary viscometer respectively from (298.15 to 318.15) K. The dependence of densities and viscosities on temperature and concentration had been correlated. The excess molar volume and the excess viscosity of the binary system were calculated from the experimental density and viscosity data. The excess molar volumes were related to compositions by polynomial regression and regression parameters and total RMSD deviations were obtained; the excess viscosities was related to compositions by Redlich-Kister equation and regression coefficients and total RMSD deviation of the excess viscosity for 2-bromopropane and methanol binary system were obtained. The results showed that the model agreed very well with the experimental data.

Ultrasonic Speed and Isentropic Compressibility of 2-propanol with Hydrocarbons at 298.15 and 308.15 K

  • Gahlyan, Suman;Verma, Sweety;Rani, Manju;Maken, Sanjeev
    • Korean Chemical Engineering Research
    • /
    • v.55 no.5
    • /
    • pp.668-678
    • /
    • 2017
  • Intermolecular interactions were studied for binary mixtures of 2-propanol + cyclohexane, n-hexane, benzene, toluene, o-, m- and p-xylenes by measuring ultrasonic speeds (u) over the entire range of composition at 298.15 K and 308.15 K. From these results the deviation in ultrasonic speed was calculated. These results were fitted to the Redlich-Kister equation to derive the binary coefficients along with standard deviations between the experimental and calculated data. Acoustic parameters such as excess isentropic compressibility ($K_s^E$), intermolecular free length ($L_f$) and available volume ($V_a$) were also derived from ultrasonic speed data and Jacobson's free length theory. The ultrasonic speed data were correlated by Nomoto's relation, Van Dael's mixing relation, impedance dependence relation, and Schaaff's collision factor theory. Van Dael's relation gives the best prediction of u in the binary mixtures containing aliphatic hydrocarbons. The ultrasonic speed data and isentropic compressibility were further analyzed in terms of Jacobson's free length theory.