• Title/Summary/Keyword: 경시적 자료분석 (longitudinal data analysis)

A longitudinal data analysis for child academic achievement with Korea welfare panel study data (경시적 자료를 이용한 아동 학업성취도 분석)

  • Lee, Naeun;Huh, Jib
    • Journal of the Korean Data and Information Science Society / v.28 no.1 / pp.1-10 / 2017
  • Longitudinal data on Korean child academic achievement have previously been used to find significant explanatory variables under the assumption that the repeated measurements are independent. Using the explanatory variables from previous research, we fit a linear mixed model incorporating fixed and random effects for child academic achievement to detect the significant explanatory variables. The Korea Welfare Panel Study data were observed three times between 2006 and 2012 through an additional survey for children. Child academic achievement is evaluated as the sum of the achievement scores in Korean, English, and Mathematics. We also investigate multicollinearity and the missing-data mechanism, and select several popular correlation matrices for fitting the linear mixed model.

Estimation of the joint conditional distribution for repeatedly measured bivariate cholesterol data using Gaussian copula (가우시안 코플라를 이용한 반복측정 이변량 자료의 조건부 결합 분포 추정)

  • Kwak, Minjung
    • The Korean Journal of Applied Statistics / v.30 no.2 / pp.203-213 / 2017
  • We study estimation and inference of joint conditional distributions of bivariate longitudinal outcomes using regression models and copulas. We consider a class of time-varying transformation models and combine the two marginal models using Gaussian copulas to estimate the joint models. Our models and estimation method can be applied in many situations where the conditional mean-based models are inadequate. Gaussian copulas combined with time-varying transformation models may allow convenient and easy-to-interpret modeling for the joint conditional distributions for bivariate longitudinal data. We apply our method to an epidemiological study of repeatedly measured bivariate cholesterol data.
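
The role of the Gaussian copula in joining two marginal models can be illustrated with a minimal sketch. It evaluates only the bivariate Gaussian copula density c(u, v; rho), which multiplies the two marginal densities to give a joint density. The function name and the fixed correlation `rho` are assumptions made for illustration; the time-varying transformation models and the estimation procedure described in the abstract are not reproduced here.

```python
import math
from statistics import NormalDist


def gaussian_copula_density(u, v, rho):
    """Density of the bivariate Gaussian copula at (u, v).

    u, v : marginal CDF values, each strictly between 0 and 1
    rho  : copula correlation, strictly between -1 and 1 (assumed fixed here)
    """
    if not (0.0 < u < 1.0 and 0.0 < v < 1.0):
        raise ValueError("u and v must lie strictly between 0 and 1")
    if not -1.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between -1 and 1")

    std_normal = NormalDist()
    # Map the uniform margins to standard normal scores.
    x = std_normal.inv_cdf(u)
    y = std_normal.inv_cdf(v)

    one_minus_r2 = 1.0 - rho * rho
    # Ratio of the bivariate normal density to the product of its margins.
    exponent = -(rho * rho * (x * x + y * y) - 2.0 * rho * x * y) / (2.0 * one_minus_r2)
    return math.exp(exponent) / math.sqrt(one_minus_r2)


# With rho = 0 the copula is the independence copula, so the density is 1.
independent = gaussian_copula_density(0.3, 0.8, 0.0)
# At the medians (u = v = 0.5) the density reduces to 1 / sqrt(1 - rho^2).
at_medians = gaussian_copula_density(0.5, 0.5, 0.6)
```

Given marginal CDFs F1, F2 and densities f1, f2, the joint density would then be c(F1(y1), F2(y2); rho) * f1(y1) * f2(y2).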

Hurdle Model for Longitudinal Zero-Inflated Count Data Analysis (영과잉 경시적 가산자료 분석을 위한 허들모형)

  • Jin, Iktae;Lee, Keunbaik
    • The Korean Journal of Applied Statistics / v.27 no.6 / pp.923-932 / 2014
  • The hurdle model can be used to analyze zero-inflated count data. It combines a logit model for the binary component (zero versus positive) with a truncated Poisson model for the positive count component. We propose a new hurdle model with a general heterogeneous random effects covariance matrix to analyze longitudinal zero-inflated count data using the modified Cholesky decomposition. This decomposition factors the random effects covariance matrix into generalized autoregressive parameters and innovation variances. The parameters are modeled using (generalized) linear models and estimated with a Bayesian method. We apply these methods to a real dataset.
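
The two-part structure of a hurdle model can be sketched as a probability mass function. This is a simplified illustration without random effects or covariates, so it is not the model proposed in the paper; the function names and the parameters `p_positive` and `lam` are assumptions chosen for the example.

```python
import math


def logistic(eta):
    """Inverse logit: maps a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-eta))


def hurdle_poisson_pmf(k, p_positive, lam):
    """P(Y = k) under a hurdle Poisson model.

    p_positive : probability that the count is positive (binary part,
                 e.g. logistic(eta) from a logit model)
    lam        : rate of the zero-truncated Poisson for positive counts
    """
    if k < 0:
        raise ValueError("k must be a non-negative integer")
    if k == 0:
        # Zeros come only from the binary part.
        return 1.0 - p_positive
    # Ordinary Poisson probability, rescaled to exclude the zero outcome.
    poisson_k = math.exp(-lam) * lam ** k / math.factorial(k)
    return p_positive * poisson_k / (1.0 - math.exp(-lam))


# Hypothetical parameter values, for illustration only.
p = logistic(-0.4)
total = sum(hurdle_poisson_pmf(k, p, 2.5) for k in range(60))
```

Because the zero-truncated Poisson sums to one over k = 1, 2, ..., the probabilities from both parts together sum to one, which `total` checks numerically.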

Estimation of the joint conditional distribution for repeatedly measured bivariate cholesterol data using nonparametric copula (비모수적 코플라를 이용한 반복측정 이변량 자료의 조건부 결합 분포 추정)

  • Kwak, Minjung
    • Journal of the Korean Data and Information Science Society / v.27 no.3 / pp.689-700 / 2016
  • We study estimation and inference of the joint conditional distributions of bivariate longitudinal outcomes using regression models and copulas. To estimate the marginal models, we consider a class of time-varying transformation models and combine the two marginal models using nonparametric empirical copulas. Regression parameters in the transformation model can be obtained as the solution of estimating equations. Our models and estimation method can be applied in many situations where conditional mean-based models are inadequate. Nonparametric copulas combined with time-varying transformation models may allow flexible modeling of the joint conditional distributions for bivariate longitudinal data. We apply our method to an epidemiological study of repeatedly measured bivariate cholesterol data.

A Bayesian zero-inflated negative binomial regression model based on Pólya-Gamma latent variables with an application to pharmaceutical data (폴랴-감마 잠재변수에 기반한 베이지안 영과잉 음이항 회귀모형: 약학 자료에의 응용)

  • Seo, Gi Tae;Hwang, Beom Seuk
    • The Korean Journal of Applied Statistics / v.35 no.2 / pp.311-325 / 2022
  • Excess zeros often occur in count responses across various research fields. The zero-inflated model is a common choice for modeling such count data. Bayesian inference for the zero-inflated model has long been recognized as a hard problem because the conditional posterior distribution is not available in closed form. Recently, however, Pillow and Scott (2012) and Polson et al. (2013) proposed a Pólya-Gamma data-augmentation strategy for logistic and negative binomial models, facilitating Bayesian inference for the zero-inflated model. We apply a Bayesian zero-inflated negative binomial regression model to longitudinal pharmaceutical data previously analyzed by Min and Agresti (2005). To facilitate posterior sampling for the longitudinal zero-inflated model, we use the Pólya-Gamma data-augmentation strategy.

Comparison of the covariance matrix for general linear model (일반 선형 모형에 대한 공분산 행렬의 비교)

  • Nam, Sang Ah;Lee, Keunbaik
    • The Korean Journal of Applied Statistics / v.30 no.1 / pp.103-117 / 2017
  • In longitudinal data analysis, the serial correlation of repeated outcomes must be taken into account using a covariance matrix. Modeling the covariance matrix is important for estimating the effects of covariates properly. However, it is challenging because the matrix has many parameters and the estimated covariance matrix must be positive definite. To overcome these restrictions, several Cholesky decomposition approaches for the covariance matrix have been proposed: the modified autoregressive (AR), moving average (MA), and ARMA Cholesky decompositions. In this paper we review them and compare their performance using simulation studies.
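
Why the autoregressive form of the modified Cholesky decomposition sidesteps the positive-definiteness constraint can be shown with a small sketch. It builds a covariance matrix from unconstrained autoregressive coefficients and positive innovation variances, using the recursion y_t = sum over j < t of phi[t][j] * y_j + e_t. The function name and input layout are assumptions for illustration; the MA and ARMA variants compared in the paper are not covered.

```python
def covariance_from_ar_cholesky(phi, innov_var):
    """Build Sigma from modified (AR) Cholesky parameters.

    phi       : phi[t][j] for j < t, the coefficient of y_j when regressing
                y_t on its predecessors (any real numbers)
    innov_var : innovation (prediction error) variances, all positive
    """
    n = len(innov_var)
    # Row t of A expresses y_t as a linear combination of innovations e_0..e_t,
    # so A is unit lower triangular (it is the inverse of the matrix T in T Sigma T' = D).
    A = [[0.0] * n for _ in range(n)]
    for t in range(n):
        A[t][t] = 1.0
        for j in range(t):
            for k in range(j + 1):
                A[t][k] += phi[t][j] * A[j][k]
    # Sigma = A D A', with D the diagonal matrix of innovation variances.
    return [
        [sum(A[r][k] * innov_var[k] * A[c][k] for k in range(n)) for c in range(n)]
        for r in range(n)
    ]


# Stationary AR(1) with coefficient 0.5 and unit innovation variance:
# the first variance is 1 / (1 - 0.5**2), later ones are 1.
phi = [[], [0.5], [0.0, 0.5]]
sigma = covariance_from_ar_cholesky(phi, [4.0 / 3.0, 1.0, 1.0])
```

Any real-valued `phi` together with positive `innov_var` yields a positive definite matrix, so the parameters can be modeled with regression-type models without further constraints. In the AR(1) example, `sigma[i][j]` equals (4/3) * 0.5 ** abs(i - j).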

A Study on Spatial and Temporal Distribution of a Pest via Generalized Linear Mixed Models (일반화선형혼합모형을 통한 해충밀도의 시공간분포 연구)

  • 박흥선;조기종
    • The Korean Journal of Applied Statistics / v.17 no.2 / pp.185-196 / 2004
  • Estimating pest density within plants is an important research area in Integrated Pest Management systems, because artificial controls such as spraying pesticides or introducing biological enemies depend on pest density information. This paper studies the population density distribution of the two-spotted spider mite on glasshouse roses. Because the data were collected repeatedly on the same subjects, subject-specific and population-averaged approaches are used and compared.

Rank Tracking Probabilities using Linear Mixed Effect Models (선형 혼합 효과 모형을 이용한 순위 추적 확률)

  • Kwak, Minjung
    • The Korean Journal of Applied Statistics / v.28 no.2 / pp.241-250 / 2015
  • An important scientific objective of longitudinal studies is to track the probability of a subject having a certain health condition over the course of the study. Proper definitions and estimates of disease risk tracking have important implications for the design and analysis of long-term biomedical studies and for developing guidelines for disease prevention and intervention. In this paper we study a class of rank-tracking probabilities that describe a subject's conditional probabilities of having certain health outcomes at two different time points. Linear mixed effects models are used to estimate the tracking probabilities and their ratios of interest. We apply our methods to an epidemiological study of childhood cardiovascular risk factors.

Building credit scoring models with various types of target variables (목표변수의 형태에 따른 신용평점 모형 구축)

  • Woo, Hyun Seok;Lee, Seok Hyung;Cho, HyungJun
    • Journal of the Korean Data and Information Science Society / v.24 no.1 / pp.85-94 / 2013
  • As the financial market grows, losses increase due to failures of credit risk management caused by poor management of customer information or poor decision-making. Credit risk management therefore becomes more important, and it is essential to develop a credit scoring model, a fundamental tool for minimizing credit risk. Credit scoring models have been studied and developed only for binary target variables. In this paper, we consider other types of target variables, such as ordinal multinomial data and longitudinal binary data, and suggest credit scoring models for them. We then apply the developed models to real data and random data, and investigate their performance through the Kolmogorov-Smirnov statistic.
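
For the familiar binary case, the Kolmogorov-Smirnov statistic of a scoring model is the largest gap between the empirical score distributions of the two classes. The sketch below computes it directly; the function name and the toy scores are assumptions for illustration, and the extensions to ordinal or longitudinal targets studied in the paper are not shown.

```python
def ks_statistic(scores_good, scores_bad):
    """Two-sample Kolmogorov-Smirnov statistic between two groups of scores.

    Returns the maximum absolute difference between the empirical CDFs
    of the 'good' and 'bad' customers' scores.
    """
    if not scores_good or not scores_bad:
        raise ValueError("both groups must contain at least one score")

    def ecdf(sample, x):
        # Fraction of the sample with score less than or equal to x.
        return sum(s <= x for s in sample) / len(sample)

    # The gap can only change at observed score values.
    thresholds = sorted(set(scores_good) | set(scores_bad))
    return max(abs(ecdf(scores_good, t) - ecdf(scores_bad, t)) for t in thresholds)


# Toy scores, made up for this example: higher means lower predicted risk.
good = [0.6, 0.7, 0.8, 0.9]
bad = [0.1, 0.2, 0.3, 0.65]
ks = ks_statistic(good, bad)
```

A value near 1 indicates that the scores separate the two groups almost perfectly, while a value near 0 indicates little discriminating power.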

Survey of Models for Random Effects Covariance Matrix in Generalized Linear Mixed Model (일반화 선형혼합모형의 임의효과 공분산행렬을 위한 모형들의 조사 및 고찰)

  • Kim, Jiyeong;Lee, Keunbaik
    • The Korean Journal of Applied Statistics / v.28 no.2 / pp.211-219 / 2015
  • Generalized linear mixed models are used to analyze longitudinal categorical data. Random effects specify the serial dependence of repeated outcomes in these models; however, estimating the random effects covariance matrix is challenging because the matrix has many parameters and the estimated covariance matrix must be positive definite. Several approaches to modeling the random effects covariance matrix have been proposed to overcome these restrictions: the modified Cholesky decomposition, the moving average Cholesky decomposition, and partial autocorrelation approaches. We review these approaches and present potential future work.