• Title/Summary/Keyword: covariance methods

Search Result 452, Processing Time 0.025 seconds

Bayesian modeling of random effects precision/covariance matrix in cumulative logit random effects models

  • Kim, Jiyeong;Sohn, Insuk;Lee, Keunbaik
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.1
    • /
    • pp.81-96
    • /
    • 2017
  • Cumulative logit random effects models are typically used to analyze longitudinal ordinal data. The random effects covariance matrix is used in the models to demonstrate both subject-specific and time variations. The covariance matrix may also be homogeneous; however, the structure of the covariance matrix is assumed to be homoscedastic and restricted because the matrix is high-dimensional and should be positive definite. To satisfy these restrictions two Cholesky decomposition methods were proposed in linear (mixed) models for the random effects precision matrix and the random effects covariance matrix, respectively: modified Cholesky and moving average Cholesky decompositions. In this paper, we use these two methods to model the random effects precision matrix and the random effects covariance matrix in cumulative logit random effects models for longitudinal ordinal data. The methods are illustrated by a lung cancer data set.

Effects of Covariance Modeling on Estimation Accuracy in an IMU-based Attitude Estimation Kalman Filter (IMU 기반 자세 추정 칼만필터에서 공분산 모델링이 추정 정확도에 미치는 영향)

  • Choi, Ji Seok;Lee, Jung Keun
    • Journal of Sensor Science and Technology
    • /
    • v.29 no.6
    • /
    • pp.440-446
    • /
    • 2020
  • A well-known difficulty in attitude estimation based on inertial measurement unit (IMU) signals is the occurrence of external acceleration under dynamic motion conditions, as the acceleration significantly degrades the estimation accuracy. Lee et al. (2012) designed a Kalman filter (KF) that could effectively deal with the acceleration issue. Ahmed and Tahir (2017) modified this method by adjusting the acceleration-related covariance matrix because they considered covariance modeling as a pivotal factor in the estimation accuracy. This study investigates the effects of covariance modeling on estimation accuracy in an IMU-based attitude estimation KF. The method proposed by Ahmed and Tahir can be divided into two: one uses the covariance including only diagonal components and the other uses the covariance including both diagonal and off-diagonal components. This paper compares these three methods with respect to the motion condition and the window size, which is required for the methods by Ahmed and Tahir. Experimental results showed that the method proposed by Lee et al. performed the best among the three methods under relatively slow motion conditions, whereas the modified method using the diagonal covariance with a high window size performed the best under relatively fast motion conditions.

Bayesian Modeling of Random Effects Covariance Matrix for Generalized Linear Mixed Models

  • Lee, Keunbaik
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.3
    • /
    • pp.235-240
    • /
    • 2013
  • Generalized linear mixed models(GLMMs) are frequently used for the analysis of longitudinal categorical data when the subject-specific effects is of interest. In GLMMs, the structure of the random effects covariance matrix is important for the estimation of fixed effects and to explain subject and time variations. The estimation of the matrix is not simple because of the high dimension and the positive definiteness; subsequently, we practically use the simple structure of the covariance matrix such as AR(1). However, this strong assumption can result in biased estimates of the fixed effects. In this paper, we introduce Bayesian modeling approaches for the random effects covariance matrix using a modified Cholesky decomposition. The modified Cholesky decomposition approach has been used to explain a heterogenous random effects covariance matrix and the subsequent estimated covariance matrix will be positive definite. We analyze metabolic syndrome data from a Korean Genomic Epidemiology Study using these methods.

Comparison of covariance thresholding methods in gene set analysis

  • Park, Sora;Kim, Kipoong;Sun, Hokeun
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.5
    • /
    • pp.591-601
    • /
    • 2022
  • In gene set analysis with microarray expression data, a group of genes such as a gene regulatory pathway and a signaling pathway is often tested if there exists either differentially expressed (DE) or differentially co-expressed (DC) genes between two biological conditions. Recently, a statistical test based on covariance estimation have been proposed in order to identify DC genes. In particular, covariance regularization by hard thresholding indeed improved the power of the test when the proportion of DC genes within a biological pathway is relatively small. In this article, we compare covariance thresholding methods using four different regularization penalties such as lasso, hard, smoothly clipped absolute deviation (SCAD), and minimax concave plus (MCP) penalties. In our extensive simulation studies, we found that both SCAD and MCP thresholding methods can outperform the hard thresholding method when the proportion of DC genes is extremely small and the number of genes in a biological pathway is much greater than a sample size. We also applied four thresholding methods to 3 different microarray gene expression data sets related with mutant p53 transcriptional activity, and epithelium and stroma breast cancer to compare genetic pathways identified by each method.

Asymptotics for realized covariance under market microstructure noise and sampling frequency determination

  • Shin, Dong Wan;Hwang, Eunju
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.5
    • /
    • pp.411-421
    • /
    • 2016
  • Large frequency limiting distributions of two errors in realized covariance are investigated under noisy and non-synchronous high frequency sampling situations. The first distribution characterizes increased variance of the realized covariance due to noise for large frequency and the second distribution characterizes decreased variance of the realized covariance due to discretization for large frequency. The distribution of the combined error enables us to determine the sampling frequency which depends on a nuisance parameter. A consistent estimator of the nuisance parameter is proposed.

Bayesian baseline-category logit random effects models for longitudinal nominal data

  • Kim, Jiyeong;Lee, Keunbaik
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.2
    • /
    • pp.201-210
    • /
    • 2020
  • Baseline-category logit random effects models have been used to analyze longitudinal nominal data. The models account for subject-specific variations using random effects. However, the random effects covariance matrix in the models needs to explain subject-specific variations as well as serial correlations for nominal outcomes. In order to satisfy them, the covariance matrix must be heterogeneous and high-dimensional. However, it is difficult to estimate the random effects covariance matrix due to its high dimensionality and positive-definiteness. In this paper, we exploit the modified Cholesky decomposition to estimate the high-dimensional heterogeneous random effects covariance matrix. Bayesian methodology is proposed to estimate parameters of interest. The proposed methods are illustrated with real data from the McKinney Homeless Research Project.

On Testing Equality of Matrix Intraclass Covariance Matrices of $K$Multivariate Normal Populations

  • Kim, Hea-Jung
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.1
    • /
    • pp.55-64
    • /
    • 2000
  • We propose a criterion for testing homogeneity of matrix intraclass covariance matrices of K multivariate normal populations, It is based on a variable transformation intended to propose and develop a likelihood ratio criterion that makes use of properties of eigen structures of the matrix intraclass covariance matrices. The criterion then leads to a simple test that uses an asymptotic distribution obtained from Box's (1949) theorem for the general asymptotic expansion of random variables.

  • PDF

Global Covariance based Principal Component Analysis for Speaker Identification (화자식별을 위한 전역 공분산에 기반한 주성분분석)

  • Seo, Chang-Woo;Lim, Young-Hwan
    • Phonetics and Speech Sciences
    • /
    • v.1 no.1
    • /
    • pp.69-73
    • /
    • 2009
  • This paper proposes an efficient global covariance-based principal component analysis (GCPCA) for speaker identification. Principal component analysis (PCA) is a feature extraction method which reduces the dimension of the feature vectors and the correlation among the feature vectors by projecting the original feature space into a small subspace through a transformation. However, it requires a larger amount of training data when performing PCA to find the eigenvalue and eigenvector matrix using the full covariance matrix by each speaker. The proposed method first calculates the global covariance matrix using training data of all speakers. It then finds the eigenvalue matrix and the corresponding eigenvector matrix from the global covariance matrix. Compared to conventional PCA and Gaussian mixture model (GMM) methods, the proposed method shows better performance while requiring less storage space and complexity in speaker identification.

  • PDF

Negative binomial loglinear mixed models with general random effects covariance matrix

  • Sung, Youkyung;Lee, Keunbaik
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.1
    • /
    • pp.61-70
    • /
    • 2018
  • Modeling of the random effects covariance matrix in generalized linear mixed models (GLMMs) is an issue in analysis of longitudinal categorical data because the covariance matrix can be high-dimensional and its estimate must satisfy positive-definiteness. To satisfy these constraints, we consider the autoregressive and moving average Cholesky decomposition (ARMACD) to model the covariance matrix. The ARMACD creates a more flexible decomposition of the covariance matrix that provides generalized autoregressive parameters, generalized moving average parameters, and innovation variances. In this paper, we analyze longitudinal count data with overdispersion using GLMMs. We propose negative binomial loglinear mixed models to analyze longitudinal count data and we also present modeling of the random effects covariance matrix using the ARMACD. Epilepsy data are analyzed using our proposed model.

Dynamic linear mixed models with ARMA covariance matrix

  • Han, Eun-Jeong;Lee, Keunbaik
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.6
    • /
    • pp.575-585
    • /
    • 2016
  • Longitudinal studies repeatedly measure outcomes over time. Therefore, repeated measurements are serially correlated from same subject (within-subject variation) and there is also variation between subjects (between-subject variation). The serial correlation and the between-subject variation must be taken into account to make proper inference on covariate effects (Diggle et al., 2002). However, estimation of the covariance matrix is challenging because of many parameters and positive definiteness of the matrix. To overcome these limitations, we propose autoregressive moving average Cholesky decomposition (ARMACD) for the linear mixed models. The ARMACD allows a class of flexible, nonstationary, and heteroscedastic models that exploits the structure allowed by combining the AR and MA modeling of the random effects covariance matrix. We analyze a real dataset to illustrate our proposed methods.