• Title/Summary/Keyword: Time course microarray data

Search Result 16, Processing Time 0.019 seconds

Clustering of Time-Course Microarray Data Using Pharmacokinetic Parameter (약동학적 파라미터를 이용한 시간경로 마이크로어레이 자료의 군집분석)

  • Lee, Hyo-Jung;Kim, Peol-A;Park, Mi-Ra
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.4
    • /
    • pp.623-631
    • /
    • 2011
  • A major goal of time-course microarray data analysis is the detection of groups of genes that manifest similar expression patterns over time. The corresponding numerous cluster algorithms for clustering time-course microarray data have been developed. In this study, we proposed a clustering method based on the primary pharmacokinetic parameters in the pharmacokinetics study for assessment of pharmaceutical equivalents between two drug products. A real data and a simulation data was used to demonstrate the usefulness of the proposed method.

Curve Clustering in Microarray

  • Lee, Kyeong-Eun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.3
    • /
    • pp.575-584
    • /
    • 2004
  • We propose a Bayesian model-based approach using a mixture of Dirichlet processes model with discrete wavelet transform, for curve clustering in the microarray data with time-course gene expressions.

  • PDF

A Review of Cluster Analysis for Time Course Microarray Data (시간 경로 마이크로어레이 자료의 군집 분석에 관한 고찰)

  • Sohn In-Suk;Lee Jae-Won;Kim Seo-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.13-32
    • /
    • 2006
  • Biologists are attempting to group genes based on the temporal pattern of gene expression levels. So far, a number of methods have been proposed for clustering microarray data. However, the results of clustering depends on the genes selection, therefore the gene selection with significant expression difference is also very important to cluster for microarray data. Thus, this paper present the results of broad comparative studies to time course microarray data by considering methods of gene selection, clustering and cluster validation.

A Study of HME Model in Time-Course Microarray Data

  • Myoung, Sung-Min;Kim, Dong-Geon;Jo, Jin-Nam
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.3
    • /
    • pp.415-422
    • /
    • 2012
  • For statistical microarray data analysis, clustering analysis is a useful exploratory technique and offers the promise of simultaneously studying the variation of many genes. However, most of the proposed clustering methods are not rigorously solved for a time-course microarray data cluster and for a fitting time covariate; therefore, a statistical method is needed to form a cluster and represent a linear trend of each cluster for each gene. In this research, we developed a modified hierarchical mixture of an experts model to suggest clustering data and characterize each cluster using a linear mixed effect model. The feasibility of the proposed method is illustrated by an application to the human fibroblast data suggested by Iyer et al. (1999).

Consensus Clustering for Time Course Gene Expression Microarray Data

  • Kim, Seo-Young;Bae, Jong-Sung
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.2
    • /
    • pp.335-348
    • /
    • 2005
  • The rapid development of microarray technologies enabled the monitoring of expression levels of thousands of genes simultaneously. Recently, the time course gene expression data are often measured to study dynamic biological systems and gene regulatory networks. For the data, biologists are attempting to group genes based on the temporal pattern of their expression levels. We apply the consensus clustering algorithm to a time course gene expression data in order to infer statistically meaningful information from the measurements. We evaluate each of consensus clustering and existing clustering methods with various validation measures. In this paper, we consider hierarchical clustering and Diana of existing methods, and consensus clustering with hierarchical clustering, Diana and mixed hierachical and Diana methods and evaluate their performances on a real micro array data set and two simulated data sets.

Screening and Clustering for Time-course Yeast Microarray Gene Expression Data using Gaussian Process Regression (효모 마이크로어레이 유전자 발현데이터에 대한 가우시안 과정 회귀를 이용한 유전자 선별 및 군집화)

  • Kim, Jaehee;Kim, Taehoun
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.3
    • /
    • pp.389-399
    • /
    • 2013
  • This article introduces Gaussian process regression and shows its application with time-course microarray gene expression data. Gene screening for yeast cell cycle microarray expression data is accomplished with a ratio of log marginal likelihood that uses Gaussian process regression with a squared exponential covariance kernel function. Gaussian process regression fitting with each gene is done and shown with the nine top ranking genes. With the screened data the Gaussian model-based clustering is done and its silhouette values are calculated for cluster validity.

Bayesian Curve Clustering in Microarray

  • Lee, Kyeong-Eun;Mallick, Bani K.
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2006.04a
    • /
    • pp.39-42
    • /
    • 2006
  • We propose a Bayesian model-based approach using a mixture of Dirichlet processes model with discrete wavelet transform, for curve clustering in the microarray data with time-course gene expressions.

  • PDF

Gene Discovery Analysis from Mouse Embryonic Stem Cells Based on Time Course Microarray Data

  • Suh, Young Ju;Cho, Sun A;Shim, Jung Hee;Yook, Yeon Joo;Yoo, Kyung Hyun;Kim, Jung Hee;Park, Eun Young;Noh, Ji Yeun;Lee, Seong Ho;Yang, Moon Hee;Jeong, Hyo Seok;Park, Jong Hoon
    • Molecules and Cells
    • /
    • v.26 no.4
    • /
    • pp.338-343
    • /
    • 2008
  • An embryonic stem cell is a powerful tool for investigation of early development in vitro. The study of embryonic stem cell mediated neuronal differentiation allows for improved understanding of the mechanisms involved in embryonic neuronal development. We investigated expression profile changes using time course cDNA microarray to identify clues for the signaling network of neuronal differentiation. For the short time course microarray data, pattern analysis based on the quadratic regression method is an effective approach for identification and classification of a variety of expressed genes that have biological relevance. We studied the expression patterns, at each of 5 stages, after neuronal induction at the mRNA level of embryonic stem cells using the quadratic regression method for pattern analysis. As a result, a total of 316 genes (3.1%) including 166 (1.7%) informative genes in 8 possible expression patterns were identified by pattern analysis. Among the selected genes associated with neurological system, all three genes showing linearly increasing pattern over time, and one gene showing decreasing pattern over time, were verified by RT-PCR. Therefore, an increase in gene expression over time, in a linear pattern, may be associated with embryonic development. The genes: Tcfap2c, Ttr, Wnt3a, Btg2 and Foxk1 detected by pattern analysis, and verified by RT-PCR simultaneously, may be candidate markers associated with the development of the nervous system. Our study shows that pattern analysis, using the quadratic regression method, is very useful for investigation of time course cDNA microarray data. The pattern analysis used in this study has biological significance for the study of embryonic stem cells.

A Pattern Consistency Index for Detecting Heterogeneous Time Series in Clustering Time Course Gene Expression Data (시간경로 유전자 발현자료의 군집분석에서 이질적인 시계열의 탐지를 위한 패턴일치지수)

  • Son, Young-Sook;Baek, Jang-Sun
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.371-379
    • /
    • 2005
  • In this paper, we propose a pattern consistency index for detecting heterogeneous time series that deviate from the representative pattern of each cluster in clustering time course gene expression data using the Pearson correlation coefficient. We examine its usefulness by applying this index to serum time course gene expression data from microarrays.

Missing Values Estimation for Time Course Gene Expression Data Using the Sequential Partial Least Squares Regression Fitting (순차적 부분최소제곱 회귀적합에 의한 시간경로 유전자 발현 자료의 결측치 추정)

  • Kim, Kyung-Sook;Oh, Mi-Ra;Baek, Jang-Sun;Son, Young-Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.2
    • /
    • pp.275-290
    • /
    • 2008
  • The size of microarray gene expression data is very big and its observation process is also very complex. Thus missing values are frequently occurred. In this paper we propose the sequential partial least squares(SPLS) regression fitting method to estimate missing values for time course gene expression data that has correlations among observations over time points. The SPLS method is to combine the sequential technique with the partial least squares(PLS) regression fitting method. The usefulness of method proposed is evaluated through some simulation study for three yeast time course data.