• Title/Summary/Keyword: Time-course microarray data

A Biclustering Method for Time Series Analysis

  • Lee, Jeong-Hwa;Lee, Young-Rok;Jun, Chi-Hyuck
    • Industrial Engineering and Management Systems / v.9 no.2 / pp.131-140 / 2010
  • Biclustering is a method of simultaneously finding meaningful subsets of objects and attributes that may not be detected by traditional clustering methods. It is widely used for the analysis of microarray data representing the expression levels of genes under different conditions. Biclustering algorithms usually do not consider a sequential relation between attributes. For time series data, however, bicluster solutions should preserve the time sequence. This paper proposes a new biclustering algorithm for time series data by modifying the plaid model. The proposed algorithm introduces a parameter controlling the interval between two selected time points. In addition, the pruning step, which prevents over-fitting, is modified so as to eliminate only starting or ending points. Results from artificial data sets show that the proposed method is more suitable for extracting biclusters from time series data sets. Moreover, using the proposed method, we find some interesting observations in real-world time-course microarray data sets and apartment price data sets in metropolitan areas.

Gene Screening and Clustering of Yeast Microarray Gene Expression Data (효모 마이크로어레이 유전자 발현 데이터에 대한 유전자 선별 및 군집분석)

  • Lee, Kyung-A;Kim, Tae-Houn;Kim, Jae-Hee
    • The Korean Journal of Applied Statistics / v.24 no.6 / pp.1077-1094 / 2011
  • We perform cluster analyses of yeast cell cycle microarray expression data. To reflect the characteristics of time-course data, we screen the genes using test statistics based on Fourier coefficients, applying an FDR procedure. We compare the results of model-based clustering, K-means, PAM, SOM, the hierarchical Ward method and the fuzzy method on the yeast data. As validity measures for the clustering results, connectivity, the Dunn index and silhouette values are computed and compared. A biological interpretation based on GO analysis is also included.
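
The abstract does not say which FDR procedure was applied. As a minimal sketch, assuming the widely used Benjamini-Hochberg step-up procedure and hypothetical per-gene p-values (the function name and the data below are illustrative, not taken from the paper), gene screening could look like this:

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans, True where a hypothesis is rejected.

    Assumption: Benjamini-Hochberg step-up procedure at FDR level alpha.
    The paper only says "a FDR procedure", so this choice is illustrative.
    """
    m = len(p_values)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Largest 1-based rank k such that p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    # Reject every hypothesis ranked at or below the cutoff.
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected


# Hypothetical p-values, one per gene (e.g. from a periodicity test).
p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.060, 0.074, 0.205]
selected = benjamini_hochberg(p, alpha=0.05)
```

Genes flagged `True` in `selected` would be the ones passed on to the clustering step. Because the procedure is step-up, a gene can be retained even if its own p-value misses its threshold, as long as a larger p-value further down the sorted list meets its threshold.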

Missing values imputation for time course gene expression data using the pattern consistency index adaptive nearest neighbors (시간경로 유전자 발현자료에서 패턴일치지수와 적응 최근접 이웃을 활용한 결측값 대치법)

  • Shin, Heyseo;Kim, Dongjae
    • The Korean Journal of Applied Statistics / v.33 no.3 / pp.269-280 / 2020
  • Time course gene expression data are large data sets observed over time in microarray experiments. Such data make it possible to identify gene expression levels simultaneously. However, the experimental process is complex, and missing values arise frequently from various causes. In this paper, we propose pattern consistency index adaptive nearest neighbors (PANN) as a method of missing value imputation. This method combines the adaptive nearest neighbors (ANN) method, which reflects local characteristics, with the pattern consistency index, which considers the degree of consistency of gene expression between observations over time points. We conducted a Monte Carlo simulation study to evaluate the usefulness of the proposed PANN method on two yeast time course data sets.
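
For orientation, the sketch below shows plain k-nearest-neighbor imputation, the general idea that nearest-neighbor methods such as ANN build on. It is not the PANN method: it uses a fixed `k` and no pattern consistency index, and the function name, distance choice and data are assumptions made for illustration.

```python
import math


def knn_impute(matrix, k=2):
    """Fill None entries with the mean of that column over the k nearest rows.

    Rows are genes and columns are time points. This is generic KNN
    imputation with a fixed k, not the adaptive PANN method of the paper.
    """
    def distance(a, b):
        # Root mean squared difference over time points observed in both rows.
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    filled = [row[:] for row in matrix]  # leave the input untouched
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate neighbors: other rows that have column j observed.
            candidates = [
                (distance(row, other), other[j])
                for r, other in enumerate(matrix)
                if r != i and other[j] is not None
            ]
            candidates = [c for c in candidates if math.isfinite(c[0])]
            candidates.sort(key=lambda c: c[0])
            nearest = candidates[:k]
            if nearest:
                filled[i][j] = sum(v for _, v in nearest) / len(nearest)
    return filled


# Hypothetical expression matrix: 4 genes x 4 time points, one missing value.
expr = [
    [1.0, 2.0, None, 4.0],
    [1.0, 2.0, 3.0, 4.0],
    [1.5, 2.5, 3.5, 4.5],
    [9.0, 8.0, 7.0, 6.0],
]
completed = knn_impute(expr, k=2)
```

Here the missing value in the first gene is filled from the two genes whose observed profiles are closest to it; the fourth gene, which follows an opposite trend, does not contribute.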

A Method of Identifying Disease-related Significant Pathways Using Time-Series Microarray Data (시간열 마이크로어레이 데이터를 이용한 질병 관련 유의한 패스웨이 유전자 집합의 검출)

  • Kim, Jae-Young;Shin, Mi-Young
    • Journal of the Institute of Electronics Engineers of Korea CI / v.47 no.5 / pp.17-24 / 2010
  • Recently, the identification of biomarkers for disease diagnosis and prognosis has been actively studied. In particular, much attention has been paid to finding pathway gene sets that are differentially expressed in disease patients, rather than individual gene markers. In this paper we propose a novel method to identify disease-related pathway gene sets based on time-series microarray data. For this purpose, we first compute individual gene scores using maSigPro (microarray Significant Profiles) and then arrange all the genes in decreasing order of their scores. The rank of each gene in the entire list is used to evaluate the statistical significance of candidate gene sets with the Wilcoxon rank sum test. Candidate gene sets are generated from MSigDB (Molecular Signatures Database) pathway information. The experiment was conducted with prostate cancer time-series microarray data, and the results showed the usefulness of the proposed method, which correctly identified 6 out of 7 biological pathways already known to be related to prostate cancer.
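
The ranking step described above can be sketched as follows. This is an illustrative reading of the abstract, not the authors' implementation: the gene scores are hypothetical stand-ins for maSigPro output, the test is one-sided (asking whether the set sits near the top of the list), and the p-value uses a normal approximation without continuity or tie correction.

```python
import math


def rank_sum_test(scores, gene_set):
    """One-sided Wilcoxon rank sum test on gene ranks.

    scores: dict mapping gene name to score (higher means more significant).
    gene_set: iterable of gene names forming one candidate pathway.
    Returns (w, z, p): the rank sum of the set, its z-score, and the
    normal-approximation p-value for the set being ranked near the top.
    """
    # Rank 1 is the gene with the highest score.
    ranked = sorted(scores, key=scores.get, reverse=True)
    rank = {gene: r for r, gene in enumerate(ranked, start=1)}
    members = [g for g in gene_set if g in rank]
    total = len(ranked)
    n = len(members)
    m = total - n
    if n == 0 or m == 0:
        raise ValueError("gene set must be a proper, non-empty subset of the scored genes")
    w = sum(rank[g] for g in members)
    mean = n * (total + 1) / 2       # expected rank sum under the null
    var = n * m * (total + 1) / 12   # variance of the rank sum (no ties)
    z = (w - mean) / math.sqrt(var)
    p = 0.5 * math.erfc(-z / math.sqrt(2))  # P(Z <= z)
    return w, z, p


# Hypothetical scores for ten genes; g1 scores highest.
scores = {f"g{i}": 11 - i for i in range(1, 11)}
w_top, z_top, p_top = rank_sum_test(scores, {"g1", "g2", "g3"})
w_low, z_low, p_low = rank_sum_test(scores, {"g8", "g9", "g10"})
```

A gene set concentrated at the top of the list yields a small rank sum, a negative z-score and a small p-value, while a set concentrated at the bottom yields a p-value near 1.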

Comparative Analysis of Growth-Phase-Dependent Gene Expression in Virulent and Avirulent Streptococcus pneumoniae Using a High-Density DNA Microarray

  • Ko, Kwan Soo;Park, Sulhee;Oh, Won Sup;Suh, Ji-Yoeun;Oh, TaeJeong;Ahn, Sungwhan;Chun, Jongsik;Song, Jae-Hoon
    • Molecules and Cells / v.21 no.1 / pp.82-88 / 2006
  • The global pattern of growth-dependent gene expression in Streptococcus pneumoniae strains was evaluated using a high-density DNA microarray. Total RNAs obtained from an avirulent S. pneumoniae strain R6 and a virulent strain AMC96-6 were used to compare the expression patterns at seven time points (2.5, 3.5, 4.5, 5.5, 6.0, 6.5, and 8.0 h). The expression profile of strain R6 changed between log and stationary growth (the Log-Stat switch). There were clear differences between the growth-dependent gene expression profiles of the virulent and avirulent pneumococcal strains in 367 of 1,112 genes. Transcripts of genes associated with bacterial competence and capsular polysaccharide formation, as well as clpP and cbpA, were higher in the virulent strain. Our data suggest that late log or early stationary phase may be the most virulent phase of S. pneumoniae.

Effects of Baicalin on Gene Expression Profiles during Adipogenesis of 3T3-L1 Cells (3T3-L1 세포의 지방세포형성과정에서 Baicalin에 의한 유전자 발현 프로파일 분석)

  • Lee, Hae-Yong;Kang, Ryun-Hwa;Chung, Sang-In;Cho, Soo-Hyun;Yoon, Yoo-Sik
    • Journal of the Korean Society of Food Science and Nutrition / v.39 no.1 / pp.54-63 / 2010
  • Baicalin, a flavonoid, has been shown to have diverse effects, including anti-inflammatory, anti-cancer, anti-viral and anti-bacterial effects. Recently, we found that baicalin inhibits adipogenesis by modulating anti-adipogenic and pro-adipogenic factors of the adipogenesis pathway. In the present study, we further characterized the molecular mechanism of the anti-adipogenic effect of baicalin using microarray technology. Microarray analyses were conducted to analyze the gene expression profiles during the differentiation time course (days 0, 2, 4 and 7) in 3T3-L1 cells with or without baicalin treatment. We identified a total of 3972 genes whose expression changed more than 2-fold. These 3972 genes were further analyzed using hierarchical clustering, resulting in 20 clusters. Four of the 20 clusters showed clearly up-regulated expression patterns (clusters 8 and 10) or clearly down-regulated expression patterns (clusters 12 and 14) under baicalin treatment over the entire differentiation period. Clusters 8 and 10 included many genes that enhance cell proliferation or inhibit adipogenesis. On the other hand, clusters 12 and 14 included many genes related to proliferation inhibition, cell cycle arrest, cell growth suppression or adipogenesis induction. In conclusion, these data provide detailed information on the molecular mechanism of baicalin-induced inhibition of adipogenesis.
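
The abstract does not state exactly how the 2-fold criterion was applied. A minimal sketch, assuming positive (non-log) intensities and a rule that keeps a gene when the treated-to-control ratio exceeds 2 or falls below 1/2 at any time point, might look like the following; the function name, the rule and the data are all hypothetical.

```python
def changed_genes(control, treated, fold=2.0):
    """Return genes whose expression changes more than `fold` at any time point.

    control, treated: dicts mapping gene name to a list of positive
    intensities, one per time point, in the same order.
    Assumption: the comparison is treated vs. control at matching time
    points; the paper may have used a different comparison.
    """
    selected = []
    for gene, base in control.items():
        ratios = [t / c for c, t in zip(base, treated[gene])]
        # Up-regulated beyond `fold` or down-regulated beyond 1 / `fold`.
        if any(r > fold or r < 1 / fold for r in ratios):
            selected.append(gene)
    return selected


# Hypothetical intensities at days 0, 2, 4 and 7.
control = {
    "geneA": [100.0, 100.0, 100.0, 100.0],
    "geneB": [100.0, 100.0, 100.0, 100.0],
    "geneC": [100.0, 100.0, 100.0, 100.0],
}
treated = {
    "geneA": [100.0, 250.0, 180.0, 120.0],  # up more than 2-fold on day 2
    "geneB": [100.0, 150.0, 120.0, 90.0],   # never beyond 2-fold
    "geneC": [100.0, 40.0, 60.0, 80.0],     # down more than 2-fold on day 2
}
filtered = changed_genes(control, treated)
```

The genes that pass such a filter would then be the input to hierarchical clustering.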