Browse > Article

Gene Expression Data Analysis Using Seed Clustering  

Shin Myoung (Bioinformatics Research Team, Electronics and Telecommunication Research Institute)
Publication Information
Abstract
Cluster analysis of microarray data has been often used to find biologically relevant Broups of genes based on their expression levels. Since many functionally related genes tend to be co-expressed, by identifying groups of genes with similar expression profiles, the functionalities of unknown genes can be inferred from those of known genes in the same group. In this Paper we address a novel clustering approach, called seed clustering, and investigate its applicability for microarray data analysis. In the seed clustering method, seed genes are first extracted by computational analysis of their expression profiles and then clusters are generated by taking the seed genes as prototype vectors for target clusters. Since it has strong mathematical foundations, the seed clustering method produces the stable and consistent results in a systematic way. Also, our empirical results indicate that the automatically extracted seed genes are well representative of potential clusters hidden in the data, and that its performance is favorable compared to current approaches.
Keywords
microarray data; gene expression data analysis; clustering algorithm; seed clustering;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Sus. Datta and Som. Datta, Comparisons and validation of statistical clustering techniques for microarray gene expression data, Bioinformatics, Vol. 19, no. 9, pp.459-466, 2003   DOI   ScienceOn
2 D. Horn and I. Axel, Novel clustering algorithm for microarray expression data in a truncated SVD space, Bioinformatics, Vol. 19, no. 9, pp. 1110-1115, 2003   DOI   ScienceOn
3 H. Toh and H. Horimoto, Inference of a genetic network by a combined approach of cluster analysis and graphical Gaussian modeling, Bioinformatics, Vol. 18, no. 2, pp. 287-297, 2002   DOI   ScienceOn
4 C. Ding, X. He, H. Zha, and H.D. Simon, Adaptive dimension reduction for clustering high dimensional data, Proceedings of 2nd IEEE International Conference on Data Mining, 2002   DOI
5 오쯔까 기치비, 아비꼬 요시미쯔, 비주얼 생화학, 분자 생물학, 해돋이, pp. 94-95, 2000
6 R. J. Cho, M. J. Campbell, E. A. Winzeler, L. Steinmetz, A. Conway, L. Wodicka, T. G. Wolfsberg, A. E. Gabrielian, D. Landsman, D. J. Lockhart and R. W. Davis, 'A genome-wide transcriptional analysis of the mitotic cell cycle,' Molecular Cell, vol. 2, pp. 65-73, 1998   DOI   ScienceOn
7 http://staff.washington.edu/kayee/cluster/
8 K. Y. Yeung and W.L. Ruzzo, Principle component analysis for clustering gene expression data, Bioinformatics, Vol. 17, no. 9, pp.763-774, 2001   DOI   ScienceOn
9 S. Tavazoie, J.D. Hughes, M.J. Campbell, R.J. Cho and G.M. Church, Systematic determination of genetic network architecture, Nature Genetics, Vol. 22, pp. 281-285, 1999   DOI   ScienceOn
10 K. Y. Yeung, et al., 'Validating clustering for gene expression data,' Bioinformatics, vol. 17, no. 4, pp. 309-318, 2001   DOI   ScienceOn
11 P. Tamayo, D. Slonim, J. Mesirov, Q. Zhu, S. Kitareewan, E. Dmitrovsky, E.S. Lander and T.R. Golub, Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation, Proc. Natl. Acad. Sci., Vol. 96, pp. 2007-2912, 1999   DOI
12 Golub, G.H. and Van Loan, C.F., Matrix Computation (3rd edition), The Johns Hopkins University Press, pp. 500-595, 1996
13 M.E. Eisen, P.T. Spellman, P.O. Brown and D. Botstein, Cluster analysis and display of genome-wide expression patterns, Proc. Natl. Acad. Sci., Vol. 95, pp.14863-14868, 1998   DOI