Browse > Article

Analysis of Saccharomyces Cell Cycle Expression Data using Bayesian Validation of Fuzzy Clustering  

Yoo Si-Ho (연세대학교 컴퓨터과학과)
Won Hong-Hee (연세대학교 컴퓨터과학과)
Cho Sung-Bae (연세대학교 컴퓨터과학과)
Abstract
Clustering, a technique for the analysis of the genes, organizes the patterns into groups by the similarity of the dataset and has been used for identifying the functions of the genes in the cluster or analyzing the functions of unknown gones. Since the genes usually belong to multiple functional families, fuzzy clustering methods are more appropriate than the conventional hard clustering methods which assign a sample to a group. In this paper, a Bayesian validation method is proposed to evaluate the fuzzy partitions effectively. Bayesian validation method is a probability-based approach, selecting a fuzzy partition with the largest posterior probability given the dataset. At first, the proposed Bayesian validation method is compared to the 4 representative conventional fuzzy cluster validity measures in 4 well-known datasets where foray c-means algorithm is used. Then, we have analyzed the results of Saccharomyces cell cycle expression data evaluated by the proposed method.
Keywords
fuzzy clustering; fuzzy c-means algorithm; Bayesian validation method; Saccharomyces cell cycle expression data;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Y. Fukuyama and M. Sugeno, 'A new method of choosing the number of clusters for the fuzzy c-means method,' Proceedings of 5th Fuzzy Systems Symposium, pp. 247-250, 1989
2 D. Dembele and P. Kastner, 'Fuzzy c-means method for clustering microarray data,' Bioinformatics, vol. 19, no. 8, pp. 973-980, 2003   DOI   ScienceOn
3 K. Y. Yeung, et al., 'Validating clustering for gene expression data,' Bioinformatics, vol. 17, no. 4, pp. 309-318, 2001   DOI   ScienceOn
4 N. Bolshakova and F. Azuaje, 'Cluster validation techniques for genome expression data,' Signal Processing, vol. 21, no. 82, pp. 1-9. 2002
5 A. P. Gasch and M. B. Eisen, 'Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering,' Genome Biology, vol. 3, no. 11, research 0059.1-0059.22, 2002
6 U. Alon, N. Barkai, D. A. Notterman, K. Gish, Y. Barra, D. Mack and A. J. Levine, 'Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays,' Proceedings of National Academy of Science USA, vol. 96, pp. 6745-6750, 1999   DOI
7 J. C. Bezdeck, 'Numerical taxonomy with fuzzy sets,' J. Math. Biology, vol. 1, pp. 58-72, 1974
8 J. C. Bezdeck, 'Cluster validity with fuzzy sets,' J. Cybernit, vol. 3, pp. 58-72, 1974
9 X. L. Xie and G. Beni, 'A validity measure for fuzzy clustering,' IEEE Trnas. Pattern Analysis and Machine Intelligence, vol. 3, no. 3, pp. 841-846, 1991   DOI   ScienceOn
10 A. P. Gasch and M. B. Eisen, 'Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering,' Genome Biology, vol. 3, no. 11, research 0059.1-0059.22, 2002
11 F. Hoppner, F. Klawonn, R. Kruse and T. Runkler, Fuzzy Cluster Analysis, Wiley, 2000
12 S. L. Chiu, 'Fuzzy model identification based on cluster estimation,' J. Intelligent and Fuzzy Systems, vol. 2, no. 3, pp. 267-278, 1994
13 D. W. Kim, K. H. Lee and D. H. Lee, 'Fuzzy cluster validation index based on inter-cluster proximity,' Pattern Recognition Letters, vol. 24. pp. 2561-2574, 2003   DOI   ScienceOn
14 I. Gath and A.B. Geva, 'Unsupervised optimal fuzzy clustering,' IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 11, pp. 773-781, 1989   DOI   ScienceOn
15 J. C. Bezdek, Pattern Recognition with Fuzzy Objective Function Algorithms, Plenum Press, 1989
16 J. Khan, J. S. Wei, M. Ringner, L. H. Saal, M. Ladanyi, F. Westermann, F. Berthold, M. Schwab, C. R. Antonescu, C. Peterson and P. S. Meltzer, 'Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks,' Nature Medicine, vol. 7, pp. 673-679, 2001   DOI   ScienceOn
17 N.R. Pal and J.C. Bezdek : On cluster validity for the fuzzy c-means model, IEEE Transactions on Fuzzy Systems, Vol. 3, No. pp.3370-379, 1995   DOI   ScienceOn
18 M. R. Rezaee, B. P. F. Lelieveldt and J. H. C. Reiber, 'A new cluster validity index for the fuzzy c-means,' Pattern Recognition Letters, vol. 19, pp. 237-246, 1998   DOI   ScienceOn
19 Y. Barash and N. Friedman, 'Context-specific Bayesian clustering for gene expression data,' Journal of Computational Molecular Cell Biology, vol. 9, no. 2, pp. 12-21, 2001
20 R. J. Cho, M. J. Campbell, E. A. Winzeler, L. Steinmetz, A. Conway, L. Wodicka, T. G. Wolfsberg, A. E. Gabrielian, D. Landsman, D. J. Lockhart and R. W. Davis, 'A genome-wide transcriptional analysis of the mitotic cell cycle,' Molecular Cell, vol. 2, pp. 65-73, 1998   DOI   ScienceOn