Browse > Article

Clustering Gene Expression Data by MCL Algorithm  

Shon, Ho-Sun (School of Electrical and Computer Engineering, Chungbuk National University)
Ryu, Keun-Ho (School of Electrical and Computer Engineering, Chungbuk National University)
Publication Information
Abstract
The clustering of gene expression data is used to analyze the results of microarray studies. This clustering is one of the frequently used methods in understanding degrees of biological change and gene expression. In biological research, MCL algorithm is an algorithm that clusters nodes within a graph, and is quick and efficient. We have modified the existing MCL algorithm and applied it to microarray data. In applying the MCL algorithm we put forth a simulation that adjusts two factors, namely inflation and diagonal tent and converted them by making use of Markov matrix. Furthermore, in order to distinguish class more clearly in the modified MCL algorithm we took the average of each row and used it as a threshold. Therefore, the improved algorithm can increase accuracy better than the existing ones. In other words, in the actual experiment, it showed an average of 70% accuracy when compared with an existing class. We also compared the MCL algorithm with the self-organizing map(SOM) clustering, K-means clustering and hierarchical clustering (HC) algorithms. And the result showed that it showed better results than ones derived from hierarchical clustering and K-means method.
Keywords
Gene Expression; MCL algorithm; Clustering; Hierarchical; K-means;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Ho Sun Shon, Sunshin Kim, Chung Sei Rhee, Keun ho Ryu, "Clustering DNA Microarray Data by MCL Algorithm, ISMB, 2007
2 T. Kohonen, "The self-organizing map," Neurocomputing, vol 21, pp.1-6. 1998   DOI   ScienceOn
3 Ho Sun Shon, Sunshin Kim, Keun Ho Ryu, "Clustering approach using MCL Algorithm for analysing Microarray Data", no1, vol 9, 2007
4 EisenLab http://rana1b1.gov./EisenSoftware, 2008
5 T. Kohonen, Self-Organizing Maps, Springer Verlag, NewYork, 1997
6 J. Han, M. Kamber, Data Mining: Concepts & Techniques 2nd ed, March 2006
7 Sunshin Kim, Kwang Su Jung, Keun Ho Ryu, "Automatic Orthologous-Protein-Clustering from Multiple Complete-Genomes by the Best Reciprocal BLAST Hits", LNBI, vol.3916, pp.60-70, 2006
8 T. R Golub, D. K Slonim, P. Tamayo et al., "Molecular classification of cancer: class discovery and class prediction by gene expression monitoring". Science 286, pp.531-537, 1999   DOI   ScienceOn
9 P. T. Spellman, G. Sherlock, M. Q. Zhang, et al., "Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by Microarray hybridization", Molecular Biology of the Cell, vol 9, no. 12, 3273-3297, 1998   DOI
10 E. Hartuv et al., An Algorithm for Clustering cDNAs for Gene Expression Analysis, RECOM B 99, pp.188-197, 1999
11 R. Sharan and R. Shamir, "CLICK : A Clustering algorithm with applications to gene expression analysis", In Proceedings ISMB, 2000
12 Stijn Marinus van Dongen, GRAPH Clustering by FLOW SIMULATION, 1969
13 Q. Zhang, Y. Zhang, "Hierarchical Clustering of gene expression profiles with graphics hardware acceleration", pp.676-681, Pattern Recognition Letters, vol 27, 2006   DOI   ScienceOn
14 정광수, 유기진, 정용제, 류근호, "MCL 알고리즘을 이용한 단백질 표면의 바인딩 영역 분석 기법" 정보처리학회논문지 D, 제14-D권 제7호, pp743-752, 2007.12   과학기술학회마을   DOI
15 Sunshin Kim, Clustering Methods for Finding Orthologs among Multiple Species, http://dblab.chungbuk.ac.kr/-sskim04/,2007
16 The R Project for Statistical Computing, 2008 http://www.r-project.org/