DOI QR코드

DOI QR Code

An Efficient Functional Analysis Method for Micro-array Data Using Gene Ontology

  • Hong, Dong-Wan (Department of Computer Engineering, Hallym University) ;
  • Lee, Jong-Keun (Department of Computer Engineering, Hallym University) ;
  • Park, Sung-Soo (Department of Computer Engineering, Hallym University) ;
  • Hong, Sang-Kyoon (Department of Computer Engineering, Hallym University) ;
  • Yoon, Jee-Hee (Department of Computer Engineering, Hallym University)
  • Published : 2007.06.30

Abstract

Microarray data includes tens of thousands of gene expressions simultaneously, so it can be effectively used in identifying the phenotypes of diseases. However, the retrieval of functional information from a large corpus of gene expression data is still a time-consuming task. In this paper, we propose an efficient method for identifying functional categories of differentially expressed genes from a micro-array experiment by using Gene Ontology (GO). Our method is as follows: (1) The expression data set is first filtered to include only genes with mean expression values that differ by at least 3-fold between the two groups. (2) The genes are then ranked based on the t-statistics. The 100 most highly ranked genes are selected as informative genes. (3) The t-value of each informative gene is imposed as a score on the associated GO terms. High-scoring GO terms are then listed with their associated genes and represent the functional category information of the micro-array experiment. A system called HMDA (Hallym Micro-array Data analysis) is implemented on publicly available micro-array data sets and validated. Our results were also compared with the original analysis.

Keywords

References

  1. http://www.geneontology.org/
  2. http://www.nbn.ac.za/Education/14-microarray - 2004
  3. E. LaTulippe, J. Satagopan, A. Smith, H. Scher, P. Scardino, V. Reuter and W. L. Gerald, 'Comprehensive Gene Expression Analysis of Prostate Cancer Reveals Distinct Transcriptional Programs Associated with Metastatic Disease,' Cancer Research, vol. 62, pp. 4499-4509, 2002
  4. F. AI-Shahrour, R. Diaz-Uriarte and J. Dopazo, 'FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes,' Bioinformatics, Vol. 20, no. 4, pp. 578-580, 2004 https://doi.org/10.1093/bioinformatics/btg455
  5. B. Adryan and R. Schuh, 'Gene-Ontology-based clustering of gene expression data,' Bioinformatics, Vol. 20, no. 16,pp.2851-2852, 2004 https://doi.org/10.1093/bioinformatics/bth289
  6. P. N. Robinson, A. Wollstein, U. Bohme and B. Beattie, 'Ontologizing gene-expression micro-array data: characterizing clusters with Gene Ontology,' Bioinformatics, Vol. 20, no. 6, pp. 979-981, 2004 https://doi.org/10.1093/bioinformatics/bth040
  7. M. Smid and L. C. J. Dorssers, 'GO-Mapper: functional analysis of gene expression data using the expression level as a score to evaluate Gene Ontology terms,' Bioinformatics, Vol. 20, no. 16, pp. 2618-2625, 2004 https://doi.org/10.1093/bioinformatics/bth293
  8. P. Khatri, S. Draghici, G. C. Ostermeier and S. A. Krawetz, 'Profiling Gene Expression Using Onto-Express,' Genomics, Vol. 79, no. 2, pp. 266-270, 2002 https://doi.org/10.1006/geno.2002.6698
  9. A. Young, N. Whitehouse, J. Cho and C. Shaw, 'Ontology Traverser: and R package for GO analysis,' Bioinformatics, Vol. 21, no. 2, pp 275-276, 2005 https://doi.org/10.1093/bioinformatics/bth495
  10. P. Khatri and S.Draghici, 'Ontological analysis of gene expression data: current tools, limitations and open problems,' Bioinformatics, Vol. 21, no. 18, pp. 3587-3595, 2005 https://doi.org/10.1093/bioinformatics/bti565