CaGe: A Web-Based Cancer Gene Annotation System for Cancer Genomics

  • Received : 2012.01.26
  • Accepted : 2012.02.16
  • Published : 2012.03.31

Abstract

High-throughput genomic technologies (HGTs), including next-generation DNA sequencing (NGS), microarray, and serial analysis of gene expression (SAGE), have become effective experimental tools for cancer genomics to identify cancer-associated somatic genomic alterations and genes. The main hurdle in cancer genomics is to identify the real causative mutations or genes out of many candidates from an HGT-based cancer genomic analysis. One useful approach is to refer to known cancer genes and associated information. The list of known cancer genes can be used to determine candidates of cancer driver mutations, while cancer gene-related information, including gene expression, protein-protein interaction, and pathways, can be useful for scoring novel candidates. Some cancer gene or mutation databases exist for this purpose, but few specialized tools exist for an automated analysis of a long gene list from an HGT-based cancer genomic analysis. This report presents a new web-accessible bioinformatic tool, called CaGe, a cancer genome annotation system for the assessment of candidates of cancer genes from HGT-based cancer genomics. The tool provides users with information on cancer-related genes, mutations, pathways, and associated annotations through annotation and browsing functions. With this tool, researchers can classify their candidate genes from cancer genome studies into either previously reported or novel categories of cancer genes and gain insight into underlying carcinogenic mechanisms through a pathway analysis. We show the usefulness of CaGe by assessing its performance in annotating somatic mutations from a published small cell lung cancer study.
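The classification step described in the abstract, which sorts a candidate gene list into previously reported and novel categories, can be illustrated with a minimal sketch. This is not CaGe's implementation. The function name `classify_candidates`, the output keys, and the gene symbols are hypothetical placeholders. The sketch assumes that matching is done on case-insensitive gene symbols against a user-supplied list of known cancer genes.

```python
from typing import Dict, Iterable, List


def classify_candidates(
    candidates: Iterable[str],
    known_cancer_genes: Iterable[str],
) -> Dict[str, List[str]]:
    """Split candidate gene symbols into previously reported and novel groups.

    Hypothetical helper for illustration only; matching is assumed to be
    a case-insensitive comparison of gene symbols.
    """
    # Normalize the reference list once so lookups are cheap.
    known = {g.strip().upper() for g in known_cancer_genes if g.strip()}

    reported: List[str] = []
    novel: List[str] = []
    seen = set()

    for gene in candidates:
        symbol = gene.strip().upper()
        # Skip blank entries and duplicates while keeping input order.
        if not symbol or symbol in seen:
            continue
        seen.add(symbol)
        if symbol in known:
            reported.append(symbol)
        else:
            novel.append(symbol)

    return {"previously_reported": reported, "novel": novel}


if __name__ == "__main__":
    # Placeholder symbols, not real gene names.
    known_list = ["GENE_A", "GENE_B"]
    candidate_list = ["gene_a", "GENE_C", "GENE_A ", ""]
    print(classify_candidates(candidate_list, known_list))
```

In practice the known-gene list would come from a curated resource, and the novel group would then be passed on to pathway or interaction annotation for scoring, as the abstract outlines.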
