DOI QR코드

DOI QR Code

TRAPR: R Package for Statistical Analysis and Visualization of RNA-Seq Data

  • Lim, Jae Hyun (Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics and Systems Biomedical Informatics Research Center, Seoul National University College of Medicine) ;
  • Lee, Soo Youn (Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics and Systems Biomedical Informatics Research Center, Seoul National University College of Medicine) ;
  • Kim, Ju Han (Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics and Systems Biomedical Informatics Research Center, Seoul National University College of Medicine)
  • Received : 2017.01.23
  • Accepted : 2017.02.15
  • Published : 2017.03.31

Abstract

High-throughput transcriptome sequencing, also known as RNA sequencing (RNA-Seq), is a standard technology for measuring gene expression with unprecedented accuracy. Numerous bioconductor packages have been developed for the statistical analysis of RNA-Seq data. However, these tools focus on specific aspects of the data analysis pipeline, and are difficult to appropriately integrate with one another due to their disparate data structures and processing methods. They also lack visualization methods to confirm the integrity of the data and the process. In this paper, we propose an R-based RNA-Seq analysis pipeline called TRAPR, an integrated tool that facilitates the statistical analysis and visualization of RNA-Seq expression data. TRAPR provides various functions for data management, the filtering of low-quality data, normalization, transformation, statistical analysis, data visualization, and result visualization that allow researchers to build customized analysis pipelines.

Keywords

References

  1. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 2009;10:57-63. https://doi.org/10.1038/nrg2484
  2. Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods 2011;8:469-477. https://doi.org/10.1038/nmeth.1613
  3. Auer PL, Srivastava S, Doerge RW. Differential expression: the next generation and beyond. Brief Funct Genomics 2012;11:57-62. https://doi.org/10.1093/bfgp/elr041
  4. Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004;5:R80. https://doi.org/10.1186/gb-2004-5-10-r80
  5. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010;26:139-140. https://doi.org/10.1093/bioinformatics/btp616
  6. Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol 2010;11:R106. https://doi.org/10.1186/gb-2010-11-10-r106
  7. Wang L, Feng Z, Wang X, Wang X, Zhang X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics 2010;26:136-138. https://doi.org/10.1093/bioinformatics/btp612
  8. Hardcastle TJ, Kelly KA. baySeq: empirical Bayesian methods for identifying differential expression in sequence count data. BMC Bioinformatics 2010;11:422. https://doi.org/10.1186/1471-2105-11-422
  9. Calza S, Raffelsberger W, Ploner A, Sahel J, Leveillard T, Pawitan Y. Filtering genes to improve sensitivity in oligonucleotide microarray data analysis. Nucleic Acids Res 2007;35:e102. https://doi.org/10.1093/nar/gkm537
  10. Huber W, von Heydebreck A, Sultmann H, Poustka A, Vingron M. Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics 2002;18 Suppl 1:S96-S104. https://doi.org/10.1093/bioinformatics/18.suppl_1.S96
  11. Kadota K, Nishiyama T, Shimizu K. A normalization strategy for comparing tag count data. Algorithms Mol Biol 2012;7:5. https://doi.org/10.1186/1748-7188-7-5
  12. Tarazona S, Garcia-Alcalde F, Dopazo J, Ferrer A, Conesa A. Differential expression in RNA-seq: a matter of depth. Genome Res 2011;21:2213-2223. https://doi.org/10.1101/gr.124321.111