DOI QR코드

DOI QR Code

COEX-Seq: Convert a Variety of Measurements of Gene Expression in RNA-Seq

  • Kim, Sang Cheol (Division of Bio-Medical Informatics, Center for Genome Science, National Institute of Health, Korea Centers for Disease Control and Prevention) ;
  • Yu, Donghyeon (Department of Statistics, Inha University) ;
  • Cho, Seong Beom (Division of Bio-Medical Informatics, Center for Genome Science, National Institute of Health, Korea Centers for Disease Control and Prevention)
  • Received : 2018.10.15
  • Accepted : 2018.11.14
  • Published : 2018.12.31

Abstract

Next generation sequencing (NGS), a high-throughput DNA sequencing technology, is widely used for molecular biological studies. In NGS, RNA-sequencing (RNA-Seq), which is a short-read massively parallel sequencing, is a major quantitative transcriptome tool for different transcriptome studies. To utilize the RNA-Seq data, various quantification and analysis methods have been developed to solve specific research goals, including identification of differentially expressed genes and detection of novel transcripts. Because of the accumulation of RNA-Seq data in the public databases, there is a demand for integrative analysis. However, the available RNA-Seq data are stored in different formats such as read count, transcripts per million, and fragments per kilobase million. This hinders the integrative analysis of the RNA-Seq data. To solve this problem, we have developed a web-based application using Shiny, COEX-seq (Convert a Variety of Measurements of Gene Expression in RNA-Seq) that easily converts data in a variety of measurement formats of gene expression used in most bioinformatic tools for RNA-Seq. It provides a workflow that includes loading data set, selecting measurement formats of gene expression, and identifying gene names. COEX-seq is freely available for academic purposes and can be run on Windows, Mac OS, and Linux operating systems. Source code, sample data sets, and supplementary documentation are available as well.

Keywords

References

  1. Goncalves A, Tikhonov A, Brazma A, Kapushesky M. A pipeline for RNA-seq data processing and quality assessment. Bioinformatics 2011;27:867-869. https://doi.org/10.1093/bioinformatics/btr012
  2. Law CW, Chen Y, Shi W, Smyth GK. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol 2014;15:R29. https://doi.org/10.1186/gb-2014-15-2-r29
  3. Wagner GP, Kin K, Lynch VJ. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci 2012;131:281-285. https://doi.org/10.1007/s12064-012-0162-3
  4. Li B, Ruotti V, Stewart RM, Thomson JA, Dewey CN. RNA-Seq gene expression estimation with read mapping uncertainty. Bioinformatics 2010;26:493-500. https://doi.org/10.1093/bioinformatics/btp692
  5. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 2008;5:621-628. https://doi.org/10.1038/nmeth.1226
  6. Collado-Torres L, Nellore A, Kammers K, Ellis SE, Taub MA, Hansen KD, et al. Reproducible RNA-seq analysis using recount2. Nat Biotechnol 2017;35:319-321. https://doi.org/10.1038/nbt.3838
  7. RStudio Team. RStudio: Integrated Development for R. Boston: RStudio, Inc., 2015. Accessed 2018 Oct 1. Available from: http://www.rstudio.com/.
  8. RStudio Team. shiny: Web Application Framework for R. Boston: RStudio, Inc., 2013. Accessed 2018 Oct 1. Available from: http://shiny.rstudio.com/.
  9. Pachter L. Models for transcript quantification from RNA-Seq. Ithaca: Cornell University Library, 2011. Accessed 2018 Oct 1. Available from: http://arxiv.org/abs/1104.3889.