DOI QR코드

DOI QR Code

A Simple GUI-based Sequencing Format Conversion Tool for the Three NGS Platforms

  • Rhie, A-Rang (Department of Computer Science, Ewha Womans University) ;
  • Yang, San-Duk (Department of Computer Science, Ewha Womans University) ;
  • Lee, Kyung-Eun (Department of Computer Science, Ewha Womans University) ;
  • Thong, Chin Ting (Department of Computer Science, Ewha Womans University) ;
  • Park, Hyun-Seok (Department of Computer Science, Ewha Womans University)
  • Accepted : 2010.06.17
  • Published : 2010.06.30

Abstract

To allow for a quick conversion of the proprietary sequence data from various sequencing platforms, sequence format conversion toolkits are required that can be easily integrated into workflow systems. In this respect, a format conversion tool, as well as quality conversion tool would be the minimum requirements to integrate reads from different platforms. We have developed the Pyrus NGS Sequencing Format Converter, a simple software toolkit which allows to convert three kinds of Next Generation Sequencing reads, into commonly used fasta or fastq formats. The converter modules are all implemented, uniformly, in Java GUI modules that can be integrated in software applications for displaying the data content in the same format.

Keywords

References

  1. Bennett, S. (2004). Solexa Ltd. Pharmacogenomics 5:433- 438. https://doi.org/10.1517/14622416.5.4.433
  2. Cock, P., Fields, C., Goto, N., Heuer, M., and Rice, P. (2010). The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants, Nucl. Acids Res. 38, 1767-1771. https://doi.org/10.1093/nar/gkp1137
  3. Droege, M., and Hill, B. (2008). The Genome Sequencer FLX System-longer reads, more applications, straight forward bioinformatics and more complete data sets. J. Biotechnol. 136, 3-10. https://doi.org/10.1016/j.jbiotec.2008.03.021
  4. Giardine, B., Riemer, C., Hardison, R.C., Burhans, R., Elnitski, L., Shah, P., Zhang, Y., Blankenberg, D., Albert, I., Taylor, J., Miller, W., Kent, W.J., and Nekrutenko, A. (2005). Galaxy: a platform for interactive large-scale genome analysis. Genome Res. (10), 1451-1455. https://doi.org/10.1101/gr.4086505
  5. Harismendy, O., Ng, P.C., Strausberg, R.L., Wang, X., Stockwell, T.B., Beeson, K.Y., Schork, N.J., Murray, S.S., Topol, E.J., Levy, S., and Frazer, K.A. (2009). Evaluation of next generation sequencing platforms for population targeted sequencing studies. Genome Biol. 10, R32. https://doi.org/10.1186/gb-2009-10-3-r32
  6. Horner, D.S., Pavesi, G., Castrignano, T., De Meo, P.D., Liuni, S., Sammeth, M., Picardi, E., and Pesole, G. (2010). Bioinformatics approaches for genomics and post genomics applications of next-generation sequencing. Brief. Bioinfo. 11, 181-197. https://doi.org/10.1093/bib/bbp046
  7. MacLean, D., Jones J.D., and Studholme, D.J. (2009). Application of 'next-generation' sequencing technologies to microbial genetics, Nat. Rev. Microbiol. 7, 287-296.
  8. Miller, J., Koren, S., and Sutton, G. (2010). Assembly algorithms for next-generation sequencing data, Genomics 95, 315-327. https://doi.org/10.1016/j.ygeno.2010.03.001
  9. Pandey, V., Nutter, R.C., and E, E.P. (2008). Applied Biosystems SOLiD system: ligation-based sequencing. In Next Generation Genome Sequencing: towards personalized medicine, Janitz, M, ed. Weinheim, Wiley-VCH, pp. 29-41.
  10. Porreca, G.J., Shendure, J., and Church, G.M. (2006). Polony DNA sequencing. Curr ProtocMol. Biol. Chapter 7:Unit:7-8.
  11. Rothberg, J.M., and Leamon, J.H. (2008). The development and impact of 454 sequencing. Nat. Biotechnol. 26, 1117-1124. https://doi.org/10.1038/nbt1485
  12. Shendure, J., and Ji, H. (2008). Next-generation DNA sequencing. Nat. Biotechnol. 26, 1135-1145. https://doi.org/10.1038/nbt1486

Cited by

  1. Making next-generation sequencing work for you: approaches and practical considerations for marker development and phylogenetics vol.5, pp.4, 2012, https://doi.org/10.1080/17550874.2012.745909