Browse > Article
http://dx.doi.org/10.5808/GI.2010.8.2.097

A Simple GUI-based Sequencing Format Conversion Tool for the Three NGS Platforms  

Rhie, A-Rang (Department of Computer Science, Ewha Womans University)
Yang, San-Duk (Department of Computer Science, Ewha Womans University)
Lee, Kyung-Eun (Department of Computer Science, Ewha Womans University)
Thong, Chin Ting (Department of Computer Science, Ewha Womans University)
Park, Hyun-Seok (Department of Computer Science, Ewha Womans University)
Abstract
To allow for a quick conversion of the proprietary sequence data from various sequencing platforms, sequence format conversion toolkits are required that can be easily integrated into workflow systems. In this respect, a format conversion tool, as well as quality conversion tool would be the minimum requirements to integrate reads from different platforms. We have developed the Pyrus NGS Sequencing Format Converter, a simple software toolkit which allows to convert three kinds of Next Generation Sequencing reads, into commonly used fasta or fastq formats. The converter modules are all implemented, uniformly, in Java GUI modules that can be integrated in software applications for displaying the data content in the same format.
Keywords
sequence format conversion; next generation sequencing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Rothberg, J.M., and Leamon, J.H. (2008). The development and impact of 454 sequencing. Nat. Biotechnol. 26, 1117-1124.   DOI
2 Shendure, J., and Ji, H. (2008). Next-generation DNA sequencing. Nat. Biotechnol. 26, 1135-1145.   DOI
3 Cock, P., Fields, C., Goto, N., Heuer, M., and Rice, P. (2010). The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants, Nucl. Acids Res. 38, 1767-1771.   DOI   ScienceOn
4 Droege, M., and Hill, B. (2008). The Genome Sequencer FLX System-longer reads, more applications, straight forward bioinformatics and more complete data sets. J. Biotechnol. 136, 3-10.   DOI
5 Giardine, B., Riemer, C., Hardison, R.C., Burhans, R., Elnitski, L., Shah, P., Zhang, Y., Blankenberg, D., Albert, I., Taylor, J., Miller, W., Kent, W.J., and Nekrutenko, A. (2005). Galaxy: a platform for interactive large-scale genome analysis. Genome Res. (10), 1451-1455.   DOI   ScienceOn
6 Harismendy, O., Ng, P.C., Strausberg, R.L., Wang, X., Stockwell, T.B., Beeson, K.Y., Schork, N.J., Murray, S.S., Topol, E.J., Levy, S., and Frazer, K.A. (2009). Evaluation of next generation sequencing platforms for population targeted sequencing studies. Genome Biol. 10, R32.   DOI
7 Horner, D.S., Pavesi, G., Castrignano, T., De Meo, P.D., Liuni, S., Sammeth, M., Picardi, E., and Pesole, G. (2010). Bioinformatics approaches for genomics and post genomics applications of next-generation sequencing. Brief. Bioinfo. 11, 181-197.   DOI
8 MacLean, D., Jones J.D., and Studholme, D.J. (2009). Application of 'next-generation' sequencing technologies to microbial genetics, Nat. Rev. Microbiol. 7, 287-296.
9 Pandey, V., Nutter, R.C., and E, E.P. (2008). Applied Biosystems SOLiD system: ligation-based sequencing. In Next Generation Genome Sequencing: towards personalized medicine, Janitz, M, ed. Weinheim, Wiley-VCH, pp. 29-41.
10 Miller, J., Koren, S., and Sutton, G. (2010). Assembly algorithms for next-generation sequencing data, Genomics 95, 315-327.   DOI
11 Porreca, G.J., Shendure, J., and Church, G.M. (2006). Polony DNA sequencing. Curr ProtocMol. Biol. Chapter 7:Unit:7-8.
12 Bennett, S. (2004). Solexa Ltd. Pharmacogenomics 5:433- 438.   DOI