Browse > Article
http://dx.doi.org/10.5808/gi.20038

Whole-genome sequence analysis through online web interfaces: a review  

Gunasekara, A.W.A.C.W.R. (Veterinary Medical Center and College of Veterinary Medicine, Jeonbuk National University)
Rajapaksha, L.G.T.G. (Veterinary Medical Center and College of Veterinary Medicine, Jeonbuk National University)
Tung, T.L. (Department of Botany, Dagon University)
Abstract
The recent development of whole-genome sequencing technologies paved the way for understanding the genomes of microorganisms. Every whole-genome sequencing (WGS) project requires a considerable cost and a massive effort to address the questions at hand. The final step of WGS is data analysis. The analysis of whole-genome sequence is dependent on highly sophisticated bioinformatics tools that the research personal have to buy. However, many laboratories and research institutions do not have the bioinformatics capabilities to analyze the genomic data and therefore, are unable to take maximum advantage of whole-genome sequencing. In this aspect, this study provides a guide for research personals on a set of bioinformatics tools available online that can be used to analyze whole-genome sequence data of bacterial genomes. The web interfaces described here have many advantages and, in most cases exempting the need for costly analysis tools and intensive computing resources.
Keywords
average nucleotide identitys; online web servers; single nucleotide polymorphisms; virulence factors; whole-genome sequencing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 1977;74:5463-5467.   DOI
2 Austin CP. The impact of the completed human genome sequence on the development of novel therapeutics for human disease. Annu Rev Med 2004;55:1-13.   DOI
3 Chattopadhyay GP. Technologies in the Era of Singularity. Chennai: Notion Press, 2018.
4 Oakeson KF, Wagner JM, Mendenhall M, Rohrwasser A, Atkinson-Dunn R. Bioinformatic analyses of whole-genome sequence data in a public health laboratory. Emerg Infect Dis 2017;23:1441-1445.   DOI
5 Abril JF, Castellano S. Genome annotation. In: Encyclopedia of Bioinformatics and Computational Biology (Ranganathan S, Gribskov M, Nakai K, Schonbach C, eds.). Oxford: Academic Press, 2019. pp. 195-209.
6 Ekblom R, Wolf JB. A field guide to whole-genome sequencing, assembly and annotation. Evol Appl 2014;7:1026-1042.   DOI
7 Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 2008;9:75.   DOI
8 Ibarz Pavon AB, Maiden MC. Multilocus sequence typing. In: Molecular Epidemiology of Microorganisms, Vol. 551 (Caugant DA, ed.). New York: Humana Press, 2009. pp. 129-140.
9 Brussow H, Canchaya C, Hardt WD. Phages and the evolution of bacterial pathogens: from genomic rearrangements to lysogenic conversion. Microbiol Mol Biol Rev 2004;68:560-602.   DOI
10 Fortier LC, Sekulovic O. Importance of prophages to evolution and virulence of bacterial pathogens. Virulence 2013;4:354-365.   DOI
11 Schurch AC, Siezen RJ. Genomic tracing of epidemics and disease outbreaks. Microb Biotechnol 2010;3:628-633.   DOI
12 Pightling AW, Pettengill JB, Luo Y, Baugher JD, Rand H, Strain E. Interpreting whole-genome sequence analyses of foodborne bacteria for regulatory applications and outbreak investigations. Front Microbiol 2018;9:1482.   DOI
13 Kaas RS, Leekitcharoenphon P, Aarestrup FM, Lund O. Solving the problem of comparing whole bacterial genomes across different sequencing platforms. PLoS One 2014;9:e104984.   DOI
14 Holmes RK, Jobling MG. Genetics. In: Medical Microbiology (Baron S, ed.). Galveston: University of Texas Medical Branch at Galveston, 1996.
15 Bryant J, Chewapreecha C, Bentley SD. Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences. Future Microbiol 2012;7:1283-1296.   DOI
16 Tatusov RL, Galperin MY, Natale DA, Koonin EV. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 2000;28:33-36.   DOI
17 Larsen MV, Cosentino S, Rasmussen S, Friis C, Hasman H, Marvig RL, et al. Multilocus sequence typing of total-genome-sequenced bacteria. J Clin Microbiol 2012;50:1355-1361.   DOI
18 Kwong JC, McCallum N, Sintchenko V, Howden BP. Whole genome sequencing in clinical and public health microbiology. Pathology 2015;47:199-210.   DOI
19 Koonin EV, Galperin MY. Genome annotation and analysis. In: Sequence - Evolution - Function: Computational Approaches in Comparative Genomics (Koonin EV, Galperin MY, eds.). Boston: Kluwer Academic Press, 2003. pp. 193-226.
20 Robles JC, Koreen L, Park S, Perlin DS. Multilocus sequence typing is a reliable alternative method to DNA fingerprinting for discriminating among strains of Candida albicans. J Clin Microbiol 2004;42:2480-2488.   DOI
21 Liu YY, Lin JW, Chen CC. cano-wgMLST_BacCompare: a bacterial genome analysis platform for epidemiological investigation and comparative genomic analysis. Front Microbiol 2019;10:1687.   DOI
22 Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res 2016;44:W16-W21.   DOI
23 Galperin MY, Kristensen DM, Makarova KS, Wolf YI, Koonin EV. Microbial genome analysis: the COG approach. Brief Bioinform 2019;20:1063-1070.   DOI
24 Brown TA. Genomes. 2nd ed. Oxford: Wiley-Liss, 2002.
25 Peterson JW. Bacterial pathogenesis. In: Medical Microbiology (Baron S, ed.). Galveston: University of Texas Medical Branch at Galveston, 1996.
26 Stothard P, Grant JR, Van Domselaar G. Visualizing and comparing circular genomes using the CGView family of tools. Brief Bioinform 2019;20:1576-1582.   DOI
27 Aanensen DM, Spratt BG. The multilocus sequence typing network: mlst.net. Nucleic Acids Res 2005;33:W728-W733.   DOI
28 Kovanen SM, Kivisto RI, Rossi M, Schott T, Karkkainen UM, Tuuminen T, et al. Multilocus sequence typing (MLST) and whole-genome MLST of Campylobacter jejuni isolates from human infections in three districts during a seasonal peak in Finland. J Clin Microbiol 2014;52:4147-4154.   DOI
29 Jain C, Rodriguez RL, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 2018;9:5114.   DOI
30 Richter M, Rossello-Mora R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A 2009;106:19126-19131.   DOI
31 Chen L, Zheng D, Liu B, Yang J, Jin Q. VFDB 2016: hierarchical and refined dataset for big data analysis: 10 years on. Nucleic Acids Res 2016;44:D694-D697.   DOI
32 Staden R. A strategy of DNA sequencing employing computer programs. Nucleic Acids Res 1979;6:2601-2610.   DOI
33 Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 1995;269:496-512.   DOI
34 Hood L, Rowen L. The Human Genome Project: big science transforms biology and medicine. Genome Med 2013;5:79.   DOI
35 Lopez-Fernandez S, Sonego P, Moretto M, Pancher M, Engelen K, Pertot I, et al. Whole-genome comparative analysis of virulence genes unveils similarities and differences between endophytes and other symbiotic bacteria. Front Microbiol 2015;6:419.   DOI
36 Fitz-Gibbon ST, House CH. Whole genome-based phylogenetic analysis of free-living microorganisms. Nucleic Acids Res 1999;27:4218-4222.   DOI
37 Zuo G, Hao B. CVTree3 Web Server for Whole-genome-based and alignment-free prokaryotic phylogeny and taxonomy. Genomics Proteomics Bioinformatics 2015;13:321-331.   DOI
38 Wyres KL, Conway TC, Garg S, Queiroz C, Reumann M, Holt K, et al. WGS analysis and interpretation in clinical and public health microbiology laboratories: what are the requirements and how do existing tools compare? Pathogens 2014;3:437-458.   DOI
39 Yang Q, Dong X, Xie G, Fu S, Zou P, Sun J, et al. Comparative genomic analysis unravels the transmission pattern and intra-species divergence of acute hepatopancreatic necrosis disease (AH-PND)-causing Vibrio parahaemolyticus strains. Mol Genet Genomics 2019;294:1007-1022.   DOI
40 Sarowska J, Futoma-Koloch B, Jama-Kmiecik A, Frej-Madrzak M, Ksiazczyk M, Bugla-Ploskonska G, et al. Virulence factors, prevalence and potential transmission of extraintestinal pathogenic Escherichia coli isolated from different sources: recent reports. Gut Pathog 2019;11:10.   DOI
41 de Been M, Pinholt M, Top J, Bletz S, Mellmann A, van Schaik W, et al. Core genome multilocus sequence typing scheme for high-resolution typing of Enterococcus faecium. J Clin Microbiol 2015;53:3788-3797.   DOI
42 Saraswathy N, Ramalingam P. Concepts and Techniques in Genomics and Proteomics. Amsterdam: Elsevier, 2011. pp. 77-159.
43 Richter M, Rossello-Mora R, Oliver Glockner F, Peplies J. JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 2016;32:929-931.   DOI