• Title/Summary/Keyword: Pan-genome analysis

Search Result 29, Processing Time 0.025 seconds

Comparative Genomic Analysis of Food-Originated Coagulase-Negative Staphylococcus: Analysis of Conserved Core Genes and Diversity of the Pan-Genome

  • Heo, Sojeong;Lee, Jung-Sug;Lee, Jong-Hoon;Jeong, Do-Won
    • Journal of Microbiology and Biotechnology
    • /
    • v.30 no.3
    • /
    • pp.341-351
    • /
    • 2020
  • To shed light on the genetic differences among food-originated coagulase-negative Staphylococcus (CNS), we performed pan-genome analysis of five species: Staphylococcus carnosus (two strains), Staphylococcus equorum (two strains), Staphylococcus succinus (three strains), Staphylococcus xylosus (two strains), and Staphylococcus saprophyticus (one strain). The pan-genome size increases with each new strain and currently holds about 4,500 genes from 10 genomes. Specific genes were shown to be strain dependent but not species dependent. Most specific genes were of unknown function or encoded restriction-modification enzymes, transposases, or prophages. Our results indicate that unique genes have been acquired or lost by convergent evolution within individual strains.

Comparative Genomics Reveals the Core and Accessory Genomes of Streptomyces Species

  • Kim, Ji-Nu;Kim, Yeonbum;Jeong, Yujin;Roe, Jung-Hye;Kim, Byung-Gee;Cho, Byung-Kwan
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.10
    • /
    • pp.1599-1605
    • /
    • 2015
  • The development of rapid and efficient genome sequencing methods has enabled us to study the evolutionary background of bacterial genetic information. Here, we present comparative genomic analysis of 17 Streptomyces species, for which the genome has been completely sequenced, using the pan-genome approach. The analysis revealed that 34,592 ortholog clusters constituted the pan-genome of these Streptomyces species, including 2,018 in the core genome, 11,743 in the dispensable genome, and 20,831 in the unique genome. The core genome was converged to a smaller number of genes than reported previously, with 3,096 gene families. Functional enrichment analysis showed that genes involved in transcription were most abundant in the Streptomyces pan-genome. Finally, we investigated core genes for the sigma factors, mycothiol biosynthesis pathway, and secondary metabolism pathways; our data showed that many genes involved in stress response and morphological differentiation were commonly expressed in Streptomyces species. Elucidation of the core genome offers a basis for understanding the functional evolution of Streptomyces species and provides insights into target selection for the construction of industrial strains.

Comparative analysis of core and pan-genomes of order Nitrosomonadales (Nitrosomonadales 목의 핵심유전체(core genome)와 범유전체(pan-genome)의 비교유전체학적 연구)

  • Lee, Jinhwan;Kim, Kyoung-Ho
    • Korean Journal of Microbiology
    • /
    • v.51 no.4
    • /
    • pp.329-337
    • /
    • 2015
  • All known genomes (N=10) in the order Nitrosomonadales were analyzed to contain 9,808 and 908 gene clusters in their pan-genome and core genome, respectively. Analyses with reference genomes belonging to other orders in Betaproteobacteria revealed that sizes of pan-genome and core genome were dependent on the number of genomes compared and the differences of genomes within a group. The sizes of pan-genomes of the genera Nitrosomonas and Nitrosospira were 7,180 and 4,586 and core genomes, 1,092 and 1,600, respectively, which implied that similarity of genomes in Nitrosospira were higher than Nitrosomonas. The genomes of Nitrosomonas contributed mostly to the size of the pan-genome and core genomes of Nitrosomonadales. COG analysis of gene clusters showed that the J (translation, ribosomal structure and biogenesis) category occupied the biggest proportions (9.7-21.0%) among COG categories in core genomes and its proportion increased in the group which genetic distances among members were high. The unclassified category (-) occupied very high proportions (34-51%) in pan-genomes. Ninety seven gene clusters existed only in Nitrosomonadales and not in reference genomes. The gene clusters contained ammonia monooxygenase (amoA and amoB) and -related genes (amoE and amoD) which were typical genes characterizing the order Nitrosomonadales while they contained significant amount (16-45%) of unclassified genes. Thus, these exclusively-conserved gene clusters might play an important role to reveal genetic specificity of the order Nitrosomonadales.

Assessment of Erythrobacter Species Diversity through Pan-Genome Analysis with Newly Isolated Erythrobacter sp. 3-20A1M

  • Cho, Sang-Hyeok;Jeong, Yujin;Lee, Eunju;Ko, So-Ra;Ahn, Chi-Yong;Oh, Hee-Mock;Cho, Byung-Kwan;Cho, Suhyung
    • Journal of Microbiology and Biotechnology
    • /
    • v.31 no.4
    • /
    • pp.601-609
    • /
    • 2021
  • Erythrobacter species are extensively studied marine bacteria that produce various carotenoids. Due to their photoheterotrophic ability, it has been suggested that they play a crucial role in marine ecosystems. It is essential to identify the genome sequence and the genes of the species to predict their role in the marine ecosystem. In this study, we report the complete genome sequence of the marine bacterium Erythrobacter sp. 3-20A1M. The genome size was 3.1 Mbp and its GC content was 64.8%. In total, 2998 genetic features were annotated, of which 2882 were annotated as functional coding genes. Using the genetic information of Erythrobacter sp. 3-20A1M, we performed pan-genome analysis with other Erythrobacter species. This revealed highly conserved secondary metabolite biosynthesis-related COG functions across Erythrobacter species. Through subsequent secondary metabolite biosynthetic gene cluster prediction and KEGG analysis, the carotenoid biosynthetic pathway was proven conserved in all Erythrobacter species, except for the spheroidene and spirilloxanthin pathways, which are only found in photosynthetic Erythrobacter species. The presence of virulence genes, especially the plant-algae cell wall degrading genes, revealed that Erythrobacter sp. 3-20A1M is a potential marine plant-algae scavenger.

High Resolution Whole Genome Multilocus Sequence Typing (wgMLST) Schemes for Salmonella enterica Weltevreden Epidemiologic Investigations

  • Tadee, Pakpoom;Tadee, Phacharaporn;Hitchings, Matthew D.;Pascoe, Ben;Sheppard, Samuel K.;Patchanee, Prapas
    • Microbiology and Biotechnology Letters
    • /
    • v.46 no.2
    • /
    • pp.162-170
    • /
    • 2018
  • Non-typhoidal Salmonella is one of the main pathogens causing food-borne illness in humans, with up to 20% of cases resulting from consumption of pork products. Over the gastroenteritis signs, multidrug resistant Salmonella has arisen. In this study, pan-susceptible phenotypic strains of Salmonella enterica serotype Weltevreden recovered from pig production chain in Chiang Mai, Thailand during 2012-2014 were chosen for analysis. The aim of this study was to use whole genome sequencing (WGS) data with an emphasis on antimicrobial resistance gene investigation to assess their pathogenic potential and genetic diversity determination based on whole genome Multilocus Sequence Typing (wgMLST) to expand epidemiological knowledge and to provide additional guidance for disease control. Analyis using ResFinder 3.0 for WGS database tracing found that one of pan-susceptible phenotypic strain carried five classes of resistance genes: aminoglycoside, beta-lactam, phenicol, sulfonamide, and tetracycline associated genes. Twenty four and 36 loci differences were detected by core genome Multilocus Sequence Typing (cgMLST) and pan genome Multilocus Sequence Typing (pgMLST), respectively, in two matching strains (44/13 vs A543057 and A543056 vs 204/13) initially assigned by conventional MLST and Pulsed-field Gel Electrophoresis (PFGE). One hundread percent discriminant ability can be achieved using the wgMLST technique. WGS is currently the ultimate molecular technique for various in-depth studies. As the findings stated above, a new of "gold standard typing method era" for routine works in genome study is being set.

Complete genome sequencing and comparative genomic analysis of Lactobacillus acidophilus C5 as a potential canine probiotics

  • Son, Seungwoo;Lee, Raham;Park, Seung-Moon;Lee, Sung Ho;Lee, Hak-Kyo;Kim, Yangseon;Shin, Donghyun
    • Journal of Animal Science and Technology
    • /
    • v.63 no.6
    • /
    • pp.1411-1422
    • /
    • 2021
  • Lactobacillus acidophilus is a gram-positive, microaerophilic, and acidophilic bacterial species. L. acidophilus strains in the gastrointestinal tracts of humans and other animals have been profiled, but strains found in the canine gut have not been studied yet. Our study helps in understanding the genetic features of the L. acidophilus C5 strain found in the canine gut, determining its adaptive features evolved to survive in the canine gut environment, and in elucidating its probiotic functions. To examine the canine L. acidophilus C5 genome, we isolated the C5 strain from a Korean dog and sequenced it using PacBio SMRT sequencing technology. A comparative genomic approach was used to assess genetic relationships between C5 and six other strains and study the distinguishing features related to different hosts. We found that most genes in the C5 strain were related to carbohydrate transport and metabolism. The pan-genome of seven L. acidophilus strains contained 2,254 gene families, and the core genome contained 1,726 gene families. The phylogenetic tree of the core genes in the canine L. acidophilus C5 strain was very close to that of two strains (DSM20079 and NCFM) from humans. We identified 30 evolutionarily accelerated genes in the L. acidophilus C5 strain in the ratio of non-synonymous to synonymous substitutions (dN/dS) analysis. Five of these thirty genes were associated with carbohydrate transport and metabolism. This study provides insights into genetic features and adaptations of the L. acidophilus C5 strain to survive the canine intestinal environment. It also suggests that the evolution of the L. acidophilus genome is closely related to the host's evolutionary adaptation process.

High-quality draft genome and characterization of commercially potent probiotic Lactobacillus strains

  • Sulthana, Ayesha;Lakshmi, Suvarna G.;Madempudi, Ratna Sudha
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.43.1-43.5
    • /
    • 2019
  • Lactobacillus acidophilus UBLA-34, L. paracasei UBLPC-35, L. plantarum UBLP-40, and L. reuteri UBLRU-87 were isolated from different varieties of fermented foods. To determine the probiotic safety at the strain level, the whole genome of the respective strains was sequenced, assembled, and characterized. Both the core-genome and pan-genome phylogeny showed that L. reuteri was closest to L. plantarum than to L. acidophilus, which was closest to L. paracasei. The genomic analysis of all the strains confirmed the absence of genes encoding putative virulence factors, antibiotic resistance, and the plasmids.

Development of a novel endolysin, PanLys.1, for the specific inhibition of Peptostreptococcus anaerobius

  • Joonbeom Moon;Hanbeen Kim;Dongseok Lee;Jakyeom Seo
    • Animal Bioscience
    • /
    • v.36 no.8
    • /
    • pp.1285-1292
    • /
    • 2023
  • Objective: The objective of this study was to develop a novel endolysin (PanLys.1) for the specific killing of the ruminal hyper-ammonia-producing bacterium Peptostreptococcus anaerobius (P. anaerobius). Methods: Whole genome sequences of P. anaerobius strains and related bacteriophages were collected from the National Center for Biotechnology Information database, and the candidate gene for PanLys.1 was isolated based on amino acid sequences and conserved domain database (CDD) analysis. The gene was overexpressed using a pET system in Escherichia coli BL21 (DE3). The lytic activity of PanLys.1 was evaluated under various conditions (dosage, pH, temperature, NaCl, and metal ions) to determine the optimal lytic activity conditions. Finally, the killing activity of PanLys.1 against P. anaerobius was confirmed using an in vitro rumen fermentation system. Results: CDD analysis showed that PanLys.1 has a modular design with a catalytic domain, amidase-2, at the N-terminal, and a cell wall binding domain, from the CW-7 superfamily, at the C-terminal. The lytic activity of PanLys.1 against P. anaerobius was the highest at pH 8.0 (p<0.05) and was maintained at 37℃ to 45℃, and 0 to 250 mM NaCl. The activity of PanLys.1 significantly decreased (p<0.05) after Mn2+ or Zn2+ treatment. The relative abundance of P. anaerobius did not decrease after administration PanLys.1 under in vitro rumen conditions. Conclusion: The application of PanLys.1 to modulate P. anaerobius in the rumen might not be feasible because its lytic activity was not observed in in vitro rumen system.

Comparative Genomics Uncovers the Genetic Diversity and Synthetic Biology of Secondary Metabolite Production of Trametes

  • Zhang, Yan;Wang, Jingjing;Yajun, Chen;Zhou, Minghui;Wang, Wei;Geng, Ming;Xu, Decong;Xu, Zhongdong
    • Mycobiology
    • /
    • v.48 no.2
    • /
    • pp.104-114
    • /
    • 2020
  • The carbohydrate-active enzyme (CAZyme) genes of Trametes contribute to polysaccharide degradation. However, the comprehensive analysis of the composition of CAZymes and the biosynthetic gene clusters (BGCs) of Trametes remain unclear. Here, we conducted comparative analysis, detected the CAZyme genes, and predicted the BGCs for nine Trametes strains. Among the 82,053 homologous clusters obtained for Trametes, we identified 8518 core genes, 60,441 accessory genes, and 13,094 specific genes. A large proportion of CAZyme genes were cataloged into glycoside hydrolases, glycosyltransferases, and carbohydrate esterases. The predicted BGCs of Trametes were divided into six strategies, and the nine Trametes strains harbored 47.78 BGCs on average. Our study revealed that Trametes exhibits an open pan-genome structure. These findings provide insights into the genetic diversity and explored the synthetic biology of secondary metabolite production for Trametes.

Comparative Genome Analysis of Psychrobacillus Strain PB01, Isolated from an Iceberg

  • Choi, Jun Young;Kim, Sun Chang;Lee, Pyung Cheon
    • Journal of Microbiology and Biotechnology
    • /
    • v.30 no.2
    • /
    • pp.237-243
    • /
    • 2020
  • A novel psychrotolerant Psychrobacillus strain PB01, isolated from an Antarctic iceberg, was comparatively analyzed with five related strains. The complete genome of strain PB01 consists of a single circular chromosome (4.3 Mb) and a plasmid (19 Kb). As potential low-temperature adaptation strategies, strain PB01 has four genes encoding cold-shock proteins, two genes encoding DEAD-box RNA helicases, and eight genes encoding transporters for glycine betaine, which can serve as a cryoprotectant, on the genome. The pan-genome structure of the six Psychrobacillus strains suggests that strain PB01 might have evolved to adapt to extreme environments by changing its genome content to gain higher capacity for DNA repair, translation, and membrane transport. Notably, strain PB01 possesses a complete TCA cycle consisting of eight enzymes as well as three additional Helicobacter pylori-type enzymes: ferredoxin-dependent 2-oxoglutarate synthase, succinyl-CoA/acetoacetyl-CoA transferase, and malate/quinone oxidoreductase. The co-existence of the genes for TCA cycle enzymes has also been identified in the other five Psychrobacillus strains.