• Title/Summary/Keyword: pan-genome


Comparative Genomics Reveals the Core and Accessory Genomes of Streptomyces Species

  • Kim, Ji-Nu;Kim, Yeonbum;Jeong, Yujin;Roe, Jung-Hye;Kim, Byung-Gee;Cho, Byung-Kwan
    • Journal of Microbiology and Biotechnology / v.25 no.10 / pp.1599-1605 / 2015
  • The development of rapid and efficient genome sequencing methods has enabled us to study the evolutionary background of bacterial genetic information. Here, we present a comparative genomic analysis of 17 Streptomyces species whose genomes have been completely sequenced, using the pan-genome approach. The analysis revealed that 34,592 ortholog clusters constituted the pan-genome of these Streptomyces species, including 2,018 in the core genome, 11,743 in the dispensable genome, and 20,831 in the unique genome. The core genome converged to a smaller number of genes than reported previously, with 3,096 gene families. Functional enrichment analysis showed that genes involved in transcription were most abundant in the Streptomyces pan-genome. Finally, we investigated core genes for the sigma factors, mycothiol biosynthesis pathway, and secondary metabolism pathways; our data showed that many genes involved in stress response and morphological differentiation were commonly expressed in Streptomyces species. Elucidation of the core genome offers a basis for understanding the functional evolution of Streptomyces species and provides insights into target selection for the construction of industrial strains.
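
The core / dispensable / unique split described in this abstract can be illustrated with a minimal sketch. It assumes the common convention that a cluster is core when present in every genome, unique when present in exactly one, and dispensable otherwise; the function name, cluster IDs, and genome labels below are hypothetical, and the paper's actual clustering pipeline is not reproduced here.

```python
from collections import Counter


def classify_clusters(presence, n_genomes):
    """Label each ortholog cluster by how many genomes contain it.

    presence: dict mapping a cluster ID to the set of genomes it occurs in.
    n_genomes: total number of genomes compared.
    Assumed convention: all genomes -> core, one genome -> unique,
    anything in between -> dispensable.
    """
    labels = {}
    for cluster, genomes in presence.items():
        k = len(genomes)
        if k == 0:
            raise ValueError(f"cluster {cluster!r} occurs in no genome")
        if k == n_genomes:
            labels[cluster] = "core"
        elif k == 1:
            labels[cluster] = "unique"
        else:
            labels[cluster] = "dispensable"
    return labels


# Hypothetical toy input: four clusters across three genomes.
toy_presence = {
    "cluster_1": {"genome_A", "genome_B", "genome_C"},
    "cluster_2": {"genome_A", "genome_B"},
    "cluster_3": {"genome_C"},
    "cluster_4": {"genome_A"},
}
toy_labels = classify_clusters(toy_presence, n_genomes=3)
toy_counts = Counter(toy_labels.values())

# The three partitions reported in the abstract add up to the pan-genome size.
reported = {"core": 2018, "dispensable": 11743, "unique": 20831}
reported_pan_genome = sum(reported.values())
```

The three categories are mutually exclusive by construction, which is why the reported partition sizes sum to the reported pan-genome total.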

Comparative Genomic Analysis of Food-Originated Coagulase-Negative Staphylococcus: Analysis of Conserved Core Genes and Diversity of the Pan-Genome

  • Heo, Sojeong;Lee, Jung-Sug;Lee, Jong-Hoon;Jeong, Do-Won
    • Journal of Microbiology and Biotechnology / v.30 no.3 / pp.341-351 / 2020
  • To shed light on the genetic differences among food-originated coagulase-negative Staphylococcus (CNS), we performed pan-genome analysis of five species: Staphylococcus carnosus (two strains), Staphylococcus equorum (two strains), Staphylococcus succinus (three strains), Staphylococcus xylosus (two strains), and Staphylococcus saprophyticus (one strain). The pan-genome size increases with each new strain and currently holds about 4,500 genes from 10 genomes. Specific genes were shown to be strain dependent but not species dependent. Most specific genes were of unknown function or encoded restriction-modification enzymes, transposases, or prophages. Our results indicate that unique genes have been acquired or lost by convergent evolution within individual strains.

Comparative analysis of core and pan-genomes of order Nitrosomonadales

  • Lee, Jinhwan;Kim, Kyoung-Ho
    • Korean Journal of Microbiology / v.51 no.4 / pp.329-337 / 2015
  • All known genomes (N=10) in the order Nitrosomonadales were found to contain 9,808 gene clusters in their pan-genome and 908 in their core genome. Analyses with reference genomes belonging to other orders in Betaproteobacteria revealed that the sizes of the pan-genome and core genome depended on the number of genomes compared and on how different the genomes within a group were. The pan-genomes of the genera Nitrosomonas and Nitrosospira contained 7,180 and 4,586 gene clusters and their core genomes 1,092 and 1,600, respectively, which implied that genomes within Nitrosospira were more similar to one another than those within Nitrosomonas. The genomes of Nitrosomonas were the main contributors to the sizes of the pan-genome and core genome of Nitrosomonadales. COG analysis of gene clusters showed that the J category (translation, ribosomal structure and biogenesis) occupied the largest proportion (9.7-21.0%) among COG categories in core genomes, and its proportion increased in groups whose members were genetically distant. The unclassified category (-) occupied very high proportions (34-51%) in pan-genomes. Ninety-seven gene clusters existed only in Nitrosomonadales and not in reference genomes. These gene clusters contained ammonia monooxygenase genes (amoA and amoB) and related genes (amoE and amoD), which are typical genes characterizing the order Nitrosomonadales, although they also contained a significant proportion (16-45%) of unclassified genes. Thus, these exclusively conserved gene clusters might play an important role in revealing the genetic specificity of the order Nitrosomonadales.
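
The dependence of pan-genome and core-genome size on the number of genomes compared can be sketched as a running union and intersection of gene-cluster sets. This is only an illustration under simple assumptions: each genome is represented as a set of cluster IDs, and the function name and toy data are hypothetical rather than taken from the study.

```python
def accumulation(genomes):
    """Return (pan size, core size) after each genome is added in order.

    genomes: list of sets of gene-cluster IDs, one set per genome.
    The pan-genome is the running union; the core genome is the
    running intersection.
    """
    pan = set()
    core = None
    sizes = []
    for clusters in genomes:
        pan |= clusters
        core = set(clusters) if core is None else core & clusters
        sizes.append((len(pan), len(core)))
    return sizes


# Hypothetical toy genomes, each a set of cluster IDs.
toy_genomes = [
    {"a", "b", "c"},
    {"a", "b", "d"},
    {"a", "e"},
]
toy_sizes = accumulation(toy_genomes)
```

Adding a genome can only grow the union and shrink the intersection, so a group of divergent genomes tends to show a larger pan-genome and a smaller core genome than a group of similar ones.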

High Resolution Whole Genome Multilocus Sequence Typing (wgMLST) Schemes for Salmonella enterica Weltevreden Epidemiologic Investigations

  • Tadee, Pakpoom;Tadee, Phacharaporn;Hitchings, Matthew D.;Pascoe, Ben;Sheppard, Samuel K.;Patchanee, Prapas
    • Microbiology and Biotechnology Letters / v.46 no.2 / pp.162-170 / 2018
  • Non-typhoidal Salmonella is one of the main pathogens causing food-borne illness in humans, with up to 20% of cases resulting from consumption of pork products. Beyond causing gastroenteritis, multidrug-resistant Salmonella has emerged. In this study, phenotypically pan-susceptible strains of Salmonella enterica serotype Weltevreden recovered from the pig production chain in Chiang Mai, Thailand during 2012-2014 were chosen for analysis. The aim of this study was to use whole genome sequencing (WGS) data, with an emphasis on antimicrobial resistance genes, to assess the pathogenic potential of these strains, and to determine their genetic diversity by whole genome Multilocus Sequence Typing (wgMLST), in order to expand epidemiological knowledge and provide additional guidance for disease control. Analysis using ResFinder 3.0 for WGS database tracing found that one of the phenotypically pan-susceptible strains carried resistance genes of five classes: aminoglycoside, beta-lactam, phenicol, sulfonamide, and tetracycline associated genes. Twenty-four and 36 loci differences were detected by core genome Multilocus Sequence Typing (cgMLST) and pan genome Multilocus Sequence Typing (pgMLST), respectively, in two pairs of matching strains (44/13 vs A543057 and A543056 vs 204/13) initially assigned by conventional MLST and Pulsed-field Gel Electrophoresis (PFGE). One hundred percent discriminatory power can be achieved using the wgMLST technique. WGS is currently the ultimate molecular technique for various in-depth studies. As the findings above show, a new era of a "gold standard typing method" for routine genome studies is being established.
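
The "loci differences" reported for cgMLST and pgMLST can be read as the number of loci at which two strains carry different alleles. A minimal sketch of that count follows, assuming each strain is summarized as a mapping from locus name to allele number, with None marking an uncalled locus; the function name, locus names, and allele numbers are hypothetical and do not come from the study's scheme.

```python
def allelic_distance(profile_a, profile_b):
    """Count loci at which two allelic profiles differ.

    Each profile maps a locus name to an allele number, or to None when
    the locus could not be called. Only loci called in both profiles are
    compared, which is one common (assumed) way to handle missing data.
    """
    differences = 0
    for locus in profile_a.keys() & profile_b.keys():
        allele_a = profile_a[locus]
        allele_b = profile_b[locus]
        if allele_a is None or allele_b is None:
            continue  # skip loci missing in either strain
        if allele_a != allele_b:
            differences += 1
    return differences


# Hypothetical toy profiles for two strains.
strain_1 = {"locus_1": 3, "locus_2": 7, "locus_3": 1, "locus_4": None}
strain_2 = {"locus_1": 3, "locus_2": 9, "locus_3": 2, "locus_4": 5}
toy_distance = allelic_distance(strain_1, strain_2)
```

Because a pan-genome scheme compares more loci than a core-genome scheme, it can only find the same number of differences or more between the same two strains, which is consistent with the larger pgMLST count in the abstract.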

Comparative Analyses of Four Complete Genomes in Pseudomonas amygdali Revealed Differential Adaptation to Hostile Environments and Secretion Systems

  • Jung, Hyejung;Kim, Hong-Seop;Han, Gil;Park, Jungwook;Seo, Young-Su
    • The Plant Pathology Journal / v.38 no.2 / pp.167-174 / 2022
  • Pseudomonas amygdali is a hemibiotrophic phytopathogen that causes disease in woody and herbaceous plants. Complete genomes of four P. amygdali pathovars were comparatively analyzed to decipher the impact of genomic diversity on host colonization. The pan-genome indicated that 3,928 core genes are conserved among pathovars, while 504-1,009 are unique to specific pathovars. The unique genome contained many mobile elements and exhibited a functional distribution different from the core genome. Genes involved in O-antigen biosynthesis and antimicrobial peptide resistance were significantly enriched for adaptation to hostile environments. While the type III secretion system was distributed in the core genome, unique genomes revealed a different organization of secretion systems as follows: type I in pv. tabaci, type II in pv. japonicus, type IV in pv. morsprunorum, and type VI in pv. lachrymans. These findings provide genetic insight into the dynamic interactions of the bacteria with plant hosts.

Assessment of Erythrobacter Species Diversity through Pan-Genome Analysis with Newly Isolated Erythrobacter sp. 3-20A1M

  • Cho, Sang-Hyeok;Jeong, Yujin;Lee, Eunju;Ko, So-Ra;Ahn, Chi-Yong;Oh, Hee-Mock;Cho, Byung-Kwan;Cho, Suhyung
    • Journal of Microbiology and Biotechnology / v.31 no.4 / pp.601-609 / 2021
  • Erythrobacter species are extensively studied marine bacteria that produce various carotenoids. Due to their photoheterotrophic ability, it has been suggested that they play a crucial role in marine ecosystems. It is essential to identify the genome sequence and the genes of the species to predict their role in the marine ecosystem. In this study, we report the complete genome sequence of the marine bacterium Erythrobacter sp. 3-20A1M. The genome size was 3.1 Mbp and its GC content was 64.8%. In total, 2998 genetic features were annotated, of which 2882 were annotated as functional coding genes. Using the genetic information of Erythrobacter sp. 3-20A1M, we performed pan-genome analysis with other Erythrobacter species. This revealed highly conserved secondary metabolite biosynthesis-related COG functions across Erythrobacter species. Through subsequent secondary metabolite biosynthetic gene cluster prediction and KEGG analysis, the carotenoid biosynthetic pathway was proven conserved in all Erythrobacter species, except for the spheroidene and spirilloxanthin pathways, which are only found in photosynthetic Erythrobacter species. The presence of virulence genes, especially the plant-algae cell wall degrading genes, revealed that Erythrobacter sp. 3-20A1M is a potential marine plant-algae scavenger.

"The Korean Genome for Asian Health": A Commercialization Strategy of the Korean Genome Projects

  • HYUN, Jaehwan
    • Journal of Science and Technology Studies / v.19 no.2 / pp.117-167 / 2019
  • Since a working draft sequence of the human genome was published in 2001, a variety of national genome projects have been initiated in South Korea. One of the rationales for such projects is that "the Korean genome database" will be used for "the personalized medicine for Asians." By focusing on the development of human genomics in this country, this paper examines how this discourse has emerged as a strategy for commercializing the national genome. The paper argues that Korean genomicists developed this strategy under the influence of the global "genome sovereignty" policy and the local "Asian regionalist" science policy. It contributes to the literature on the "Asian" race and genomics by shedding new light on the historical formation of the Pan-Asian Single Nucleotide Polymorphism (PASNP) consortium beyond the Singaporean experience.

Identification of Ethnically Specific Genetic Variations in Pan-Asian Ethnos

  • Yang, Jin Ok;Hwang, Sohyun;Kim, Woo-Yeon;Park, Seong-Jin;Kim, Sang Cheol;Park, Kiejung;Lee, Byungwook;The HUGO Pan-Asian SNP Consortium
    • Genomics & Informatics / v.12 no.1 / pp.42-47 / 2014
  • Asian populations contain a variety of ethnic groups that have ethnically specific genetic differences. Ethnic variants may be highly relevant in disease and human differentiation studies. Here, we identified ethnically specific variants and then investigated their distribution across Asian ethnic groups. We obtained 58,960 Pan-Asian single nucleotide polymorphisms of 1,953 individuals from 72 ethnic groups of 11 Asian countries. We selected 9,306 ethnic variant single nucleotide polymorphisms (ESNPs) and 5,167 ethnic variant copy number polymorphisms (ECNPs) using the nearest shrunken centroid method. We analyzed ESNPs and ECNPs in 3 hierarchical levels: superpopulation, subpopulation, and ethnic population. We also identified ESNP- and ECNP-related genes and their features. This study represents the first attempt to identify Asian ESNP and ECNP markers, which can be used to identify genetic differences and predict disease susceptibility and drug effectiveness in Asian ethnic populations.

High-quality draft genome and characterization of commercially potent probiotic Lactobacillus strains

  • Sulthana, Ayesha;Lakshmi, Suvarna G.;Madempudi, Ratna Sudha
    • Genomics & Informatics / v.17 no.4 / pp.43.1-43.5 / 2019
  • Lactobacillus acidophilus UBLA-34, L. paracasei UBLPC-35, L. plantarum UBLP-40, and L. reuteri UBLRU-87 were isolated from different varieties of fermented foods. To determine probiotic safety at the strain level, the whole genome of each strain was sequenced, assembled, and characterized. Both the core-genome and pan-genome phylogenies showed that L. reuteri was closer to L. plantarum than to L. acidophilus, which was closest to L. paracasei. The genomic analysis of all the strains confirmed the absence of plasmids and of genes encoding putative virulence factors and antibiotic resistance.

Complete genome sequencing and comparative genomic analysis of Lactobacillus acidophilus C5 as a potential canine probiotics

  • Son, Seungwoo;Lee, Raham;Park, Seung-Moon;Lee, Sung Ho;Lee, Hak-Kyo;Kim, Yangseon;Shin, Donghyun
    • Journal of Animal Science and Technology / v.63 no.6 / pp.1411-1422 / 2021
  • Lactobacillus acidophilus is a gram-positive, microaerophilic, and acidophilic bacterial species. L. acidophilus strains in the gastrointestinal tracts of humans and other animals have been profiled, but strains found in the canine gut have not yet been studied. Our study helps in understanding the genetic features of the L. acidophilus C5 strain found in the canine gut, determining the adaptive features it evolved to survive in the canine gut environment, and elucidating its probiotic functions. To examine the canine L. acidophilus C5 genome, we isolated the C5 strain from a Korean dog and sequenced it using PacBio SMRT sequencing technology. A comparative genomic approach was used to assess genetic relationships between C5 and six other strains and to study the distinguishing features related to different hosts. We found that most genes in the C5 strain were related to carbohydrate transport and metabolism. The pan-genome of seven L. acidophilus strains contained 2,254 gene families, and the core genome contained 1,726 gene families. The phylogenetic tree of the core genes placed the canine L. acidophilus C5 strain very close to two strains (DSM20079 and NCFM) from humans. We identified 30 evolutionarily accelerated genes in the L. acidophilus C5 strain through an analysis of the ratio of non-synonymous to synonymous substitutions (dN/dS). Five of these thirty genes were associated with carbohydrate transport and metabolism. This study provides insights into the genetic features of the L. acidophilus C5 strain and its adaptations for surviving in the canine intestinal environment. It also suggests that the evolution of the L. acidophilus genome is closely related to the host's evolutionary adaptation process.