• Title/Summary/Keyword: core-genome

Comparative Genomics Reveals the Core and Accessory Genomes of Streptomyces Species

  • Kim, Ji-Nu;Kim, Yeonbum;Jeong, Yujin;Roe, Jung-Hye;Kim, Byung-Gee;Cho, Byung-Kwan
    • Journal of Microbiology and Biotechnology / v.25 no.10 / pp.1599-1605 / 2015
  • The development of rapid and efficient genome sequencing methods has enabled us to study the evolutionary background of bacterial genetic information. Here, we present comparative genomic analysis of 17 Streptomyces species, for which the genome has been completely sequenced, using the pan-genome approach. The analysis revealed that 34,592 ortholog clusters constituted the pan-genome of these Streptomyces species, including 2,018 in the core genome, 11,743 in the dispensable genome, and 20,831 in the unique genome. The core genome was converged to a smaller number of genes than reported previously, with 3,096 gene families. Functional enrichment analysis showed that genes involved in transcription were most abundant in the Streptomyces pan-genome. Finally, we investigated core genes for the sigma factors, mycothiol biosynthesis pathway, and secondary metabolism pathways; our data showed that many genes involved in stress response and morphological differentiation were commonly expressed in Streptomyces species. Elucidation of the core genome offers a basis for understanding the functional evolution of Streptomyces species and provides insights into target selection for the construction of industrial strains.

HiCORE: Hi-C Analysis for Identification of Core Chromatin Looping Regions with Higher Resolution

  • Lee, Hongwoo;Seo, Pil Joon
    • Molecules and Cells / v.44 no.12 / pp.883-892 / 2021
  • Genome-wide chromosome conformation capture (3C)-based high-throughput sequencing (Hi-C) has enabled identification of genome-wide chromatin loops. Because the Hi-C map with restriction fragment resolution is intrinsically associated with sparsity and stochastic noise, Hi-C data are usually binned at particular intervals; however, the binning method has limited reliability, especially at high resolution. Here, we describe a new method called HiCORE, which provides simple pipelines and algorithms to overcome the limitations of single-layered binning and predict core chromatin regions with three-dimensional physical interactions. In this approach, multiple layers of binning with slightly shifted genome coverage are generated, and interacting bins at each layer are integrated to infer narrower regions of chromatin interactions. HiCORE predicts chromatin looping regions with higher resolution, both in human and Arabidopsis genomes, and contributes to the identification of the precise positions of potential genomic elements in an unbiased manner.
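The multi-layer binning idea described in this abstract can be illustrated with a small sketch. This is not the HiCORE implementation: it only assumes that each layer uses the same bin size with a slightly shifted start, and that the bin called as interacting in every layer is the one containing a hypothetical true anchor position. The names `bin_containing`, `intersect`, `bin_size`, `n_layers`, and `anchor` are illustrative assumptions, not taken from the paper.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open genomic interval [start, end)


def bin_containing(pos: int, bin_size: int, offset: int) -> Interval:
    """Return the bin that contains `pos` in a layer whose bins start at `offset`."""
    start = ((pos - offset) // bin_size) * bin_size + offset
    return (start, start + bin_size)


def intersect(intervals: List[Interval]) -> Interval:
    """Overlap shared by all intervals; raises if they do not all overlap."""
    start = max(s for s, _ in intervals)
    end = min(e for _, e in intervals)
    if start >= end:
        raise ValueError("intervals share no common region")
    return (start, end)


# Toy setup (all values hypothetical): 5 kb bins, 5 layers shifted by 1 kb each.
bin_size = 5000
n_layers = 5
offsets = [i * bin_size // n_layers for i in range(n_layers)]

anchor = 12_300  # assumed true position of a loop anchor
layer_bins = [bin_containing(anchor, bin_size, off) for off in offsets]

# Integrating the layers narrows the 5 kb bins to their shared overlap.
core_region = intersect(layer_bins)
print(layer_bins)
print(core_region)
```

In this toy case the five overlapping 5 kb bins share only a 1 kb window around the anchor, which is the sense in which shifted layers can yield narrower regions than any single binning.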

Pan-Genome Analysis Reveals Origin Specific Genome Expansion in Enterococcus mundtii Strains

  • Neeti Pandey;Raman Rajagopal;Shubham Dhara
    • Microbiology and Biotechnology Letters / v.52 no.2 / pp.163-178 / 2024
  • Pan-genome analysis is used to interpret genome heterogeneity and diversification of bacterial species. Here, we present a pan-genome analysis of 22 strains of Enterococcus mundtii. GenBank files of E. mundtii strains isolated from different sources (human fecal matter, soil, leaf, dairy products, and insects) were downloaded from the National Center for Biotechnology Information (NCBI) database and analyzed using the BPGA-1.3.0 (Bacterial Pan Genome Analysis) pipeline. Out of a total of 4,503 gene families, 1,843 belong to the core genome, whereas 1,762 gene families represent accessory genes and 898 gene families are unique genes among the selected genomes. The majority of the core genes belong to the categories Metabolism (37.83%) and Information storage & processing (29.84%), whereas unique genes belong mainly to the category Information storage & processing (48.08%). Accessory genes are almost equally represented in the two functional categories, Information storage & processing and Metabolism (34.34% and 32.27%, respectively). Further, subset analysis based on the origin of the isolates shows the presence and absence of exclusive gene families. These observations suggest that even closely related strains of a species show extensive genomic disparity owing to their ability to adapt to a specific environment.
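The core / accessory / unique split reported above can be sketched from a gene-family presence table. This is a minimal illustration of the usual definitions (core: present in every genome; unique: present in exactly one; accessory: everything in between), not the BPGA pipeline itself; the function name `partition_pan_genome` and the toy families and strains are hypothetical.

```python
from typing import Dict, Set


def partition_pan_genome(
    presence: Dict[str, Set[str]], n_genomes: int
) -> Dict[str, Set[str]]:
    """Split gene families by how many genomes carry them.

    `presence` maps a gene-family ID to the set of genomes containing it.
    """
    parts: Dict[str, Set[str]] = {"core": set(), "accessory": set(), "unique": set()}
    for family, genomes in presence.items():
        k = len(genomes)
        if k == n_genomes:
            parts["core"].add(family)       # present in every genome
        elif k == 1:
            parts["unique"].add(family)     # present in exactly one genome
        elif k > 1:
            parts["accessory"].add(family)  # present in some, but not all
    return parts


# Toy input (hypothetical families and strains).
toy = {
    "famA": {"s1", "s2", "s3"},
    "famB": {"s1", "s2"},
    "famC": {"s3"},
    "famD": {"s1", "s2", "s3"},
}
parts = partition_pan_genome(toy, n_genomes=3)
counts = {name: len(members) for name, members in parts.items()}
print(counts)

# The three partitions always sum to the pan-genome size; the abstract's
# figures satisfy the same identity: 1,843 + 1,762 + 898 = 4,503.
abstract_total = 1843 + 1762 + 898
print(abstract_total)
```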

Interaction of Hepatitis C Virus Core Protein with Janus Kinase Is Required for Efficient Production of Infectious Viruses

  • Lee, Choongho
    • Biomolecules & Therapeutics / v.21 no.2 / pp.97-106 / 2013
  • Chronic hepatitis C virus (HCV) infection is responsible for the development of liver cirrhosis and hepatocellular carcinoma. The HCV core protein plays not only a structural role in virion morphogenesis by encapsidating the viral RNA genome but also a non-structural role in HCV-induced pathogenesis by blocking innate immunity. In particular, it has been shown to regulate the JAK-STAT signaling pathway through direct interaction with Janus kinase (JAK) via its proline-rich JAK-binding motif ($^{79}{\underline{P}}GY{\underline{P}}WP^{84}$). However, little is known about the physiological significance of this HCV core-JAK association in the context of the virus life cycle. To gain insight, a mutant HCV genome (J6/JFH1-79A82A) was constructed to express a mutant core with a defective JAK-binding motif ($^{79}{\underline{A}}GY{\underline{A}}WP^{84}$) using an HCV genotype 2a infectious clone (J6/JFH1). When this mutant HCV genome was introduced into hepatocarcinoma cells, it was severely impaired in its ability to produce infectious viruses despite robust RNA genome replication. Taken together, these results suggest that the HCV core-JAK interaction is essential for efficient production of infectious viruses and point to the potential of core-JAK blockers as a new anti-HCV therapy.

Comparative analysis of core and pan-genomes of order Nitrosomonadales

  • Lee, Jinhwan;Kim, Kyoung-Ho
    • Korean Journal of Microbiology / v.51 no.4 / pp.329-337 / 2015
  • Analysis of all known genomes (N=10) in the order Nitrosomonadales showed that their pan-genome and core genome contained 9,808 and 908 gene clusters, respectively. Analyses with reference genomes belonging to other orders in Betaproteobacteria revealed that the sizes of the pan-genome and core genome depended on the number of genomes compared and on the differences among genomes within a group. The pan-genome sizes of the genera Nitrosomonas and Nitrosospira were 7,180 and 4,586 and the core-genome sizes were 1,092 and 1,600, respectively, which implied that genomes within Nitrosospira were more similar to one another than those within Nitrosomonas. The genomes of Nitrosomonas contributed most to the sizes of the pan-genome and core genome of Nitrosomonadales. COG analysis of gene clusters showed that the J category (translation, ribosomal structure and biogenesis) occupied the largest proportion (9.7-21.0%) among COG categories in core genomes, and its proportion increased in groups whose members were separated by large genetic distances. The unclassified category (-) occupied very high proportions (34-51%) in pan-genomes. Ninety-seven gene clusters existed only in Nitrosomonadales and not in the reference genomes. These gene clusters contained ammonia monooxygenase genes (amoA and amoB) and related genes (amoE and amoD), which are typical genes characterizing the order Nitrosomonadales, although they also contained a significant proportion (16-45%) of unclassified genes. Thus, these exclusively conserved gene clusters might play an important role in revealing the genetic specificity of the order Nitrosomonadales.

Comparative Analyses of Four Complete Genomes in Pseudomonas amygdali Revealed Differential Adaptation to Hostile Environments and Secretion Systems

  • Jung, Hyejung;Kim, Hong-Seop;Han, Gil;Park, Jungwook;Seo, Young-Su
    • The Plant Pathology Journal / v.38 no.2 / pp.167-174 / 2022
  • Pseudomonas amygdali is a hemibiotrophic phytopathogen that causes disease in woody and herbaceous plants. Complete genomes of four P. amygdali pathovars were comparatively analyzed to decipher the impact of genomic diversity on host colonization. The pan-genome indicated that 3,928 core genes are conserved among pathovars, while 504-1,009 are unique to specific pathovars. The unique genome contained many mobile elements and exhibited a functional distribution different from the core genome. Genes involved in O-antigen biosynthesis and antimicrobial peptide resistance were significantly enriched for adaptation to hostile environments. While the type III secretion system was distributed in the core genome, unique genomes revealed a different organization of secretion systems as follows: type I in pv. tabaci, type II in pv. japonicus, type IV in pv. morsprunorum, and type VI in pv. lachrymans. These findings provide genetic insight into the dynamic interactions of the bacteria with plant hosts.

Identification of SNPs Related to 19 Phenotypic Traits Using Genome-wide Association Study (GWAS) Approach in Korean Wheat Mini-core Collection

  • Yuna Kang;Yeonjun Sung;Seonghyeon Kim;Changsoo Kim
    • Proceedings of the Korean Society of Crop Science Conference / 2020.06a / pp.120-120 / 2020
  • Based on simple sequence repeat (SSR) markers, a Korean wheat core collection was established with 616 wheat accessions. For these accessions, genome-wide SNP genotyping was performed using a DNA chip array to clarify the whole-genome SNP profiles. Consequently, a total of 35,143 SNPs were found and we re-established a mini-core collection with 247 accessions. Population diversity and phylogenetic analysis revealed genetic diversity and relationships within the mini-core set. In addition, a genome-wide association study (GWAS) was performed on 19 phenotypic traits: ear type, awn length, culm length, ear length, awn color, seed coat color, culm color, ear color, loading, leaf length, leaf width, seeding stand, cold damage, weight, auricle, plant type, heading stage, maturation period, upright habit, and degree of flag leaf. The GWAS was performed using the fixed and random model circulating probability unification (FarmCPU) method, which identified 14 to 258 SNP loci related to the 19 phenotypic traits. Our study indicates that this Korean wheat mini-core collection is a set of germplasm useful for basic and applied research aimed at understanding and exploiting the genetic diversity of Korean wheat varieties.

Comparative Analysis of Completely Sequenced Insect Mitochondrial Genomes

  • Lee, Jin-Sung;Kim, Ki-Hwan;Suh, Dong-Sang;Park, Jae-Heung;Suh, Ji-Yoeun;Chung, Kyu-Hoi;Hwang, Jae-Sam
    • International Journal of Industrial Entomology and Biomaterials / v.2 no.1 / pp.1-6 / 2001
  • This paper reports several characteristics of seven completely sequenced insect mitochondrial genomes (Bombyx mori, Drosophila melanogaster, D. yakuba, Apis mellifera, Anopheles gambiae, A. quadrimaculatus, and Locusta migratoria). Comparative analysis of the complete mt genome sequences from these species revealed a number of interesting features of the insect mitochondrial genome, including base composition, gene content, the A+T-rich region, and gene arrangement. The properties revealed by our work shed new light on the organization and evolution of the insect mitochondrial genome and, more importantly, open the way to targeted experimental studies for understanding the critical roles of regulatory mechanisms (transcription and translation) in mitochondrial gene expression.

Conserved Genes and Metabolic Pathways in Prokaryotes of the Same Genus

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science / v.29 no.1 / pp.123-128 / 2019
  • The use of 16S rDNA is commonplace in the determination of prokaryotic species. However, it has limitations, and there are few studies at the genus level. We investigated conserved genes and metabolic pathways at the genus level in 28 strains of 13 genera of prokaryotes using the COG database (conserved genes) and the MetaCyc database (metabolic pathways). The ratio of conserved genes to total genes (core genome) at the genus level ranged from 27.62% (Nostoc genus) to 71.76% (Spiribacter genus), with an average of 46.72%. A lower core-genome ratio means a higher ratio of genes peculiar to each prokaryote, suggesting that specific biological activities or habitats may vary. The ratio of common metabolic pathways at the genus level was higher than the ratio of core genomes, ranging from 58.79% (Clostridium genus) to 96.31% (Mycoplasma genus), with an average of 75.86%. When compared with other genera, members of the same genus were positioned in the closest nodes to each other. Interestingly, the Bacillus and Clostridium genera were positioned in closer nodes than the other genera. Archaebacterial genera were grouped together in the ortholog and metabolic pathway nodes in a phylogenetic tree. The genera Granulicella, Nostoc, and Bradyrhizobium of the Acidobacteria, Cyanobacteria, and Proteobacteria phyla, respectively, were grouped in an ortholog content tree. The results of this study can be used for (i) the identification of common genes and metabolic pathways at each phylogenetic level and (ii) the improvement of strains through horizontal gene transfer or site-directed mutagenesis.
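The core-genome ratio discussed above (conserved genes relative to total genes) can be sketched as a set intersection over per-strain ortholog sets. This is only an illustration under stated assumptions: the abstract does not say exactly how the authors normalized the ratio, so the sketch reports it per strain, and the strain names and ortholog IDs are placeholders.

```python
from typing import Dict, Set


def core_genome_ratios(strain_orthologs: Dict[str, Set[str]]) -> Dict[str, float]:
    """Percentage of each strain's ortholog groups shared by all strains."""
    if not strain_orthologs:
        raise ValueError("need at least one strain")
    # Conserved (core) groups: those found in every strain of the genus.
    core = set.intersection(*strain_orthologs.values())
    return {
        strain: 100.0 * len(core) / len(groups)
        for strain, groups in strain_orthologs.items()
        if groups  # skip strains with no annotated groups
    }


# Toy genus with two strains (placeholder ortholog IDs).
genus = {
    "strainA": {"og1", "og2", "og3", "og4"},
    "strainB": {"og1", "og2", "og5"},
}
ratios = core_genome_ratios(genus)
for strain, pct in sorted(ratios.items()):
    print(f"{strain}: {pct:.2f}% of ortholog groups are conserved")
```

A strain with many groups outside the shared set gets a lower ratio, which matches the abstract's reading that a low core-genome ratio reflects a larger share of strain-specific genes.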

Complete Genome Sequence of the Enterobacter asburiae IK3 Isolated from a Soybean (Glycine max) Rhizosphere

  • Sihyun Park;GyuDae Lee;Ikwhan Kim;Yeongyu Jeong;Jae-Ho Shin
    • Microbiology and Biotechnology Letters / v.51 no.3 / pp.306-308 / 2023
  • This research presents the whole-genome sequence of Enterobacter asburiae strain IK3, which was isolated from the rhizosphere soil of soybean (Glycine max). The genome consists of a single chromosome and 4 plasmids, with a total size of 5,084,040 bp and a GC content of 55.5%.