• Title/Summary/Keyword: orthologous groups

Search Result 41, Processing Time 0.023 seconds

Phylogenetic Analysis of 680 Prokaryotes by Gene Content (유전자 보유 계통수를 이용한 원핵생물 680종의 분석)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.26 no.6
    • /
    • pp.711-720
    • /
    • 2016
  • To determine the degree of common genes and the phylogenetic relationships among genome-sequenced 680 prokaryotes, the similarities among 4,631 clusters of orthologous groups of protein (COGs)’ presence/ absence and gene content trees were analyzed. The number of COGs was in the range of 103–2,199 (mean 1377.1) among 680 prokaryotes. Candidatus Nasuia deltocephalinicola str. NAS-ALF, an obligate symbiont with insects, showed the minimum COG, while Pseudomonas aeruginosa PAO1, an opportunistic pathogen, represented the maximum COG. The similarities between two prokaryotes were 49.30–99.78 % (mean 72.65%). Methanocaldococcus jannaschii DSM 2661 (hyperthermophilic and autotrophic, Euryarchaeota phylum) and Mesorhizobium loti MAFF303099 (mesophilic and symbiotic, alpha-Proteobacteria class) had the minimum amount of similarities. As gene content may represent the potential for an organism to adapt to each habitat, this may represent the history of prokaryotic evolution or the range of prokaryotic habitats at present on earth. COG content trees represented the following. First, two members of Chloroflexi phylum (Dehalogenimonas lykanthroporepellens BL-DC-9 and Dehalococcoides mccartyi 195) showed a greater relationship with Archaea than other Eubacteria. Second, members of the same phylum or class in the 16S rRNA gene were separated in the COG content tree. Finally, delta- and epsilon-Proteobacteria were in different lineages with other Proteobacteria classes in neighbor-joining (NJ) and maximum likelihood (ML) trees. The results of this study would be valuable to identifying the origins of organisms, functional relationships, and useful genes.

Conservative Genes among 1,309 Species of Prokaryotes (원핵생물 1,309종의 보존적 유전자)

  • Lee, Dong-Geun
    • Journal of Life Science
    • /
    • v.32 no.6
    • /
    • pp.463-467
    • /
    • 2022
  • As a result of applying the COG (Cluster of Orthologous Groups of Protein) algorithm to 1,309 species to confirm the conserved genes of prokaryotes, ribosomal protein S11 (COG0100) was identified. The numbers of conservative genes were 2, 5, 5, and 6 in 1,308, 1,307, 1,306, and 1,305 species, respectively. Twenty-nine genes were conserved in over 1,302 species, and they encoded 23 ribosomal proteins, 3 tRNA synthetases, 2 translation factors, and 1 RNA polymerase subunit. Most of them were related to protein production, suggesting the importance of protein expression in prokaryotes. The highest conservative COG was COG0048 (ribosomal protein S12) among the 29 COGs. The 29 conserved genes usually have one protein for each prokaryote. COG0090 (ribosomal protein L2) had not only the lowest conservation value but also the largest standard deviation of phylogenetic distance value. As COG0090 is not only a member of the ribosome, but also a regulator of replication and transcription, it could be inferred that prokaryotes have large variations in COG0090 to survive in various environments. This study could provide data necessary for basic science, tumor control, and development of antibacterial agents.

Draft Genome Assembly and Annotation for Cutaneotrichosporon dermatis NICC30027, an Oleaginous Yeast Capable of Simultaneous Glucose and Xylose Assimilation

  • Wang, Laiyou;Guo, Shuxian;Zeng, Bo;Wang, Shanshan;Chen, Yan;Cheng, Shuang;Liu, Bingbing;Wang, Chunyan;Wang, Yu;Meng, Qingshan
    • Mycobiology
    • /
    • v.50 no.1
    • /
    • pp.66-78
    • /
    • 2022
  • The identification of oleaginous yeast species capable of simultaneously utilizing xylose and glucose as substrates to generate value-added biological products is an area of key economic interest. We have previously demonstrated that the Cutaneotrichosporon dermatis NICC30027 yeast strain is capable of simultaneously assimilating both xylose and glucose, resulting in considerable lipid accumulation. However, as no high-quality genome sequencing data or associated annotations for this strain are available at present, it remains challenging to study the metabolic mechanisms underlying this phenotype. Herein, we report a 39,305,439 bp draft genome assembly for C. dermatis NICC30027 comprised of 37 scaffolds, with 60.15% GC content. Within this genome, we identified 524 tRNAs, 142 sRNAs, 53 miRNAs, 28 snRNAs, and eight rRNA clusters. Moreover, repeat sequences totaling 1,032,129 bp in length were identified (2.63% of the genome), as were 14,238 unigenes that were 1,789.35 bp in length on average (64.82% of the genome). The NCBI non-redundant protein sequences (NR) database was employed to successfully annotate 11,795 of these unigenes, while 3,621 and 11,902 were annotated with the Swiss-Prot and TrEMBL databases, respectively. Unigenes were additionally subjected to pathway enrichment analyses using the Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Cluster of Orthologous Groups of proteins (COG), Clusters of orthologous groups for eukaryotic complete genomes (KOG), and Non-supervised Orthologous Groups (eggNOG) databases. Together, these results provide a foundation for future studies aimed at clarifying the mechanistic basis for the ability of C. dermatis NICC30027 to simultaneously utilize glucose and xylose to synthesize lipids.

Comparative Genome Analysis Reveals Natural Variations in the Genomes of Erwinia pyrifoliae, a Black Shoot Blight Pathogen in Apple and Pear

  • Lee, Gyu Min;Ko, Seyoung;Oh, Eom-Ji;Song, Yu-Rim;Kim, Donghyuk;Oh, Chang-Sik
    • The Plant Pathology Journal
    • /
    • v.36 no.5
    • /
    • pp.428-439
    • /
    • 2020
  • Erwinia pyrifoliae is a Gram-negative bacterial plant pathogen that causes black shoot blight in apple and pear. Although earlier studies reported the genome comparison of Erwinia species, E. pyrifoliae strains for such analysis were isolated in 1996. In 2014, the strain E. pyrifoliae EpK1/15 was newly isolated in the apple tree showing black shoot blight in South Korea. This study aimed to better understand the similarities and differences caused by natural variations at the genomic level between newly isolated E. pyrifoliae EpK1/15 and the strain Ep1/96, which were isolated almost 20 years apart. Several comparative genomic analyses were conducted, and Clusters of Orthologous Groups of proteins (COG) database was used to classify functional annotation for each strain. E. pyrifoliae EpK1/15 had similarities with the Ep1/96 strain in stress-related genes, Tn3 transposase of insertion sequences, type III secretion systems, and small RNAs. The most remarkable difference to emerge from this comparison was that although the draft genome of E. pyrifoliae EpK1/15 was almost conserved, Epk1/15 strain had at least three sorts of structural variations in functional annotation according to COG database; chromosome inversion, translocation, and duplication. These results indicate that E. pyrifoliae species has gone natural variations within almost 20 years at the genomic level, and we can trace their similarities and differences with comparative genomic analysis.

Analysis of Conservative Genes in Thermophilic and Hyperthermophilic Bacteria (고온성과 초고온성 세균의 보존적 유전자 분석)

  • Lee Dong-Geun;Lee Jae-Hwa;Ha Bae Jin;Ha Jong-Myung;Lee Jung-Hyun;Kim Sang-Jin;Lee Sang Hyeon
    • KSBB Journal
    • /
    • v.20 no.5 s.94
    • /
    • pp.387-391
    • /
    • 2005
  • Totally 16,299 conservative genes, commonly found in 13 thermophilic and hyperthermophilic bacteria, were analyzed. All genes were belong to W 67 COGs (clusters of orthologous groups of proteins). COGs related to protein metabolism were 80 among 167 COGs. Conservative genes were not limited only thermophiles and hyperthermophiles, meaning thermal stability is independent of specific protein. However reverse gyrase was only found in all hyperthermophilic archaebacteria and eubacteria, meaning DNA stability is important in hyperthermophiles. Hyperthermophilic eubacteria and thermophilic archaebacteria had different position between phylogenetic tree of gene content and 165 rRNA gene. Thermophilic archaebacteria hyperthermophilic eubacteria and archaebacteria had similar values by the statistical analysis of distance values with 167 COGs in each organism.

Investigation of Conservative Genes in 168 Archaebacterial Strains (168개 고세균 균주들의 보존적 유전자에 관한 연구)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.30 no.9
    • /
    • pp.813-818
    • /
    • 2020
  • The archaeal clusters of orthologous genes (arCOG) algorithm, which identifies common genes among archaebacterial genomes, was used to identify conservative genes among 168 archaebacterial strains. The numbers of conserved orthologs were 14, 10, 9, and 8 arCOGs in 168, 167, 166, and 165 strains, respectively. Among 41 conserved arCOGs, 13 were related to function J (translation, ribosomal structure, and biogenesis), and 10 were related to function L (replication, recombination, and repair). Among the 14 conserved arCOGs in all 168 strains, 6 arCOGs of tRNA synthetase comprised the highest proportion. Of the remaining 8 arCOGs, 2 are involved in reactions with ribosomes, 2 for tRNA synthesis, 2 for DNA replication, and 2 for transcription. These results showed the importance of protein expression in archaea. For the classes or orders having 3 or more members, genomic analysis was performed by averaging the distance values of the conservative arCOGs. Classes Archaeoglobi and Thermoplasmata of the phylum Euryarchaeota showed the lowest and the highest average of distance value, respectively. This study can provides data necessary for basic scientific research and the development of antibacterial agents and tumor control.

Metabolic Pathways of 1309 Prokaryotic Species in Relation to COGs (COG pathways에서 원핵생물 1,309종의 대사경로)

  • Lee, Dong-Geun;Kim, Ju-Hui;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.32 no.3
    • /
    • pp.249-255
    • /
    • 2022
  • Metabolism is essential for survival and reproduction, and there is a metabolic pathways entry in the clusters of orthologous groups of proteins (COGs) database, updated in 2020. In this study, the metabolic pathways of 1309 prokaryotes were analyzed using COGs. There were 822 COGs associated with 63 metabolic pathways, and the mean for each taxon was between 200.50 (mollicutes) and 527.07 (cyanobacteria) COGs. The metabolic pathway composition ratio (MPCR) was defined as the number of COGs present in one genome in relation to the total number of COGs constituting each metabolic pathway, and the number of pathways with 100% MPCR ranged from 0 to 26 in each prokaryote. Among 1309 species, the 100% MPCR pathways included murein biosynthesis associated with cell wall synthesis (922 species); glycine cleavage (918); and ribosomal 30S subunit synthesis (903). The metabolic pathways with 0% MPCR were those involving photosystem I (1263 species); archaea/vacuolar-type ATP synthase (1028); and Na+-translocation NADH dehydrogenase (976). Depending on the prokaryote, three to 49 metabolic pathways could not be performed at all. The sequence of most highly conserved metabolic pathways was ribosome 30S subunit synthesis (96.1% of 1309 species); murein biosynthesis (86.8%); arginine biosynthesis (80.4%); serine biosynthesis (80.3%); and aminoacyl-tRNA synthesis (82.2%). Protein and cell wall synthesis have been shown to be important metabolic pathways in prokaryotes, and the results of this study of COGs related to such pathways can be utilized in, for example, the development of antibiotics and artificial cells.

Comparison of Expression Profiles between Trophozoite and Cyst of Acanthamoeba castellanii

  • Moon, Eun-Kyung;Kong, Hyun-Hee
    • Biomedical Science Letters
    • /
    • v.18 no.3
    • /
    • pp.313-318
    • /
    • 2012
  • Acanthamoeba is an opportunistic pathogen known to cause granulomatous amoebic encephalitis and amebic keratitis. Acanthamoeba exhibits life cycle consisting of trophozoite and cyst, and the cyst is highly resistant to variable antibiotics and therapeutic agents. To understand the encystation mechanism of Acanthamoeba, the expression profiles of trophozoite and cyst were compared by gene ontology (GO) analysis. Ribosomal proteins and cytoskeletal proteins were highly expressed in trophozoite. In cyst, various protease, and signal transduction - and protein turnover - related proteins were highly expressed. These results correlated with eukaryotic orthologous groups (KOG) assignment and microarray analysis of Acanthamoeba trophozoite and cyst ESTs. The information of differential expression profiles of trophozoite and cyst would provide important clues for research on encystation mechanism of cyst forming protozoa including Acanthamoeba.

Comparative Genome Analysis of Sphingomonas chungbukensis DJ77

  • Hai Dang Sy;Kim Young-Pil;Choi Bum-Sun;Um Hyun-Ju;Kim Young-Chang
    • Proceedings of the Microbiological Society of Korea Conference
    • /
    • 2002.10a
    • /
    • pp.175-179
    • /
    • 2002
  • The assemblies of our partial genomic sequence data of Sphingomonas chungbukensis DJ77, with the total size of 877,928 bp, was done by TIGR Assembler. The total size of our current obtained contigs was about 0.73 Mb. A comparative genome analysis between our uncompleted genome and the other completed genomes was performed by taking advantage of the availability of multiple complete genomes in COGs database (Clusters of Orthologous Groups of proteins) to produce the genomic prediction of our S. chungbukensis DJ77. This analysis based on homologues search among completed genomes provides good initial step to our better assigning putative function to predicted coding sequences.

  • PDF

Genome-Wide Comparison of Carbohydrate-Active Enzymes (CAZymes) Repertoire of Flammulina ononidis

  • Park, Young-Jin;Kong, Won-Sik
    • Mycobiology
    • /
    • v.46 no.4
    • /
    • pp.349-360
    • /
    • 2018
  • Whole-genome sequencing of Flammulina ononidis, a wood-rotting basidiomycete, was performed to identify genes associated with carbohydrate-active enzymes (CAZymes). A total of 12,586 gene structures with an average length of 2009 bp were predicted by the AUGUSTUS tool from a total 35,524,258 bp length of de novo genome assembly (49.76% GC). Orthologous analysis with other fungal species revealed that 7051 groups contained at least one F. ononidis gene. In addition, 11,252 (89.5%) of 12,586 genes for F. ononidis proteins had orthologs among the Dikarya, and F. ononidis contained 8 species-specific genes, of which 5 genes were paralogous. CAZyme prediction revealed 524 CAZyme genes, including 228 for glycoside hydrolases, 21 for polysaccharide lyases, 87 for glycosyltransferases, 61 for carbohydrate esterases, 87 with auxiliary activities, and 40 for carbohydrate-binding modules in the F. ononidis genome. This genome information including CAZyme repertoire will be useful to understand lignocellulolytic machinery of this white rot fungus F. ononidis.