• Title/Summary/Keyword: orthologous groups

Search Result 41, Processing Time 0.024 seconds

Investigation of Conservative Genes in 711 Prokaryotes (원핵생물 711종의 보존적 유전자 탐색)

  • Lee, Dong-Geun;Lee, Sang-Hyeon
    • Journal of Life Science
    • /
    • v.25 no.9
    • /
    • pp.1007-1013
    • /
    • 2015
  • A COG (Cluster of Orthologous Groups of proteins) algorithm was applied to detect conserved genes in 711 prokaryotes. Only COG0080 (ribosomal protein L11) was common among all the 711 prokaryotes analyzed and 58 COGs were common in more than 700 prokaryotes. Nine COGs among 58, including COG0197 (endonuclease III) and COG0088 (ribosomal protein L4), were conserved in a form of one gene per one organism. COG0008 represented 1356 genes in 709 of the prokaryotes and this was the highest number of genes among 58 COGs. Twenty-two COGs were conserved in more than 708 prokaryotes. Of these, two were transcription related, four were tRNA synthetases, eight were large ribosomal subunits, seven were small ribosomal subunits, and one was translation elongation factor. Among 58 conserved COGs in more than 700 prokaryotes, 50 (86.2%) were translation related, and four (6.9%) were transcription related, pointing to the importance of protein-synthesis in prokaryotes. Among these 58 COGs, the most conserved COG was COG0060 (isoleucyl tRNA synthetase), and the least conserved was COG0143 (methionyl tRNA synthetase). Archaea and eubacteria were discriminated in the genomic analysis by the average distance and variation in distance of common COGs. The identification of these conserved genes could be useful in basic and applied research, such as antibiotic development and cancer therapeutics.

Investigation of Conserved Regions in Lipase Genes (Lipase 유전자의 보존적 영역 탐색)

  • 이동근;김철민;김상진;이상현;이재화
    • Journal of Life Science
    • /
    • v.13 no.5
    • /
    • pp.723-731
    • /
    • 2003
  • For the investigation of conserved regions in lipase genes, 132 and 24 sequences were obtained from LED (Lipase Engineering Database) and COG (Clusters of Orthologous Groups of proteins), respectively. There was high diversity in lipase genes and peculiar amino acid sequences were conserved for each homologous family of LED. Similar conserved amino acid sequences were detected from COG0657 and Moraxella lipase 1 homologous group of LED. Although many studies have attempted to detect new lipase genes in procaryotes, they have been limited culturable bacteria. The importance of metagenome, including DNA from non-culturable bacteria, is known. Due to the high diversity, we assumed it might be possible to detect new lipase gene from metagenome. Due to the high diversity of nucleotide sequences in lipase genes, 10 primer sets were designed. Designed primer sets were inspected in BLAST of NCBI and they could amplify a part of the lipase gene from 222 to 713 bp. They can amplify 16.7%∼60.0% of each lipase homologous group which was 3.6 fold higher than each sets. They might offer a high probability of detecting new lipase genes, owing to high efficiency and the diversity of lipase genes.

Conserved COG Pathways and Genes of 122 Species of Archaea (고세균 122종의 보존적 COG pathways와 유전자)

  • Dong-Geun Lee ;Sang-Hyeon Lee
    • Journal of Life Science
    • /
    • v.33 no.11
    • /
    • pp.944-949
    • /
    • 2023
  • The purpose of this study was to identify conserved metabolic pathways and conserved genes in 122 archaeal species. Using the Clusters of Orthologous Groups of Proteins (COG) database of conserved genes, we analyzed whether 122 species had 63 COG metabolic pathways, the 822 COGs that compose them, and a total of 4,877 COGs. Archaeal ribosomal proteins were the most conserved in metabolic pathways. 46 COGs in seven COG pathways among 63 COG pathways and 20 COGs in others were conserved in 122 species. Some genes involved in cell wall and extracellular matrix synthesis, replication, transcription, translation, and protein metabolism were common to all 122 species. When the distance value of the phylogenetic tree was analyzed at the phylum level or class level, the average was the lowest at the class Halobacteria of the phylum Euryarchaeota. Standard deviation was high for the class Nitosospharia of the phylum Thaumarchaeota, the unclassified members of phylum Thaumarchaeota, the class Halobacteria of the phylum Euryarchaeota, the class Thermoprotei of the phylum Crenarchaeota, and other archaea. Furthermore, the phylogenetic tree analysis revealed six commonalities. The results of this study, along with data on conserved genes, could be used for drug development and gene selection for strain improvement.

유전자보유 계통수를 이용한 Archaea와 Proteobacteria 분류

  • Lee, Dong-Geun;Lee, Jin-Ok;Lee, Jae-Hwa
    • 한국생물공학회:학술대회논문집
    • /
    • 2003.04a
    • /
    • pp.686-689
    • /
    • 2003
  • A Gene content phylogenetic tree and a 16S rRNA based phylogenetic tree were compared for 9 Archaea and 15 Proteobacteria, whole-genome sequenced, by neighbor joining and bootstrap methods (n=1000). Ratio of conserved COG (clusters of orthologous groups of proteins) to ortholog revealed that they were within the range of 4.60% (Mezorhizobium loti) or 56.57% (Mycoplasma genitalium), The diversity of ratio meant the Possibility of searching for useful genes, as they possess peculiar genes. The gene content tree and the 16S rDNA tree showed coincidence and discordance in Archaea and Proteobacteria.

  • PDF

Prediction of Genes Lacking in an Ammonia Oxidizing Archaeon for Independent Growth (암모니아 산화 고세균의 독립성장에 필요한 결손 유전자 예측)

  • Han, Sang-Soo;Lee, Jin-Young;Rhee, Sung-Keun;Kim, Geun-Joong
    • KSBB Journal
    • /
    • v.26 no.3
    • /
    • pp.237-242
    • /
    • 2011
  • As a number of archaea are ubiquitously found in non-extreme habitats, elucidation of their functional roles becomes currently an emerging issue. However, most of them are unable to grow in pure culture and so it remains to be established. In order to find genes lacking in the genome of an ammonia-oxidizing archaeon (AOA), we here report on the comparative analyses of an AOA genome with those of experimentally or theoretically established minimal genomes for independent growth. We assessed the genes lacking in AOA using logic of clusters of orthologous groups (COG), remote homology, consensus sequence weight matrix, function-based motif or domain, and then further excluded genes encoding hypothetical orarchaea-specific proteins. The results of these combination analyses revealed 19 candidate genes lacking in the genome of an AOA. Thus, our results provide a possibility of inducing independent growth of AOA when supplemented with product (s) of the lacking gene (s), and also give a chance for finding new proteins with novel sequence or structure space even if the predicted lacking-genes will be found using another algorithms or biochemical studies.

Mercury Resistance and Removal Mechanisms of Pseudomonas sp. Isolated Mercury-contaminated Site in Taiwan

  • Luo, Kai-Hong;Chen, Ssu-Ching;Liao, Hung-Yu
    • Journal of Soil and Groundwater Environment
    • /
    • v.21 no.5
    • /
    • pp.16-24
    • /
    • 2016
  • A new strain of Pseudomonas sp. was isolated from mercury (Hg)-contaminated sites in Taiwan. This bacterium removed more than 80% of Hg present in the culture medium at 12 h incubation and was chosen for further analysis of the molecular mechanisms of Hg tolerance/removal abilities in this Pseudomonas sp. We used RNA-seq, one of the next-generation sequencing methods, to investigate the transcriptomic responses of the Pseudomonas sp. exposed to 60 mg/L of Hg2+. We de novo assembled 4,963 contigs, of which 10,533 up-regulated genes and 5,451 down-regulated genes were found to be regulated by Hg. The 40 genes most altered in expression levels were associated with tolerance to Hg stress and metabolism. Functional analysis showed that some Hg-tolerant genes were related to the mer operon, sulfate uptake and assimilation, the enzymatic antioxidant system, the HSP gene family, chaperones, and metal transporters. The transcriptome were analyzed further with Gene Ontology (GO) and Cluster of Orthologous Groups (COGs) of proteins and showed diverse biological functions and metabolic pathways under Hg stress.

Genome Sequencing and Genome-Wide Identification of Carbohydrate-Active Enzymes (CAZymes) in the White Rot Fungus Flammulina fennae

  • Lee, Chang-Soo;Kong, Won-Sik;Park, Young-Jin
    • Microbiology and Biotechnology Letters
    • /
    • v.46 no.3
    • /
    • pp.300-312
    • /
    • 2018
  • Whole-genome sequencing of the wood-rotting fungus, Flammulina fennae, was carried out to identify carbohydrate-active enzymes (CAZymes). De novo genome assembly (31 kmer) of short reads by next-generation sequencing revealed a total genome length of 32,423,623 base pairs (39% GC). A total of 11,591 gene models in the assembled genome sequence of F. fennae were predicted by ab initio gene prediction using the AUGUSTUS tool. In a genome-wide comparison, 6,715 orthologous groups shared at least one gene with F. fennae and 10,667 (92%) of 11,591 genes for F. fennae proteins had orthologs among the Dikarya. Additionally, F. fennae contained 23 species-specific genes, of which 16 were paralogous. CAZyme identification and annotation revealed 513 CAZymes, including 82 auxiliary activities, 220 glycoside hydrolases, 85 glycosyltransferases, 20 polysaccharide lyases, 57 carbohydrate esterases, and 45 carbohydrate binding-modules in the F. fennae genome. The genome information of F. fennae increases the understanding of this basidiomycete fungus. CAZyme gene information will be useful for detailed studies of lignocellulosic biomass degradation for biotechnological and industrial applications.

Deciphering the Role of Tyrosine Sulfation in Xanthomonas oryzae pv. oryzae Using Shotgun Proteomic Analysis

  • Park, Hye-Jee;Park, Chang-Jin;Bae, Nahee;Han, Sang-Wook
    • The Plant Pathology Journal
    • /
    • v.32 no.3
    • /
    • pp.266-272
    • /
    • 2016
  • A bacterial tyrosine sulfotransferase, RaxST, is required for activation of rice XA21-mediated immunity, and it catalyzes sulfation of tyrosine residues of Omp1X and RaxX in Xanthomonas oryzae pv. oryzae, a causal agent of bacterial blight in rice. Although RaxST is biochemically well-characterized, biological functions of tyrosine sulfation have not been fully elucidated. We compared protein expression patterns between the wildtype and a raxST knockout mutant using shotgun proteomic analysis. Forty nine proteins displayed a more than 1.5-fold difference in their expression between the wildtype and the mutant strains. Clusters of orthologous groups analysis revealed that proteins involved in cell motility were most abundant, and phenotypic observation also showed that the twitching motility of the mutant was dramatically changed. These results indicate that tyrosine sulfation by RaxST is essential for Xoo movement, and they provide new insights into the biological roles of RaxST in cellular processes.

Isolation and Molecular Phylogeny of Three Muscle Actin Isoforms of an Endangered Freshwater Fish Species Hemibarbus mylodon (Cypriniformes; Cyprinidae)

  • Kim, Keun-Yong;Nam, Yoon-Kwon
    • Journal of Aquaculture
    • /
    • v.22 no.1
    • /
    • pp.83-91
    • /
    • 2009
  • The Korean doty barbel Hemibarbus mylodon (Cypriniformes; Cyprinidae) is a critically endangered freshwater fish species mainly because of its natural habitat degradation. Three full-length complementary DNA (cDNA) clones representing different muscle actin isoforms were isolated and characterized. The three muscle actin isoforms were 1,294-1,601 bp long with the identical open reading frames of 1,134 bp with the deduced amino acid residues of 377. They showed 83.9-87.2% identities in the coding nucleotide level and 96.8-98.1% identities in the amino acid level. Phylogenetic analysis with the coding nucleotide sequences revealed that three muscle actin isoforms of H. mylodon formed strongly supported monophyletic groups with one of cypriniform skeletal $\alpha$-actin (acta1), cypriniform aortic $\alpha$-actins (acta2), and uncharacterized Danio rerio muscle actin isoform/Salmo trutta slow muscle actin (a novel muscle actin type). Our phylogenetic tree further suggested that cypriniform acta2 only showed the orthologous relationship to tetrapod acta2. Other multiple actin isoforms from diverse teleostean taxa were however clustered to no tetrapod orthologs, i.e., acta1, cardiac $\alpha$-actins (aetc1), acta2, and enteric $\gamma$-actin (actg2). This result strongly suggested that teleostean muscle actins have experienced different and complicated evolutionary history in comparison to mammalian counterparts.

Web-Based Computational System for Protein-Protein Interaction Inference

  • Kim, Ki-Bong
    • Journal of Information Processing Systems
    • /
    • v.8 no.3
    • /
    • pp.459-470
    • /
    • 2012
  • Recently, high-throughput technologies such as the two-hybrid system, protein chip, Mass Spectrometry, and the phage display have furnished a lot of data on protein-protein interactions (PPIs), but the data has not been accurate so far and the quantity has also been limited. In this respect, computational techniques for the prediction and validation of PPIs have been developed. However, existing computational methods do not take into account the fact that a PPI is actually originated from the interactions of domains that each protein contains. So, in this work, the information on domain modules of individual proteins has been employed in order to find out the protein interaction relationship. The system developed here, WASPI (Web-based Assistant System for Protein-protein interaction Inference), has been implemented to provide many functional insights into the protein interactions and their domains. To achieve those objectives, several preprocessing steps have been taken. First, the domain module information of interacting proteins was extracted by taking advantage of the InterPro database, which includes protein families, domains, and functional sites. The InterProScan program was used in this preprocess. Second, the homology comparison with the GO (Gene Ontology) and COG (Clusters of Orthologous Groups) with an E-value of $10^{-5}$, $10^{-3}$ respectively, was employed to obtain the information on the function and annotation of each interacting protein of a secondary PPI database in the WASPI. The BLAST program was utilized for the homology comparison.