• Title/Summary/Keyword: orthology

Search Result 9, Processing Time 0.026 seconds

Genomic Tree of Gene Contents Based on Functional Groups of KEGG Orthology

  • Kim Jin-Sik;Lee Sang-Yup
    • Journal of Microbiology and Biotechnology
    • /
    • v.16 no.5
    • /
    • pp.748-756
    • /
    • 2006
  • We propose a genome-scale clustering approach to identify whole genome relationships using the functional groups given by the Kyoto Encyclopedia of Genes and Genomes Orthology (KO) database. The metabolic capabilities of each organism were defined by the number of genes in each functional category. The archaeal, bacterial, and eukaryotic genomes were compared by simultaneously applying a two-step clustering method, comprised of a self-organizing tree algorithm followed by unsupervised hierarchical clustering. The clustering results were consistent with various phenotypic characteristics of the organisms analyzed and, additionally, showed a different aspect of the relationship between genomes that have previously been established through rRNA-based comparisons. The proposed approach to collect and cluster the metabolic functional capabilities of organisms should make it a useful tool in predicting relationships among organisms.

An Analysis of Ortholog Clusters Detected from Multiple Genomes (다종의 유전체로부터 탐지된 Ortholog 군집에 대한 분석)

  • Kim, Sun-Shin;Oh, Jeong-Su;Lee, Bum-Ju;Kim, Tae-Kyung;Jung, Kwang-Su;Rhee, Chung-Sei;Kim, Young-Chang;Cho, Wan-Sup;Ryu, Keun-Ho
    • Journal of KIISE:Databases
    • /
    • v.35 no.2
    • /
    • pp.125-131
    • /
    • 2008
  • It is very useful to predict orthologs for new genome annotation and research on genome evolution. We showed that the previous work can be extended to construct OCs(Ortholog Clusters) automatically from multiple complete-genomes. The proposed method also has the quality of production of InParanoid, which produces orthologs from just two genomes. On the other hand, in order to predict more exactly the function of a newly sequenced gene it can be an important issue to prevent unwanted inclusion of paralogs into the OCs. We have, here, investigated how well it is possible to construct a functionally purer OCs with score cut-offs. Our OCs were generated from the datasets of 20 procaryotes. The similarity with both COG(Clusters of Orthologous Group) and KO(Kegg Orthology) against our OCs has about 90% and inclines to increase with the growth of score cut-offs.

Clustering and Comparative Analyses of Complete Genomes for the Elucidation of Evolutionary Characteristics

  • Kim, Jin-Sik;Lee, Sang-Yup
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.78-82
    • /
    • 2005
  • Three of the genus Pseudomonas (P. aeruginosa, P. putida, P. syringae) show highly different phenotypic characteristics among them. Two of the three members are pathogenic and the other is non-pathogenic. Comparative analyses of the complete genomes can elucidate the genomic similarities and differences among them. We analyzed the three genomes and the genes of them to reveal the degree of conservation of chromosomes and similarity of the genes. The 2-dimensional dot plot between the pathogenic P. aeruginosa and non-pathogenic P. putida shared higher portion of the nucleotide sequences than other two combinations. Comparison of the nucleotide compositions by calculating the genome-scale plot of G+C contents and GC skew showed the variation of location. Comparison of the metabolic capabilities using the functional classification of KEGG orthology revealed that the differences in the number of genes for the specific functional categories resulted in the phenotypic differences. Finally combination of the analyses using the protein homologs supported the evolutionary distance of the P. putida obtained from other genome-scale comparisons.

  • PDF

Insilico Analysis for Expressed Sequence Tags from Embryogenic Callus and Flower Buds of Panax ginseng C. A. Meyer

  • Sathiyamoorthy, Subramaniyam;In, Jun-Gyo;Lee, Byum-Soo;Kwon, Woo-Seang;Yang, Dong-Uk;Kim, Ju-Han;Yang, Deok-Chun
    • Journal of Ginseng Research
    • /
    • v.35 no.1
    • /
    • pp.21-30
    • /
    • 2011
  • Panax ginseng root has been used as a major source of ginsenoside throughout the history of oriental medicine. In recent years, scientists have found that all of its biomass, including embryogenic calli and flower buds can contain similar active ingredients with pharmacological functions. In this study, transcriptome analyses were used to identify different gene expressions from embryogenic calli and fl ower buds. In total, 6,226 expressed sequence tags (ESTs) were obtained from cDNA libraries of P. ginseng. Insilico analysis was conducted to annotate the putative sequences using gene ontology functional analysis, Kyoto Encyclopedia of Genes and Genomes orthology biochemical analysis, and interproscan protein functional domain analysis. From the obtained results, genes responsible for growth, pathogenicity, pigments, ginsenoside pathway, and development were discussed. Almost 83.3% of the EST sequence was annotated using one-dimensional insilico analysis.

Transcriptome analysis of a medicinal plant, Pistacia chinensis

  • Choi, Ki-Young;Park, Duck Hwan;Seong, Eun-Soo;Lee, Sang Woo;Hang, Jin;Yi, Li Wan;Kim, Jong-Hwa;Na, Jong-Kuk
    • Journal of Plant Biotechnology
    • /
    • v.46 no.4
    • /
    • pp.274-281
    • /
    • 2019
  • Pistacia chinensis Bunge has not only been used as a medicinal plant to treat various illnesses but its young shoots and leaves have also been used as vegetables. In addition, P. chinensis is used as a rootstock for Pistacia vera (pistachio). Here, the transcriptome of P. chinensis was sequenced to enrich genetic resources and identify secondary metabolite biosynthetic pathways using Illumina RNA-seq methods. De novo assembly resulted in 18,524 unigenes with an average length of 873 bp from 19 million RNA-seq reads. A Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation tool assigned KO (KEGG orthology) numbers to 6,553 (36.2%) unigenes, among which 4,061 unigenes were mapped into 391 different metabolic pathways. For terpenoid backbone and carotenoid biosynthesis pathways, 44 and 22 unigenes encode enzymes corresponding to 30 and 16 entries, respectively. Twenty-two unigenes encode proteins for 16 entries of the carotenoid biosynthesis pathway. As for the phenylpropanoid and flavonoid biosynthesis pathways, 63 and 24 unigenes were homologous to 17 and 14 entry proteins, respectively. Mining of simple sequence repeat identified 2,599 simple sequence repeats from P. chinensis unigenes. The results of the present study provide a valuable resource for in-depth studies on comparative and functional genomics to unravel the underlying mechanisms of the medicinal properties of Pistacia L.

Comparing Two Mycobacterium tuberculosis Genomes from Chinese Immigrants with Native Genomes Using Mauve Alignments

  • Ryoo, Sungweon;Lee, Jeongsoo;Oh, Jee Youn;Kim, Byeong Ki;Kim, Young;Kim, Je Hyeong;Shin, Chol;Lee, Seung Heon
    • Tuberculosis and Respiratory Diseases
    • /
    • v.81 no.3
    • /
    • pp.216-221
    • /
    • 2018
  • Background: The number of immigrants with tuberculosis (TB) increases each year in South Korea. Determining the transmission dynamics based on whole genome sequencing (WGS) to cluster the strains has been challenging. Methods: WGS, annotation refinement, and orthology assignment for the GenBank accession number acquisition were performed on two clinical isolates from Chinese immigrants. In addition, the genomes of the two isolates were compared with the genomes of Mycobacterium tuberculosis isolates, from two native Korean and five native Chinese individuals using a phylogenetic topology tree based on the Multiple Alignment of Conserved Genomic Sequence with Rearrangements (Mauve) package. Results: The newly assigned accession numbers for two clinical isolates were CP020381.2 (a Korean-Chinese from Yanbian Province) and CP022014.1 (a Chinese from Shandong Province), respectively. Mauve alignment classified all nine TB isolates into a discriminative collinear set with matched regions. The phylogenetic analysis revealed a rooted phylogenetic tree grouping the nine strains into two lineages: strains from Chinese individuals and strains from Korean individuals. Conclusion: Phylogenetic trees based on the Mauve alignments were supposed to be useful in revealing the dynamics of TB transmission from immigrants in South Korea, which can provide valuable information for scaling up the TB screening policy for immigrants.

Supragingival Plaque Microbial Community Analysis of Children with Halitosis

  • Ren, Wen;Zhang, Qun;Liu, Xuenan;Zheng, Shuguo;Ma, Lili;Chen, Feng;Xu, Tao;Xu, Baohua
    • Journal of Microbiology and Biotechnology
    • /
    • v.26 no.12
    • /
    • pp.2141-2147
    • /
    • 2016
  • As one of the most complex human-associated microbial habitats, the oral cavity harbors hundreds of bacteria. Halitosis is a prevalent oral condition that is typically caused by bacteria. The aim of this study was to analyze the microbial communities and predict functional profiles in supragingival plaque from healthy individuals and those with halitosis. Ten preschool children were enrolled in this study; five with halitosis and five without. Supragingival plaque was isolated from each participant and 16S rRNA gene pyrosequencing was used to identify the microbes present. Samples were primarily composed of Actinobacteria, Bacteroidetes, Proteobacteria, Firmicutes, Fusobacteria, and Candidate phylum TM7. The ${\alpha}$ and ${\beta}$ diversity indices did not differ between healthy and halitosis subjects. Fifteen operational taxonomic units (OTUs) were identified with significantly different relative abundances between healthy and halitosis plaques, and included the phylotypes of Prevotella sp., Leptotrichia sp., Actinomyces sp., Porphyromonas sp., Selenomonas sp., Selenomonas noxia, and Capnocytophaga ochracea. We suggest that these OTUs are candidate halitosis-associated pathogens. Functional profiles were predicted using PICRUSt, and nine level-3 KEGG Orthology groups were significantly different. Hub modules of co-occurrence networks implied that microbes in halitosis dental plaque were more highly conserved than microbes of healthy individuals' plaque. Collectively, our data provide a background for the oral microbiota associated with halitosis from supragingival plaque, and help explain the etiology of halitosis.

Construction of EST Database for Comparative Gene Studies of Acanthamoeba

  • Moon, Eun-Kyung;Kim, Joung-Ok;Xuan, Ying-Hua;Yun, Young-Sun;Kang, Se-Won;Lee, Yong-Seok;Ahn, Tae-In;Hong, Yeon-Chul;Chung, Dong-Il;Kong, Hyun-Hee
    • Parasites, Hosts and Diseases
    • /
    • v.47 no.2
    • /
    • pp.103-107
    • /
    • 2009
  • The genus Acanthamoeba can cause severe infections such as granulomatous amebic encephalitis and amebic keratitis in humans. However, little genomic information of Acanthamoeba has been reported. Here, we constructed Acanthamoeba expressed sequence tags (EST) database (Acanthamoeba EST DB) derived from our 4 kinds of Acanthamoeba cDNA library. The Acanthamoeba EST DB contains 3,897 EST generated from amebae under various conditions of long term in vitro culture, mouse brain passage, or encystation, and downloaded data of Acanthamoeba from National Center for Biotechnology Information (NCBI) and Taxonomically Broad EST Database (TBestDB). The almost reported eDNA/genomic sequences of Acanthamoeba provide stand alone BLAST system with nucleotide (BLAST NT) and amino acid (BLAST AA) sequence database. In BLAST results, each gene links for the significant information including sequence data, gene orthology annotations, relevant references, and a BlastX result. This is the first attempt for construction of Acanthamoeba database with genes expressed in diverse conditions. These data were integrated into a database (http://www. amoeba.or.kr).

Gramene database: A resource for comparative plant genomics, pathways and phylogenomics analyses

  • Tello-Ruiz, Marcela K.;Stein, Joshua;Wei, Sharon;Preece, Justin;Naithani, Sushma;Olson, Andrew;Jiao, Yinping;Gupta, Parul;Kumari, Sunita;Chougule, Kapeel;Elser, Justin;Wang, Bo;Thomason, James;Zhang, Lifang;D'Eustachio, Peter;Petryszak, Robert;Kersey, Paul;Lee, PanYoung Koung;Jaiswal, kaj;Ware, Doreen
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2017.06a
    • /
    • pp.135-135
    • /
    • 2017
  • The Gramene database (http://www.gramene.org) is a powerful online resource for agricultural researchers, plant breeders and educators that provides easy access to reference data, visualizations and analytical tools for conducting cross-species comparisons. Learn the benefits of using Gramene to enrich your lectures, accelerate your research goals, and respond to your organismal community needs. Gramene's genomes portal hosts browsers for 44 complete reference genomes, including crops and model organisms, each displaying functional annotations, gene-trees with orthologous and paralogous gene classification, and whole-genome alignments. SNP and structural diversity data, available for 11 species, are displayed in the context of gene annotation, protein domains and functional consequences on transcript structure (e.g., missense variant). Browsers from multiple species can be viewed simultaneously with links to community-driven organismal databases. Thus, while hosting the underlying data for comparative studies, the portal also provides unified access to diverse plant community resources, and the ability for communities to upload and display private data sets in multiple standard formats. Our BioMart data mining interface enable complex queries and bulk download of sequence, annotation, homology and variation data. Gramene's pathway portal, the Plant Reactome, hosts over 240 pathways curated in rice and inferred in 66 additional plant species by orthology projection. Users may compare pathways across species, query and visualize curated expression data from EMBL-EBI's Expression Atlas in the context of pathways, analyze genome-scale expression data, and conduct pathway enrichment analysis. Our integrated search database and modern user interface leverage these diverse annotations to facilitate finding genes through selecting auto-suggested filters with interactive views of the results.

  • PDF