• 제목/요약/키워드: orthology

검색결과 9건 처리시간 0.029초

Genomic Tree of Gene Contents Based on Functional Groups of KEGG Orthology

  • Kim Jin-Sik;Lee Sang-Yup
    • Journal of Microbiology and Biotechnology
    • /
    • 제16권5호
    • /
    • pp.748-756
    • /
    • 2006
  • We propose a genome-scale clustering approach to identify whole genome relationships using the functional groups given by the Kyoto Encyclopedia of Genes and Genomes Orthology (KO) database. The metabolic capabilities of each organism were defined by the number of genes in each functional category. The archaeal, bacterial, and eukaryotic genomes were compared by simultaneously applying a two-step clustering method, comprised of a self-organizing tree algorithm followed by unsupervised hierarchical clustering. The clustering results were consistent with various phenotypic characteristics of the organisms analyzed and, additionally, showed a different aspect of the relationship between genomes that have previously been established through rRNA-based comparisons. The proposed approach to collect and cluster the metabolic functional capabilities of organisms should make it a useful tool in predicting relationships among organisms.

다종의 유전체로부터 탐지된 Ortholog 군집에 대한 분석 (An Analysis of Ortholog Clusters Detected from Multiple Genomes)

  • 김선신;오정수;이범주;김태경;정광수;이충세;김영창;조완섭;류근호
    • 한국정보과학회논문지:데이타베이스
    • /
    • 제35권2호
    • /
    • pp.125-131
    • /
    • 2008
  • 새로운 유전체 주석달기와 유전체 진화에 대한 연구를 위해서 올소로그(Ortholog)를 탐지하는 일은 매우 유용하다. 이전에 제안한 연구에서, 우리는 여러 종의 유전체로부터 올소로그 클러스터를 자동적으로 구축하는 방법을 제안하였다. 이 방법은 단지 두 종의 결과를 생성하는 InParanoid를 여러 종으로 확장하고 이와 동일한 질을 가진 결과를 산출한다. 한편, 새롭게 서열이 밝혀진 유전자의 기능을 보다 정확히 예측하기 위해, 패럴로그(Paralog)가 가급적 적게 포함되는 올소로그 클러스터를 구축하는 것이 중요한 문제가 될 수 있다. 이 논문에서, 우리는 임계값을 사용하여 보다 순수한 올소로그 클러스터를 구축하는 방법에 대하여 조사하였다 우리는 20개의 원핵생물의 데이타셋으로부터 올소로그 클러스터를 구축하였다. 우리의 올소로그 클러스터를 COG(Clusters of Orthologous Group) 및 KO(Kegg Orthology)와 비교하였을 매, 약 90%의 유사도를 가지며 임계간의 증가와 더불어 증가하는 경향이 있다.

Clustering and Comparative Analyses of Complete Genomes for the Elucidation of Evolutionary Characteristics

  • Kim, Jin-Sik;Lee, Sang-Yup
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.78-82
    • /
    • 2005
  • Three of the genus Pseudomonas (P. aeruginosa, P. putida, P. syringae) show highly different phenotypic characteristics among them. Two of the three members are pathogenic and the other is non-pathogenic. Comparative analyses of the complete genomes can elucidate the genomic similarities and differences among them. We analyzed the three genomes and the genes of them to reveal the degree of conservation of chromosomes and similarity of the genes. The 2-dimensional dot plot between the pathogenic P. aeruginosa and non-pathogenic P. putida shared higher portion of the nucleotide sequences than other two combinations. Comparison of the nucleotide compositions by calculating the genome-scale plot of G+C contents and GC skew showed the variation of location. Comparison of the metabolic capabilities using the functional classification of KEGG orthology revealed that the differences in the number of genes for the specific functional categories resulted in the phenotypic differences. Finally combination of the analyses using the protein homologs supported the evolutionary distance of the P. putida obtained from other genome-scale comparisons.

  • PDF

Insilico Analysis for Expressed Sequence Tags from Embryogenic Callus and Flower Buds of Panax ginseng C. A. Meyer

  • Sathiyamoorthy, Subramaniyam;In, Jun-Gyo;Lee, Byum-Soo;Kwon, Woo-Seang;Yang, Dong-Uk;Kim, Ju-Han;Yang, Deok-Chun
    • Journal of Ginseng Research
    • /
    • 제35권1호
    • /
    • pp.21-30
    • /
    • 2011
  • Panax ginseng root has been used as a major source of ginsenoside throughout the history of oriental medicine. In recent years, scientists have found that all of its biomass, including embryogenic calli and flower buds can contain similar active ingredients with pharmacological functions. In this study, transcriptome analyses were used to identify different gene expressions from embryogenic calli and fl ower buds. In total, 6,226 expressed sequence tags (ESTs) were obtained from cDNA libraries of P. ginseng. Insilico analysis was conducted to annotate the putative sequences using gene ontology functional analysis, Kyoto Encyclopedia of Genes and Genomes orthology biochemical analysis, and interproscan protein functional domain analysis. From the obtained results, genes responsible for growth, pathogenicity, pigments, ginsenoside pathway, and development were discussed. Almost 83.3% of the EST sequence was annotated using one-dimensional insilico analysis.

Transcriptome analysis of a medicinal plant, Pistacia chinensis

  • Choi, Ki-Young;Park, Duck Hwan;Seong, Eun-Soo;Lee, Sang Woo;Hang, Jin;Yi, Li Wan;Kim, Jong-Hwa;Na, Jong-Kuk
    • Journal of Plant Biotechnology
    • /
    • 제46권4호
    • /
    • pp.274-281
    • /
    • 2019
  • Pistacia chinensis Bunge has not only been used as a medicinal plant to treat various illnesses but its young shoots and leaves have also been used as vegetables. In addition, P. chinensis is used as a rootstock for Pistacia vera (pistachio). Here, the transcriptome of P. chinensis was sequenced to enrich genetic resources and identify secondary metabolite biosynthetic pathways using Illumina RNA-seq methods. De novo assembly resulted in 18,524 unigenes with an average length of 873 bp from 19 million RNA-seq reads. A Kyoto Encyclopedia of Genes and Genomes (KEGG) annotation tool assigned KO (KEGG orthology) numbers to 6,553 (36.2%) unigenes, among which 4,061 unigenes were mapped into 391 different metabolic pathways. For terpenoid backbone and carotenoid biosynthesis pathways, 44 and 22 unigenes encode enzymes corresponding to 30 and 16 entries, respectively. Twenty-two unigenes encode proteins for 16 entries of the carotenoid biosynthesis pathway. As for the phenylpropanoid and flavonoid biosynthesis pathways, 63 and 24 unigenes were homologous to 17 and 14 entry proteins, respectively. Mining of simple sequence repeat identified 2,599 simple sequence repeats from P. chinensis unigenes. The results of the present study provide a valuable resource for in-depth studies on comparative and functional genomics to unravel the underlying mechanisms of the medicinal properties of Pistacia L.

Comparing Two Mycobacterium tuberculosis Genomes from Chinese Immigrants with Native Genomes Using Mauve Alignments

  • Ryoo, Sungweon;Lee, Jeongsoo;Oh, Jee Youn;Kim, Byeong Ki;Kim, Young;Kim, Je Hyeong;Shin, Chol;Lee, Seung Heon
    • Tuberculosis and Respiratory Diseases
    • /
    • 제81권3호
    • /
    • pp.216-221
    • /
    • 2018
  • Background: The number of immigrants with tuberculosis (TB) increases each year in South Korea. Determining the transmission dynamics based on whole genome sequencing (WGS) to cluster the strains has been challenging. Methods: WGS, annotation refinement, and orthology assignment for the GenBank accession number acquisition were performed on two clinical isolates from Chinese immigrants. In addition, the genomes of the two isolates were compared with the genomes of Mycobacterium tuberculosis isolates, from two native Korean and five native Chinese individuals using a phylogenetic topology tree based on the Multiple Alignment of Conserved Genomic Sequence with Rearrangements (Mauve) package. Results: The newly assigned accession numbers for two clinical isolates were CP020381.2 (a Korean-Chinese from Yanbian Province) and CP022014.1 (a Chinese from Shandong Province), respectively. Mauve alignment classified all nine TB isolates into a discriminative collinear set with matched regions. The phylogenetic analysis revealed a rooted phylogenetic tree grouping the nine strains into two lineages: strains from Chinese individuals and strains from Korean individuals. Conclusion: Phylogenetic trees based on the Mauve alignments were supposed to be useful in revealing the dynamics of TB transmission from immigrants in South Korea, which can provide valuable information for scaling up the TB screening policy for immigrants.

Supragingival Plaque Microbial Community Analysis of Children with Halitosis

  • Ren, Wen;Zhang, Qun;Liu, Xuenan;Zheng, Shuguo;Ma, Lili;Chen, Feng;Xu, Tao;Xu, Baohua
    • Journal of Microbiology and Biotechnology
    • /
    • 제26권12호
    • /
    • pp.2141-2147
    • /
    • 2016
  • As one of the most complex human-associated microbial habitats, the oral cavity harbors hundreds of bacteria. Halitosis is a prevalent oral condition that is typically caused by bacteria. The aim of this study was to analyze the microbial communities and predict functional profiles in supragingival plaque from healthy individuals and those with halitosis. Ten preschool children were enrolled in this study; five with halitosis and five without. Supragingival plaque was isolated from each participant and 16S rRNA gene pyrosequencing was used to identify the microbes present. Samples were primarily composed of Actinobacteria, Bacteroidetes, Proteobacteria, Firmicutes, Fusobacteria, and Candidate phylum TM7. The ${\alpha}$ and ${\beta}$ diversity indices did not differ between healthy and halitosis subjects. Fifteen operational taxonomic units (OTUs) were identified with significantly different relative abundances between healthy and halitosis plaques, and included the phylotypes of Prevotella sp., Leptotrichia sp., Actinomyces sp., Porphyromonas sp., Selenomonas sp., Selenomonas noxia, and Capnocytophaga ochracea. We suggest that these OTUs are candidate halitosis-associated pathogens. Functional profiles were predicted using PICRUSt, and nine level-3 KEGG Orthology groups were significantly different. Hub modules of co-occurrence networks implied that microbes in halitosis dental plaque were more highly conserved than microbes of healthy individuals' plaque. Collectively, our data provide a background for the oral microbiota associated with halitosis from supragingival plaque, and help explain the etiology of halitosis.

Construction of EST Database for Comparative Gene Studies of Acanthamoeba

  • Moon, Eun-Kyung;Kim, Joung-Ok;Xuan, Ying-Hua;Yun, Young-Sun;Kang, Se-Won;Lee, Yong-Seok;Ahn, Tae-In;Hong, Yeon-Chul;Chung, Dong-Il;Kong, Hyun-Hee
    • Parasites, Hosts and Diseases
    • /
    • 제47권2호
    • /
    • pp.103-107
    • /
    • 2009
  • The genus Acanthamoeba can cause severe infections such as granulomatous amebic encephalitis and amebic keratitis in humans. However, little genomic information of Acanthamoeba has been reported. Here, we constructed Acanthamoeba expressed sequence tags (EST) database (Acanthamoeba EST DB) derived from our 4 kinds of Acanthamoeba cDNA library. The Acanthamoeba EST DB contains 3,897 EST generated from amebae under various conditions of long term in vitro culture, mouse brain passage, or encystation, and downloaded data of Acanthamoeba from National Center for Biotechnology Information (NCBI) and Taxonomically Broad EST Database (TBestDB). The almost reported eDNA/genomic sequences of Acanthamoeba provide stand alone BLAST system with nucleotide (BLAST NT) and amino acid (BLAST AA) sequence database. In BLAST results, each gene links for the significant information including sequence data, gene orthology annotations, relevant references, and a BlastX result. This is the first attempt for construction of Acanthamoeba database with genes expressed in diverse conditions. These data were integrated into a database (http://www. amoeba.or.kr).

Gramene database: A resource for comparative plant genomics, pathways and phylogenomics analyses

  • Tello-Ruiz, Marcela K.;Stein, Joshua;Wei, Sharon;Preece, Justin;Naithani, Sushma;Olson, Andrew;Jiao, Yinping;Gupta, Parul;Kumari, Sunita;Chougule, Kapeel;Elser, Justin;Wang, Bo;Thomason, James;Zhang, Lifang;D'Eustachio, Peter;Petryszak, Robert;Kersey, Paul;Lee, PanYoung Koung;Jaiswal, kaj;Ware, Doreen
    • 한국작물학회:학술대회논문집
    • /
    • 한국작물학회 2017년도 9th Asian Crop Science Association conference
    • /
    • pp.135-135
    • /
    • 2017
  • The Gramene database (http://www.gramene.org) is a powerful online resource for agricultural researchers, plant breeders and educators that provides easy access to reference data, visualizations and analytical tools for conducting cross-species comparisons. Learn the benefits of using Gramene to enrich your lectures, accelerate your research goals, and respond to your organismal community needs. Gramene's genomes portal hosts browsers for 44 complete reference genomes, including crops and model organisms, each displaying functional annotations, gene-trees with orthologous and paralogous gene classification, and whole-genome alignments. SNP and structural diversity data, available for 11 species, are displayed in the context of gene annotation, protein domains and functional consequences on transcript structure (e.g., missense variant). Browsers from multiple species can be viewed simultaneously with links to community-driven organismal databases. Thus, while hosting the underlying data for comparative studies, the portal also provides unified access to diverse plant community resources, and the ability for communities to upload and display private data sets in multiple standard formats. Our BioMart data mining interface enable complex queries and bulk download of sequence, annotation, homology and variation data. Gramene's pathway portal, the Plant Reactome, hosts over 240 pathways curated in rice and inferred in 66 additional plant species by orthology projection. Users may compare pathways across species, query and visualize curated expression data from EMBL-EBI's Expression Atlas in the context of pathways, analyze genome-scale expression data, and conduct pathway enrichment analysis. Our integrated search database and modern user interface leverage these diverse annotations to facilitate finding genes through selecting auto-suggested filters with interactive views of the results.

  • PDF