• Title/Summary/Keyword: Whole genome sequence

Search Result 201, Processing Time 0.026 seconds

Mining the Proteome of Fusobacterium nucleatum subsp. nucleatum ATCC 25586 for Potential Therapeutics Discovery: An In Silico Approach

  • Habib, Abdul Musaweer;Islam, Md. Saiful;Sohel, Md.;Mazumder, Md. Habibul Hasan;Sikder, Mohd. Omar Faruk;Shahik, Shah Md.
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.255-264
    • /
    • 2016
  • The plethora of genome sequence information of bacteria in recent times has ushered in many novel strategies for antibacterial drug discovery and facilitated medical science to take up the challenge of the increasing resistance of pathogenic bacteria to current antibiotics. In this study, we adopted subtractive genomics approach to analyze the whole genome sequence of the Fusobacterium nucleatum, a human oral pathogen having association with colorectal cancer. Our study divulged 1,499 proteins of F. nucleatum, which have no homolog's in human genome. These proteins were subjected to screening further by using the Database of Essential Genes (DEG) that resulted in the identification of 32 vitally important proteins for the bacterium. Subsequent analysis of the identified pivotal proteins, using the Kyoto Encyclopedia of Genes and Genomes (KEGG) Automated Annotation Server (KAAS) resulted in sorting 3 key enzymes of F. nucleatum that may be good candidates as potential drug targets, since they are unique for the bacterium and absent in humans. In addition, we have demonstrated the three dimensional structure of these three proteins. Finally, determination of ligand binding sites of the 2 key proteins as well as screening for functional inhibitors that best fitted with the ligands sites were conducted to discover effective novel therapeutic compounds against F. nucleatum.

Complete mitochondrial genome of the Japanese oak silkmoth, Antheraea yamamai (Lepidoptera: Saturniidae), from Jeju Island, Korea

  • Kim, Kee-Young;Park, Jeong Sun;Lee, Keon Hee;Kim, Min Jee;Kim, Seong-Wan;Park, Jong-Woo;Kang, Sang-Kuk;Kim, Nam-Suk;Kim, Iksoo
    • International Journal of Industrial Entomology and Biomaterials
    • /
    • v.44 no.2
    • /
    • pp.65-71
    • /
    • 2022
  • The wild silkmoth Antheraea yamamai Guérin-Méneville, 1861 (Lepidoptera: Saturniidae) is an important producer of silk that is superior to the silk produced by traditional domesticated silkworm. In this study, we sequenced the complete mitochondrial genome (mitogenome) of An. yamamai collected from Jeju Island, which is the southernmost island approximately 100 km offshore southward from the Korean Peninsula. Determining this sequence will be necessary for tracing the biogeographic history of the species and developing molecular markers for identifying the origin of commercial products. Comparison of the sequence divergence among two available and the current mitogenomes revealed a low but substantial number of substitutions, totaling 23 nucleotides in the whole genome. CytB and ND5 showed the highest variability with five and four variations, respectively, suggesting that these regions will be prior regions to target for subsequent biogeographic and diagnosis study. Phylogenetic reconstruction based on all available sequences of Saturniidae showed that An. yamamai is a sister to the congeneric species An. pernyi, corroborating that Antheraea is a highly supported monophyletic group. The tribe Saturniini was clearly non-monophyletic and interrupted by Attacini and Bunaeini.

Prediction of Genes Related to Positive Selection Using Whole-Genome Resequencing in Three Commercial Pig Breeds

  • Kim, HyoYoung;Caetano-Anolles, Kelsey;Seo, Minseok;Kwon, Young-jun;Cho, Seoae;Seo, Kangseok;Kim, Heebal
    • Genomics & Informatics
    • /
    • v.13 no.4
    • /
    • pp.137-145
    • /
    • 2015
  • Selective sweep can cause genetic differentiation across populations, which allows for the identification of possible causative regions/genes underlying important traits. The pig has experienced a long history of allele frequency changes through artificial selection in the domestication process. We obtained an average of 329,482,871 sequence reads for 24 pigs from three pig breeds: Yorkshire (n = 5), Landrace (n = 13), and Duroc (n = 6). An average read depth of 11.7 was obtained using whole-genome resequencing on an Illumina HiSeq2000 platform. In this study, cross-population extended haplotype homozygosity and cross-population composite likelihood ratio tests were implemented to detect genes experiencing positive selection for the genome-wide resequencing data generated from three commercial pig breeds. In our results, 26, 7, and 14 genes from Yorkshire, Landrace, and Duroc, respectively were detected by two kinds of statistical tests. Significant evidence for positive selection was identified on genes ST6GALNAC2 and EPHX1 in Yorkshire, PARK2 in Landrace, and BMP6, SLA-DQA1, and PRKG1 in Duroc. These genes are reportedly relevant to lactation, reproduction, meat quality, and growth traits. To understand how these single nucleotide polymorphisms (SNPs) related positive selection affect protein function, we analyzed the effect of non-synonymous SNPs. Three SNPs (rs324509622, rs80931851, and rs80937718) in the SLA-DQA1 gene were significant in the enrichment tests, indicating strong evidence for positive selection in Duroc. Our analyses identified genes under positive selection for lactation, reproduction, and meat-quality and growth traits in Yorkshire, Landrace, and Duroc, respectively.

From the Sequence to Cell Modeling: Comprehensive Functional Genomics in Escherichia coli

  • Mori, Hirotada
    • BMB Reports
    • /
    • v.37 no.1
    • /
    • pp.83-92
    • /
    • 2004
  • As a result of the enormous amount of information that has been collected with E. coli over the past half century (e.g. genome sequence, mutant phenotypes, metabolic and regulatory networks, etc.), we now have detailed knowledge about gene regulation, protein activity, several hundred enzyme reactions, metabolic pathways, macromolecular machines, and regulatory interactions for this model organism. However, understanding how all these processes interact to form a living cell will require further characterization, quantification, data integration, and mathematical modeling, systems biology. No organism can rival E. coli with respect to the amount of available basic information and experimental tractability for the technologies needed for this undertaking. A focused, systematic effort to understand the E. coli cell will accelerate the development of new post-genomic technologies, including both experimental and computational tools. It will also lead to new technologies that will be applicable to other organisms, from microbes to plants, animals, and humans. E. coli is not only the best studied free-living model organism, but is also an extensively used microbe for industrial applications, especially for the production of small molecules of interest. It is an excellent representative of Gram-negative commensal bacteria. E. coli may represent a perfect model organism for systems biology that is aimed at elucidating both its free-living and commensal life-styles, which should open the door to whole-cell modeling and simulation.

Korea Brassica Genome Project: Current Status and Prospective (배추 유전체열구의 현황과 전망)

  • Choi, Su-Ryun;Park, Jee-Yong;Park, Beom-Seok;Kim, Ho-Il;Lim, Yong-Pyo
    • Journal of Plant Biotechnology
    • /
    • v.33 no.3
    • /
    • pp.153-160
    • /
    • 2006
  • Brassica rape is an important species used as a vegetable, oil, and fodder worldwide. It is related phylogenically to Arabidopsis thaliana, which has already been fully sequenced as a model plant. The 'Multinational Brassica Genome Project (MBGP)'was launched by the international Brassica community with the aim of sequencing the whole genome of B. rapa in 2003 on account of its value and the fact that it has the smallest genome among the diploid Brassica. The genome study was carried out not only to know the structure of genome but also to understand the function and the evolution of the genes comprehensively. There are two mapping populations, over 1,000 molecular markers and a genetic map, 2 BAC libraries, physical map, a 22 cDHA libraries as suitable genomic materials for examining the genome of B. rapa ssp. pekinensis Chinese cabbage. As the first step for whole genome analysis, 220,000 BAC-end sequences of the KBrH and KBrB BAC library are achieved by cooperation of six countries. The results of BAC-end sequence analysis will provide a clue in understanding the structure of the genome of Brassica rapa by analyzing the gene sequence, annotation and abundant repetitive DHA. The second stage involves sequencing of the genetically mapped seed BACs and identifying the overlapping BACs for complete genome sequencing. Currently, the second stage is comprises of process genetic anchoring using communal populations and maps to identify more than 1,000 seed BACs based on a BAC-to-BAC strategy. For the initial sequencing, 629 seed BACs corresponding to the minimum tiling path onto Arabidopsis genome were selected and fully sequenced. These BACs are now anchoring to the genetic map using the development of SSR markers. This information will be useful for identifying near BAC clones with the seed BAC on a genome map. From the BAC sequences, it is revealed that the Brassica rapa genome has extensive triplication of the DNA segment coupled with variable gene losses and rearrangements within the segments. This article introduces the current status and prospective of Korea Brassica Genome Project and the bioinformatics tools possessed in each national team. In the near future, data of the genome will contribute to improving Brassicas for their economic use as well as in understanding the evolutional process.

Genomic Research of the Genus Bifidobacterium and Its Application (비피도박테리아의 분자생물학적인 연구 동향)

  • Kim, Geun-Bae
    • Journal of Dairy Science and Biotechnology
    • /
    • v.25 no.2
    • /
    • pp.21-28
    • /
    • 2007
  • Recently, the field of microbiology has been transformed by huge increasing number of publicly available whole-genome sequences. This sequence information has significantly enhanced our understanding of the physiology, genetics, and evolutionary development of bacteria. Among the gastrointestinal microorganisms, bifidobacteria represent the most important human commensals because of their contribution to maintaining a balanced gastrointestinal tract microbiota. In recent years bifidobacteria have drawn much scientific attention due to their use as live bacteria in numerous food products with various health-related claims. For this reason, these bacteria constitute a growing area of interest with respect to genomics, molecular biology, and genetics. Recent genome sequencing of a number of bifidobacterial species has allowed access to the complete genetic make-up of these bacteria. This review will focus how genomic data has allowed us to understand bifidobacterial evolution, while also revealing genetic functions that explains their presence in the particular ecological environment of the gastrointestinal tract.

  • PDF

Genomic Research as a Means to Understand Bacterial Phylogeny and Ecological Adaptation of the Genus Bifidobacterium (Bifidobacterium의 분자생물학적 연구 동향)

  • Kim, Geun-Bae
    • 한국유가공학회:학술대회논문집
    • /
    • 2007.09a
    • /
    • pp.21-29
    • /
    • 2007
  • The field of microbiology has in recent years been transformed by huge increasing number of publicly available whole-genome sequences. This sequence information has significantly enhanced our understanding of the physiology, genetics, and evolutionary development of bacteria. Among the gastrointestinal microorganisms, bifidobacteria represent important human commensals because of their perceived contribution to maintaining a balanced gastrointestinal tract microbiota. In recent years bifidobacteria have drawn much scientific attention due to their use as live bacteria in numerous food products with various health-related claims. For this reason, these bacteria constitute a growing area of interest with respect to genomics, molecular biology, and genetics. Recent genome sequencing of a number of bifidobacterial species has allowed access to the complete genetic make-up of these bacteria. This review will focus how genomic data has allowed us to understand bifidobacterial evolution, while also revealing genetic functions that explains their presence in the particular ecological environment of the gastrointestinal tract.

  • PDF

Genomic Approaches for Understanding the Characteristics of Salmonella enterica subsp. enterica Serovar Typhimurium ST1120, Isolated from Swine Feces in Korea

  • Kim, Seongok;Kim, Eunsuk;Park, Soyeon;Hahn, Tae-Wook;Yoon, Hyunjin
    • Journal of Microbiology and Biotechnology
    • /
    • v.27 no.11
    • /
    • pp.1983-1993
    • /
    • 2017
  • Salmonella enterica subsp. enterica serovar Typhimurium, one of the most common foodborne pathogens, is transmitted mainly through contaminated food derived from infected animals. In this study, S. Typhimurium ST1120, an isolate from pig feces in Korea, was subjected to whole-genome analysis to understand its genomic features associated with virulence. The genome of ST1120 was found to have a circular chromosome of 4,855,001 bp (GC content 52.2%) and a plasmid of 6,863 bp (GC content 46.0%). This chromosome was predicted to have 4,558 open reading frames (ORFs), 17 pseudogenes, 22 rRNA genes, and 86 tRNA genes. Its plasmid was predicted to have three ORFs. Comparative genome analysis revealed that ST1120 was phylogenetically close to S. Typhimurium U288, a critical isolate in piggery farms and food chains in Europe. In silico functional analysis predicted that the ST1120 genome harbored multiple genes associated with virulence and stress resistance, including Salmonella pathogenicity islands (SPIs containing SPI-1 to SPI-5, SPI-13, and SPI-14), C63PI locus, ST104 prophage locus, and various antibiotic resistance genes. In accordance with these analysis results, ST1120 showed competence in invasion and survival abilities when it was added to host cells. It also exhibited robust resistance against antibiotics in comparison with other S. Typhimurium strains. This is the first report of the complete genome sequence of S. Typhimurium isolated from swine in Korea. Comparative genome analysis between ST1120 and other Salmonella strains would provide fruitful information toward understanding Salmonella host specificity and developing control measures against S. Typhimurium infection.

Comparative Analysis of Salmonella enterica subsp. enterica Serovar Thompson Isolates associated with Outbreaks Using PFGE and wgMLST

  • Youngho Koh;Yunyoung Bae;Min-Jung Lee;Yu-Si Lee;Dong-Hyun Kang;Soon Han Kim
    • Journal of Microbiology and Biotechnology
    • /
    • v.32 no.12
    • /
    • pp.1605-1614
    • /
    • 2022
  • The strains associated with foodborne Salmonella enterica Thompson outbreaks in Korea have not been identified. Therefore, we characterized S. Thompson strains isolated from chocolate cakes linked to foodborne outbreaks in Korea. A total of 56 strains were isolated from preserved cake products, products in the supply chain distribution, the manufacturer's apparatus, and egg white liquid products used for cream preparation. Subsequently, serological typing, pathogenic gene-targeted polymerase chain reaction (PCR), pulsed-field gel electrophoresis (PFGE), and whole-genome multi-locus sequence typing (wgMLST) were performed to characterize these isolates. The antigen formula of all isolates was 7:k:1,5, namely Salmonella enterica subsp. enterica Serovar Thompson. All 56 isolates harbored invA, his, hin, and stn, and were negative for sefA and spvC based on gene-targeted PCR analyses. Based on PFGE results, these isolates were classified into one group based on the same SP6X01.011 pattern with 100% similarity. We selected 19 strains based on the region and sample type, which were subjected to wgMLST. Although the examined strains showed 100% similarity, they were classified into seven clusters based on allelic differences. According to our findings, the cause of these outbreaks was chocolate cake manufactured with egg white liquid contaminated with the same Salmonella Thompson. Additionally, comparative analysis of wgMLST on domestic isolates of S. Thompson from the three outbreaks showed genetic similarities of over 99.6%. Based on the results, the PFGE and wgMLST combination can provide highly resolved phylogeny and reliable evidence during Salmonella outbreak investigations.

Comparative Genomic Analysis of Staphylococcus aureus FORC_001 and S. aureus MRSA252 Reveals the Characteristics of Antibiotic Resistance and Virulence Factors for Human Infection

  • Lim, Sooyeon;Lee, Dong-Hoon;Kwak, Woori;Shin, Hakdong;Ku, Hye-Jin;Lee, Jong-eun;Lee, Gun Eui;Kim, Heebal;Choi, Sang-Ho;Ryu, Sangryeol;Lee, Ju-Hoon
    • Journal of Microbiology and Biotechnology
    • /
    • v.25 no.1
    • /
    • pp.98-108
    • /
    • 2015
  • Staphylococcus aureus is an important foodborne pathogen that causes diverse diseases ranging from minor infections to life-threatening conditions in humans and animals. To further understand its pathogenesis, the genome of the strain S. aureus FORC_001 was isolated from a contaminated food. Its genome consists of 2,886,017 bp double-stranded DNA with a GC content of 32.8%. It is predicted to contain 2,728 open reading frames, 57 tRNAs, and 6 rRNA operons, including 1 additional 5S rRNA gene. Comparative phylogenetic tree analysis of 40 complete S. aureus genome sequences using average nucleotide identity (ANI) revealed that strain FORC_001 belonged to Group I. The closest phylogenetic match was S. aureus MRSA252, according to a whole-genome ANI (99.87%), suggesting that they might share a common ancestor. Comparative genome analysis of FORC_001 and MRSA252 revealed two non-homologous regions: Regions I and II. The presence of various antibiotic resistance genes, including the SCCmec cluster in Region I of MRSA252, suggests that this strain might have acquired the SCCmec cluster to adapt to specific environments containing methicillin. Region II of both genomes contains prophage regions but their DNA sequence identity is very low, suggesting that the prophages might differ. This is the first report of the complete genome sequence of S. aureus isolated from a real foodborne outbreak in South Korea. This report would be helpful to extend our understanding about the genome, general characteristics, and virulence factors of S. aureus for further studies of pathogenesis, rapid detection, and epidemiological investigation in foodborne outbreak.