• Title/Summary/Keyword: Whole-genome sequencing

Search Result 252, Processing Time 0.024 seconds

Basic Concept of Gene Microarray (Gene Microarray의 기본개념)

  • Hwang, Seung Yong
    • Korean Journal of Biological Psychiatry
    • /
    • v.8 no.2
    • /
    • pp.203-207
    • /
    • 2001
  • The genome sequencing project has generated and will continue to generate enormous amounts of sequence data including 5 eukaryotic and about 60 prokaryotic genomes. Given this ever-increasing amounts of sequence information, new strategies are necessary to efficiently pursue the next phase of the genome project-the elucidation of gene expression patterns and gene product function on a whole genome scale. In order to assign functional information to the genome sequence, DNA chip(or gene microarray) technology was developed to efficiently identify the differential expression pattern of independent biological samples. DNA chip provides a new tool for genome expression analysis that may revolutionize many aspects of biotechnology including new drug discovery and disease diagnostics.

  • PDF

Genome-Wide Comparison of Carbohydrate-Active Enzymes (CAZymes) Repertoire of Flammulina ononidis

  • Park, Young-Jin;Kong, Won-Sik
    • Mycobiology
    • /
    • v.46 no.4
    • /
    • pp.349-360
    • /
    • 2018
  • Whole-genome sequencing of Flammulina ononidis, a wood-rotting basidiomycete, was performed to identify genes associated with carbohydrate-active enzymes (CAZymes). A total of 12,586 gene structures with an average length of 2009 bp were predicted by the AUGUSTUS tool from a total 35,524,258 bp length of de novo genome assembly (49.76% GC). Orthologous analysis with other fungal species revealed that 7051 groups contained at least one F. ononidis gene. In addition, 11,252 (89.5%) of 12,586 genes for F. ononidis proteins had orthologs among the Dikarya, and F. ononidis contained 8 species-specific genes, of which 5 genes were paralogous. CAZyme prediction revealed 524 CAZyme genes, including 228 for glycoside hydrolases, 21 for polysaccharide lyases, 87 for glycosyltransferases, 61 for carbohydrate esterases, 87 with auxiliary activities, and 40 for carbohydrate-binding modules in the F. ononidis genome. This genome information including CAZyme repertoire will be useful to understand lignocellulolytic machinery of this white rot fungus F. ononidis.

Complete Genome Sequence of Escherichia coli - Specific Phage KFS-EC1 Isolated from a Slaughterhouse

  • Su-Hyeon Kim;Mi-Kyung Park
    • Microbiology and Biotechnology Letters
    • /
    • v.51 no.4
    • /
    • pp.562-565
    • /
    • 2023
  • Escherichia coli-specific phage, KFS-EC1, was isolated and purified from a slaughterhouse. The complete genome of the phage was obtained using Illumina MiSeq platforms. Its assembled genome consisted of a single chromosome of 164,715 bp with a GC content of 40.5%. The phage genome contained 170 hypothetical and 101 functional ORFs, and exhibited orthologous average nucleotide identity values of >95% with other E. coli phages belonging to the family Straboviridae. Additionally, phylogenetic analysis revealed that KFS-EC1 was finally classified into the family Straboviridae of the genus Caudoviricetes. The genome has been deposited in GenBank under the accession number NC_055757.1.

Whole Genome Analysis of Human Papillomavirus Genotype 11 from Cervix, Larynx and Lung

  • Chansaenroj, Jira;Theamboonlers, Apiradee;Junyangdikul, Pairoj;Supiyaphan, Pakpoom;Poovorawan, Yong
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.6
    • /
    • pp.2619-2623
    • /
    • 2012
  • The prevalence of human papillomavirus genotypes differs in various target organs. HPV16 is the most prevalent genotype in the cervix while genotypes 6 and 11 are highly prevalent in skin and aero-digestive tract infections. In this study HPV11 positive specimens were selected from cervix, larynx and lung biopsy tissue to analyze the whole genome by PCR and direct sequencing. Five HPV11 whole genomes were characterized, consisting of two cervical specimens, two laryngeal specimens and one lung specimen. The results showed high homology of HPV11 in these organs. Phylogenetic analysis showed that all HPV11 derived from various organs belonged to the same lineage. Molecular characterization and functional studies can further our understanding of virulence, expression or transmission. Additional studies on functional protein expression at different organ sites will also contribute to our knowledge of HPV infection in various organs.

A novel mutation in GJC2 associated with hypomyelinating leukodystrophy type 2 disorder

  • Komachali, Sajad Rafiee;Sheikholeslami, Mozhgan;Salehi, Mansoor
    • Genomics & Informatics
    • /
    • v.20 no.2
    • /
    • pp.24.1-24.8
    • /
    • 2022
  • Hypomyelinating leukodystrophy type 2 (HLD2), is an inherited genetic disease of the central nervous system caused by recessive mutations in the gap junction protein gamma 2 (GJC2/GJA12). HLD2 is characterized by nystagmus, developmental delay, motor impairments, ataxia, severe speech problem, and hypomyelination in the brain. The GJC2 sequence encodes connexin 47 protein (Cx47). Connexins are a group of membrane proteins that oligomerize to construct gap junctions protein. In the present study, a novel missense mutation gene c.760G>A (p.Val254Met) was identified in a patient with HLD2 by performing whole exome sequencing. Following the discovery of the new mutation in the proband, we used Sanger sequencing to analyze his affected sibling and parents. Sanger sequencing verified homozygosity of the mutation in the proband and his affected sibling. The autosomal recessive inheritance pattern was confirmed since Sanger sequencing revealed both healthy parents were heterozygous for the mutation. PolyPhen2, SIFT, PROVEAN, and CADD were used to evaluate the function prediction scores of detected mutations. Cx47 is essential for oligodendrocyte function, including adequate myelination and myelin maintenance in humans. Novel mutation p.Val254Met is located in the second extracellular domain of Cx47, both extracellular loops are highly conserved and probably induce intramolecular disulfide interactions. This novel mutation in the Cx47 gene causes oligodendrocyte dysfunction and HLD2 disorder.

Complete genome sequence of Pediococcus acidilactici CACC 537 isolated from canine

  • Jung-Ae Kim;Hyun-Jun Jang;Dae-Hyuk Kim;Youn Kyoung Son;Yangseon Kim
    • Journal of Animal Science and Technology
    • /
    • v.65 no.5
    • /
    • pp.1105-1109
    • /
    • 2023
  • Pedi coccus acidilactici CACC 537 was isolated from canine feces and reported to have probiotic properties. We aimed to characterize the potential probiotic properties of this strain by functional genomic analysis. Complete genome sequencing of P. acidilactici CACC 537 was performed using a PacBio RSII and Illumina platform, and contained one circular chromosome (2.0 Mb) with a 42% G + C content. The sequences were annotation revealed 1,897 protein-coding sequences, 15 rRNAs, and 56 tRNAs. It was determined that P. acidilactici CACC 537 genome carries genes known to be involved in the immune system, defense mechanisms, restriction-modification (R-M), and the CRISPR system. CACC 537 was shown to be beneficial in preventing pathogen infection during the fermentation process, help host immunity, and maintain intestinal health. These results provide for a comprehensive understanding of P. acidilactici and the development of industrial probiotic feed additives that can help improve host immunity and intestinal health.

High Resolution Whole Genome Multilocus Sequence Typing (wgMLST) Schemes for Salmonella enterica Weltevreden Epidemiologic Investigations

  • Tadee, Pakpoom;Tadee, Phacharaporn;Hitchings, Matthew D.;Pascoe, Ben;Sheppard, Samuel K.;Patchanee, Prapas
    • Microbiology and Biotechnology Letters
    • /
    • v.46 no.2
    • /
    • pp.162-170
    • /
    • 2018
  • Non-typhoidal Salmonella is one of the main pathogens causing food-borne illness in humans, with up to 20% of cases resulting from consumption of pork products. Over the gastroenteritis signs, multidrug resistant Salmonella has arisen. In this study, pan-susceptible phenotypic strains of Salmonella enterica serotype Weltevreden recovered from pig production chain in Chiang Mai, Thailand during 2012-2014 were chosen for analysis. The aim of this study was to use whole genome sequencing (WGS) data with an emphasis on antimicrobial resistance gene investigation to assess their pathogenic potential and genetic diversity determination based on whole genome Multilocus Sequence Typing (wgMLST) to expand epidemiological knowledge and to provide additional guidance for disease control. Analyis using ResFinder 3.0 for WGS database tracing found that one of pan-susceptible phenotypic strain carried five classes of resistance genes: aminoglycoside, beta-lactam, phenicol, sulfonamide, and tetracycline associated genes. Twenty four and 36 loci differences were detected by core genome Multilocus Sequence Typing (cgMLST) and pan genome Multilocus Sequence Typing (pgMLST), respectively, in two matching strains (44/13 vs A543057 and A543056 vs 204/13) initially assigned by conventional MLST and Pulsed-field Gel Electrophoresis (PFGE). One hundread percent discriminant ability can be achieved using the wgMLST technique. WGS is currently the ultimate molecular technique for various in-depth studies. As the findings stated above, a new of "gold standard typing method era" for routine works in genome study is being set.

Genomic Analysis of Dairy Starter Culture Streptococcus thermophilus MTCC 5461

  • Prajapati, Jashbhai B.;Nathani, Neelam M.;Patel, Amrutlal K.;Senan, Suja;Joshi, Chaitanya G.
    • Journal of Microbiology and Biotechnology
    • /
    • v.23 no.4
    • /
    • pp.459-466
    • /
    • 2013
  • The lactic acid bacterium Streptococcus thermophilus is widely used as a starter culture for the production of dairy products. Whole-genome sequencing is expected to utilize the genetic basis behind the metabolic functioning of lactic acid bacterium (LAB), for development of their use in biotechnological and probiotic applications. We sequenced the whole genome of Streptococcus thermophilus MTCC 5461, the strain isolated from a curd source, by 454 GS-FLX titanium and Ion Torrent PGM. We performed comparative genome analysis using the local BLAST and RDP for 16S rDNA comparison and by the RAST server for functional comparison against the published genome sequence of Streptococcus thermophilus CNRZ 1066. The whole genome size of S. thermophilus MTCC 5461 is of 1.73Mb size with a GC content of 39.3%. Streptococcal virulence-related genes are either inactivated or absent in the strain. The genome possesses coding sequences for features important for a probiotic organism such as adhesion, acid tolerance, bacteriocin production, and lactose utilization, which was found to be conserved among the strains MTCC 5461 and CNRZ 1066. Biochemical analysis revealed the utilization of 17 sugars by the bacterium, where the presence of genes encoding enzymes involved in metabolism for 16 of these 17 sugars were confirmed in the genome. This study supports the facts that the strain MTCC 5461 is nonpathogenic and harbors essential features that can be exploited for its probiotic potential.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics
    • /
    • v.19 no.3
    • /
    • pp.33.1-33.10
    • /
    • 2021
  • Eucalyptus is one of the major plantation species with wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86), one individual each from E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. Average number of SSR loci identified was 95,513 and the density of SSRs per Mb was from 157.39 in EG9 to 155.08 in EC17. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR containing sequences. Selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels and transcriptional regulators were carried out. These variations will find their utility in genome-wide association studies as well as understanding of molecular mechanisms involved in key economic traits. The genomic resources generated in this study would provide an impetus to integrate genomics in marker-trait associations and breeding of tropical eucalypts.