• Title/Summary/Keyword: Whole Genome Sequencing

Search Result 267, Processing Time 0.025 seconds

Complete genome sequence of Pediococcus acidilactici CACC 537 isolated from canine

  • Jung-Ae Kim;Hyun-Jun Jang;Dae-Hyuk Kim;Youn Kyoung Son;Yangseon Kim
    • Journal of Animal Science and Technology
    • /
    • v.65 no.5
    • /
    • pp.1105-1109
    • /
    • 2023
  • Pedi coccus acidilactici CACC 537 was isolated from canine feces and reported to have probiotic properties. We aimed to characterize the potential probiotic properties of this strain by functional genomic analysis. Complete genome sequencing of P. acidilactici CACC 537 was performed using a PacBio RSII and Illumina platform, and contained one circular chromosome (2.0 Mb) with a 42% G + C content. The sequences were annotation revealed 1,897 protein-coding sequences, 15 rRNAs, and 56 tRNAs. It was determined that P. acidilactici CACC 537 genome carries genes known to be involved in the immune system, defense mechanisms, restriction-modification (R-M), and the CRISPR system. CACC 537 was shown to be beneficial in preventing pathogen infection during the fermentation process, help host immunity, and maintain intestinal health. These results provide for a comprehensive understanding of P. acidilactici and the development of industrial probiotic feed additives that can help improve host immunity and intestinal health.

High Resolution Whole Genome Multilocus Sequence Typing (wgMLST) Schemes for Salmonella enterica Weltevreden Epidemiologic Investigations

  • Tadee, Pakpoom;Tadee, Phacharaporn;Hitchings, Matthew D.;Pascoe, Ben;Sheppard, Samuel K.;Patchanee, Prapas
    • Microbiology and Biotechnology Letters
    • /
    • v.46 no.2
    • /
    • pp.162-170
    • /
    • 2018
  • Non-typhoidal Salmonella is one of the main pathogens causing food-borne illness in humans, with up to 20% of cases resulting from consumption of pork products. Over the gastroenteritis signs, multidrug resistant Salmonella has arisen. In this study, pan-susceptible phenotypic strains of Salmonella enterica serotype Weltevreden recovered from pig production chain in Chiang Mai, Thailand during 2012-2014 were chosen for analysis. The aim of this study was to use whole genome sequencing (WGS) data with an emphasis on antimicrobial resistance gene investigation to assess their pathogenic potential and genetic diversity determination based on whole genome Multilocus Sequence Typing (wgMLST) to expand epidemiological knowledge and to provide additional guidance for disease control. Analyis using ResFinder 3.0 for WGS database tracing found that one of pan-susceptible phenotypic strain carried five classes of resistance genes: aminoglycoside, beta-lactam, phenicol, sulfonamide, and tetracycline associated genes. Twenty four and 36 loci differences were detected by core genome Multilocus Sequence Typing (cgMLST) and pan genome Multilocus Sequence Typing (pgMLST), respectively, in two matching strains (44/13 vs A543057 and A543056 vs 204/13) initially assigned by conventional MLST and Pulsed-field Gel Electrophoresis (PFGE). One hundread percent discriminant ability can be achieved using the wgMLST technique. WGS is currently the ultimate molecular technique for various in-depth studies. As the findings stated above, a new of "gold standard typing method era" for routine works in genome study is being set.

Genomic Analysis of Dairy Starter Culture Streptococcus thermophilus MTCC 5461

  • Prajapati, Jashbhai B.;Nathani, Neelam M.;Patel, Amrutlal K.;Senan, Suja;Joshi, Chaitanya G.
    • Journal of Microbiology and Biotechnology
    • /
    • v.23 no.4
    • /
    • pp.459-466
    • /
    • 2013
  • The lactic acid bacterium Streptococcus thermophilus is widely used as a starter culture for the production of dairy products. Whole-genome sequencing is expected to utilize the genetic basis behind the metabolic functioning of lactic acid bacterium (LAB), for development of their use in biotechnological and probiotic applications. We sequenced the whole genome of Streptococcus thermophilus MTCC 5461, the strain isolated from a curd source, by 454 GS-FLX titanium and Ion Torrent PGM. We performed comparative genome analysis using the local BLAST and RDP for 16S rDNA comparison and by the RAST server for functional comparison against the published genome sequence of Streptococcus thermophilus CNRZ 1066. The whole genome size of S. thermophilus MTCC 5461 is of 1.73Mb size with a GC content of 39.3%. Streptococcal virulence-related genes are either inactivated or absent in the strain. The genome possesses coding sequences for features important for a probiotic organism such as adhesion, acid tolerance, bacteriocin production, and lactose utilization, which was found to be conserved among the strains MTCC 5461 and CNRZ 1066. Biochemical analysis revealed the utilization of 17 sugars by the bacterium, where the presence of genes encoding enzymes involved in metabolism for 16 of these 17 sugars were confirmed in the genome. This study supports the facts that the strain MTCC 5461 is nonpathogenic and harbors essential features that can be exploited for its probiotic potential.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics
    • /
    • v.19 no.3
    • /
    • pp.33.1-33.10
    • /
    • 2021
  • Eucalyptus is one of the major plantation species with wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86), one individual each from E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. Average number of SSR loci identified was 95,513 and the density of SSRs per Mb was from 157.39 in EG9 to 155.08 in EC17. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR containing sequences. Selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels and transcriptional regulators were carried out. These variations will find their utility in genome-wide association studies as well as understanding of molecular mechanisms involved in key economic traits. The genomic resources generated in this study would provide an impetus to integrate genomics in marker-trait associations and breeding of tropical eucalypts.

Two novel mutations in ALDH18A1 and SPG11 genes found by whole-exome sequencing in spastic paraplegia disease patients in Iran

  • Komachali, Sajad Rafiee;Siahpoosh, Zakieh;Salehi, Mansoor
    • Genomics & Informatics
    • /
    • v.20 no.3
    • /
    • pp.30.1-30.9
    • /
    • 2022
  • Hereditary spastic paraplegia is a not common inherited neurological disorder with heterogeneous clinical expressions. ALDH18A1 (located on 10q24.1) gene-related spastic paraplegias (SPG9A and SPG9B) are rare metabolic disorders caused by dominant and recessive mutations that have been found recently. Autosomal recessive hereditary spastic paraplegia is a common and clinical type of familial spastic paraplegia linked to the SPG11 locus (locates on 15q21.1). There are different symptoms of spastic paraplegia, such as muscle atrophy, moderate mental retardation, short stature, balance problem, and lower limb weakness. Our first proband involves a 45 years old man and our second proband involves a 20 years old woman both are affected by spastic paraplegia disease. Genomic DNA was extracted from the peripheral blood of the patients, their parents, and their siblings using a filter-based methodology and quantified and used for molecular analysis and sequencing. Sequencing libraries were generated using Agilent SureSelect Human All ExonV7 kit, and the qualified libraries are fed into NovaSeq 6000 Illumina sequencers. Sanger sequencing was performed by an ABI prism 3730 sequencer. Here, for the first time, we report two cases, the first one which contains likely pathogenic NM_002860: c.475C>T: p.R159X mutation of the ALDH18A1 and the second one has likely pathogenic NM_001160227.2: c.5454dupA: p.Glu1819Argfs Ter11 mutation of the SPG11 gene and also was identified by the whole-exome sequencing and confirmed by Sanger sequencing. Our aim with this study was to confirm that these two novel variants are direct causes of spastic paraplegia.

Whole Genome Sequencing of Two Musa Species Towards Disease Resistance and Fiber Quality Improvement

  • John Ivan Pasquil;Richellen Plaza;Roneil Christian Alonday;Damsel Bangcal;Julianne Villela;Antonio, Lalusin;Maria Genaleen Diaz;Antonio Laurena
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.32-32
    • /
    • 2022
  • Abaca (Musa textilis L. Nee) is a native Musa species from the Philippines known for its natural fiber. Abaca fiber a.k.a. Manila hemp extracted from its pseudostems is considered one of the strongest fibers in the world. This is used for commodities such as ropes, papers, and money bills. Abaca is vulnerable to pests and diseases such as the Abaca Bunchy Top Disease (ABTD) caused by Abaca Bunchy Top Virus (ABTV) and Banana Bunchy Top Virus (BBTV). Inosa, one of the varieties of abaca utilized in the Philippines, is highly susceptible to ABTD. In contrast, Pacol (Musa balbisiana L.), a close relative of abaca, is highly resistant to the same disease. Here, we report the sequencing and de novo genome assembly of both abaca var. Inosa and banana var. Pacol. A total of ~16 Gb and ~21 Gb raw reads for Inosa and Pacol, respectively, were generated using Pacbio Hifi sequencing method and assembled with Hifiasm. High-quality de novo assemblies of both Musa species with 99% recovered as per BUSCO analysis were obtained. The assembled Inosa genome has a total length of ~654 Mb and N50 of 7 Mb while Pacol has a total length of 527 Mb and N50 of 3 Mb which are close to their estimated genome size of ~638 Mb and ~503 Mb, respectively. The information that can be derived from the de novo assembled genomes would provide a solid foundation for further research in disease resistance and fiber quality improvement in abaca.

  • PDF

Comparison of the copy-neutral loss of heterozygosity identified from whole-exome sequencing data using three different tools

  • Lee, Gang-Taik;Chung, Yeun-Jun
    • Genomics & Informatics
    • /
    • v.20 no.1
    • /
    • pp.4.1-4.8
    • /
    • 2022
  • Loss of heterozygosity (LOH) is a genomic aberration. In some cases, LOH can be generated without changing the copy number, which is called copy-neutral LOH (CN-LOH). CN-LOH frequently occurs in various human diseases, including cancer. However, the biological and clinical implications of CN-LOH for human diseases have not been well studied. In this study, we compared the performance of CN-LOH determination using three commonly used tools. For an objective comparison, we analyzed CN-LOH profiles from single-nucleotide polymorphism array data from 10 colon adenocarcinoma patients, which were used as the reference for comparison with the CN-LOHs obtained through whole-exome sequencing (WES) data of the same patients using three different analysis tools (FACETS, Nexus, and Sequenza). The majority of the CN-LOHs identified from the WES data were consistent with the reference data. However, some of the CN-LOHs identified from the WES data were not consistent between the three tools, and the consistency with the reference CN-LOH profile was also different. The Jaccard index of the CN-LOHs using FACETS (0.84 ± 0.29; mean value, 0.73) was significantly higher than that of Nexus (0.55 ± 0.29; mean value, 0.50; p = 0.02) or Sequenza (0 ± 0.41; mean value, 0.34; p = 0.04). FACETS showed the highest area under the curve value. Taken together, of the three CN-LOH analysis tools, FACETS showed the best performance in identifying CN-LOHs from The Cancer Genome Atlas colon adenocarcinoma WES data. Our results will be helpful in exploring the biological or clinical implications of CN-LOH for human diseases.

Development of an Economic-trait Genetic Marker by Applying Next-generation Sequencing Technologies in a Whole Genome (NGS 기법을 활용한 전장게놈에서의 경제형질 관련 유전자 마커 발굴)

  • Gim, Jeong-An;Kim, Heui-Soo
    • Journal of Life Science
    • /
    • v.24 no.11
    • /
    • pp.1258-1267
    • /
    • 2014
  • Developing economic traits with a high growth rate, robustness, and disease resistance in livestock is an important challenge. RFLP and AFLP are the classical methods used to develop economic traits. Whole-genome-based economic traits have recently been detected with the advent of next-generation sequencing (NGS) technologies. However, NGS technologies are rather costly for use in studies, and RNA-seq, RAD-Seq, RRL, MSG, and GBS have been used to overcome the issue of high costs. In this study, recent NGS-based studies were reviewed, particularly those that focused on minimum costs and maximum effects. Then, we presented further prospects on how to apply for selection of high economic-trait livestock.

Genome Organization of Temperate Phage 11143 from Emetic Bacillus cereus NCTC11143

  • Lee, Young-Duck;Park, Jong-Hyun
    • Journal of Microbiology and Biotechnology
    • /
    • v.22 no.5
    • /
    • pp.649-653
    • /
    • 2012
  • A temperate phage was isolated from emetic Bacillus cereus NCTC 11143 by mitomycin C and characterized by transmission electron microscopy and DNA and protein analyses. Whole genome sequencing of Bacillus phage 11143 was performed by GS-FLX. The phage has a dsDNA genome of 39,077 bp and a 35% G+C content. Bioinformatic analysis of the phage genome revealed 49 putative ORFs involved in replication, morphogenesis, DNA packaging, lysogeny, and host lysis. Bacillus phage 11143 could be classified as a member of the Siphoviridae family by morphology and genome structure. Genomic comparisons at the DNA and protein levels revealed homologous genetic modules with patterns and morphogenesis proteins similar to those of other Bacillus phages. Thus, Bacillus phages might have a mosaic genetic relationship.

Identification of Novel Functional Variants of SIN3A and SRSF1 among Somatic Variants in Acute Myeloid Leukemia Patients

  • Min, Jae-Woong;Koh, Youngil;Kim, Dae-Yoon;Kim, Hyung-Lae;Han, Jeong A;Jung, Yu-Jin;Yoon, Sung-Soo;Choi, Sun Shim
    • Molecules and Cells
    • /
    • v.41 no.5
    • /
    • pp.465-475
    • /
    • 2018
  • The advent of massively parallel sequencing, also called next-generation sequencing (NGS), has dramatically influenced cancer genomics by accelerating the identification of novel molecular alterations. Using a whole genome sequencing (WGS) approach, we identified somatic coding and noncoding variants that may contribute to leukemogenesis in 11 adult Korean acute myeloid leukemia (AML) patients, with serial tumor samples (primary and relapse) available for 5 of them; somatic variants were identified in 187 AML-related genes, including both novel (SIN3A, C10orf53, PTPRR, and RERGL) and well-known (NPM1, RUNX1, and CEPBA) AML-related genes. Notably, SIN3A expression shows prognostic value in AML. A newly designed method, referred to as "hot-zone" analysis, detected two putative functional noncoding variants that can alter transcription factor binding affinity near PPP1R10 and SRSF1. Moreover, the functional importance of the SRSF1 noncoding variant was further investigated by luciferase assays, which showed that the variant is critical for the regulation of gene expression leading to leukemogenesis. We expect that further functional investigation of these coding and noncoding variants will contribute to a more in-depth understanding of the underlying molecular mechanisms of AML and the development of targeted anti-cancer drugs.