• Title/Summary/Keyword: whole-genome sequencing

Search Result 244, Processing Time 0.029 seconds

Evaluation and Genome Mining of Bacillus stercoris Isolate B.PNR1 as Potential Agent for Fusarium Wilt Control and Growth Promotion of Tomato

  • Rattana Pengproh;Thanwanit Thanyasiriwat;Kusavadee Sangdee;Juthaporn Saengprajak;Praphat Kawicha;Aphidech Sangdee
    • The Plant Pathology Journal
    • /
    • v.39 no.5
    • /
    • pp.430-448
    • /
    • 2023
  • Recently, strategies for controlling Fusarium oxysporum f. sp. lycopersici (Fol), the causal agent of Fusarium wilt of tomato, focus on using effective biocontrol agents. In this study, an analysis of the biocontrol and plant growth promoting (PGP) attributes of 11 isolates of loamy soil Bacillus spp. has been conducted. Among them, the isolates B.PNR1 and B.PNR2 inhibited the mycelial growth of Fol by inducing abnormal fungal cell wall structures and cell wall collapse. Moreover, broad-spectrum activity against four other plant pathogenic fungi, F. oxysporum f. sp. cubense race 1 (Foc), Sclerotium rolfsii, Colletotrichum musae, and C. gloeosporioides were noted for these isolates. These two Bacillus isolates produced indole acetic acid, phosphate solubilization enzymes, and amylolytic and cellulolytic enzymes. In the pot experiment, the culture filtrate from B.PNR1 showed greater inhibition of the fungal pathogens and significantly promoted the growth of tomato plants more than those of the other treatments. Isolate B.PNR1, the best biocontrol and PGP, was identified as Bacillus stercoris by its 16S rRNA gene sequence and whole genome sequencing analysis (WGS). The WGS, through genome mining, confirmed that the B.PNR1 genome contained genes/gene cluster of a nonribosomal peptide synthetase/polyketide synthase, such as fengycin, surfactin, bacillaene, subtilosin A, bacilysin, and bacillibactin, which are involved in antagonistic and PGP activities. Therefore, our finding demonstrates the effectiveness of B. stercoris strain B.PNR1 as an antagonist and for plant growth promotion, highlighting the use of this microorganism as a biocontrol agent against the Fusarium wilt pathogen and PGP abilities in tomatoes.

Whole genome sequence analyses of thermotolerant Bacillus sp. isolates from food

  • Phornphan Sornchuer;Kritsakorn Saninjuk;Pholawat Tingpej
    • Genomics & Informatics
    • /
    • v.21 no.3
    • /
    • pp.35.1-35.12
    • /
    • 2023
  • The Bacillus cereus group, also known as B. cereus sensu lato (B. cereus s.l.), is composed of various Bacillus species, some of which can cause diarrheal or emetic food poisoning. Several emerging highly heat-resistant Bacillus species have been identified, these include B. thermoamylovorans, B. sporothermodurans, and B. cytotoxicus NVH 391-98. Herein, we performed whole genome analysis of two thermotolerant Bacillus sp. isolates, Bacillus sp. B48 and Bacillus sp. B140, from an omelet with acacia leaves and fried rice, respectively. Phylogenomic analysis suggested that Bacillus sp. B48 and Bacillus sp. B140 are closely related to B. cereus and B. thuringiensis, respectively. Whole genome alignment of Bacillus sp. B48, Bacillus sp. B140, mesophilic strain B. cereus ATCC14579, and thermophilic strain B. cytotoxicus NVH 391-98 using the Mauve program revealed the presence of numerous homologous regions including genes responsible for heat shock in the dnaK gene cluster. However, the presence of a DUF4253 domain-containing protein was observed only in the genome of B. cereus ATCC14579 while the intracellular protease PfpI family was present only in the chromosome of B. cytotoxicus NVH 391-98. In addition, prophage Clp protease-like proteins were found in the genomes of both Bacillus sp. B48 and Bacillus sp. B140 but not in the genome of B. cereus ATCC14579. The genomic profiles of Bacillus sp. isolates were identified by using whole genome analysis especially those relating to heat-responsive gene clusters. The findings presented in this study lay the foundations for subsequent studies to reveal further insights into the molecular mechanisms of Bacillus species in terms of heat resistance mechanisms.

Analysis of unmapped regions associated with long deletions in Korean whole genome sequences based on short read data

  • Lee, Yuna;Park, Kiejung;Koh, Insong
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.40.1-40.9
    • /
    • 2019
  • While studies aimed at detecting and analyzing indels or single nucleotide polymorphisms within human genomic sequences have been actively conducted, studies on detecting long insertions/deletions are not easy to orchestrate. For the last 10 years, the availability of long read data of human genomes from PacBio or Nanopore platforms has increased, which makes it easier to detect long insertions/deletions. However, because long read data have a critical disadvantage due to their relatively high cost, many next generation sequencing data are produced mainly by short read sequencing machines. Here, we constructed programs to detect so-called unmapped regions (UMRs, where no reads are mapped on the reference genome), scanned 40 Korean genomes to select UMR long deletion candidates, and compared the candidates with the long deletion break points within the genomes available from the 1000 Genomes Project (1KGP). An average of about 36,000 UMRs were found in the 40 Korean genomes tested, 284 UMRs were common across the 40 genomes, and a total of 37,943 UMRs were found. Compared with the 74,045 break points provided by the 1KGP, 30,698 UMRs overlapped. As the number of compared samples increased from 1 to 40, the number of UMRs that overlapped with the break points also increased. This eventually reached a peak of 80.9% of the total UMRs found in this study. As the total number of overlapped UMRs could probably grow to encompass 74,045 break points with the inclusion of more Korean genomes, this approach could be practically useful for studies on long deletions utilizing short read data.

Bridging Comparative Genomics and DNA Marker-aided Molecular Breeding

  • Choi, Hong-Kyu;Cook, Douglas R.
    • Korean Journal of Breeding Science
    • /
    • v.43 no.2
    • /
    • pp.103-114
    • /
    • 2011
  • In recent years, genomic resources and information have accumulated at an ever increasing pace, in many plant species, through whole genome sequencing, large scale analysis of transcriptomes, DNA markers and functional studies of individual genes. Well-characterized species within key plant taxa, co-called "model systems", have played a pivotal role in nucleating the accumulation of genomic information and databases, thereby providing the basis for comparative genomic studies. In addition, recent advances to "Next Generation" sequencing technologies have propelled a new wave of genomics, enabling rapid, low cost analysis of numerous genomes, and the accumulation of genetic diversity data for large numbers of accessions within individual species. The resulting wealth of genomic information provides an opportunity to discern evolutionary processes that have impacted genome structure and the function of genes, using the tools of comparative analysis. Comparative genomics provides a platform to translate information from model species to crops, and to relate knowledge of genome function among crop species. Ultimately, the resulting knowledge will accelerate the development of more efficient breeding strategies through the identification of trait-associated orthologous genes and next generation functional gene-based markers.

Whole genome re-sequencing and development of SSR markers in oriental melon (참외 전장유전체 염기서열 분석 및 SSR 마커 개발)

  • Song, Woon-Ho;Chung, Sang-Min
    • Journal of Plant Biotechnology
    • /
    • v.46 no.2
    • /
    • pp.71-78
    • /
    • 2019
  • The objective of this study was to use 'Danta PR', NGS (Next Generation Sequencing) technology for genome resequencing to develop polymorphic makers between Chinese oriental melon, 'Hyangseo 1' and Korean oriental melon. From the resequencing data that covered about 81 times of the genome size, 104,357 of SSR motifs and Indel, and 1,092,436 of SNPs were identified. 299 SSR and 307 Indel markers were chosen to cover each chromosome with 25 markers. These markers were subsequently used to identify genotypes of 'Danta PR' BC1 (F1 x 'Danta PR') population and a genetic linkage map was constructed. SSR, Indel, and SNPs identified in this study would be useful as a breeding tool to develop new oriental melon varieties.

Whole genome sequence of Staphylococcus aureus strain RMI-014804 isolated from pulmonary patient sputum via next-generation sequencing technology

  • Ayesha, Wisal;Asad Ullah;Waheed Anwar;Carlos M. Morel;Syed Shah Hassan
    • Genomics & Informatics
    • /
    • v.21 no.3
    • /
    • pp.34.1-34.10
    • /
    • 2023
  • Nosocomial infections, commonly referred to as healthcare-associated infections, are illnesses that patients get while hospitalized and are typically either not yet manifest or may develop. One of the most prevalent nosocomial diseases in hospitalized patients is pneumonia, among the leading causes of mortality and morbidity. Viral, bacterial, and fungal pathogens cause pneumonia. More severe introductions commonly included Staphylococcus aureus, which is at the top of bacterial infections, per World Health Organization reports. The staphylococci, S. aureus, strain RMI-014804, mesophile, on-sporulating, and non-motile bacterium, was isolated from the sputum of a pulmonary patient in Pakistan. Many characteristics of S. aureus strain RMI-014804 have been revealed in this paper, with complete genome sequence and annotation. Our findings indicate that the genome is a single circular 2.82 Mbp long genome with 1,962 protein-coding genes, 15 rRNA, 49 tRNA, 62 pseudogenes, and a GC content of 28.76%. As a result of this genome sequencing analysis, researchers will fully understand the genetic and molecular basis of the virulence of the S. aureus bacteria, which could help prevent the spread of nosocomial infections like pneumonia. Genome analysis of this strain was necessary to identify the specific genes and molecular mechanisms that contribute to its pathogenicity, antibiotic resistance, and genetic diversity, allowing for a more in-depth investigation of its pathogenesis to develop new treatments and preventive measures against infections caused by this bacterium.

Genome re-sequencing to identify single nucleotide polymorphism markers for muscle color traits in broiler chickens

  • Kong, H.R.;Anthony, N.B.;Rowland, K.C.;Khatri, B.;Kong, B.C.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.1
    • /
    • pp.13-18
    • /
    • 2018
  • Objective: Meat quality including muscle color in chickens is an important trait and continuous selective pressures for fast growth and high yield have negatively impacted this trait. This study was conducted to investigate genetic variations responsible for regulating muscle color. Methods: Whole genome re-sequencing analysis using Illumina HiSeq paired end read method was performed with pooled DNA samples isolated from two broiler chicken lines divergently selected for muscle color (high muscle color [HMC] and low muscle color [LMC]) along with their random bred control line (RAN). Sequencing read data was aligned to the chicken reference genome sequence for Red Jungle Fowl (Galgal4) using reference based genome alignment with NGen program of the Lasergene software package. The potential causal single nucleotide polymorphisms (SNPs) showing non-synonymous changes in coding DNA sequence regions were chosen in each line. Bioinformatic analyses to interpret functions of genes retaining SNPs were performed using the ingenuity pathways analysis (IPA). Results: Millions of SNPs were identified and totally 2,884 SNPs (1,307 for HMC and 1,577 for LMC) showing >75% SNP rates could induce non-synonymous mutations in amino acid sequences. Of those, SNPs showing over 10 read depths yielded 15 more reliable SNPs including 1 for HMC and 14 for LMC. The IPA analyses suggested that meat color in chickens appeared to be associated with chromosomal DNA stability, the functions of ubiquitylation (UBC) and quality and quantity of various subtypes of collagens. Conclusion: In this study, various potential genetic markers showing amino acid changes were identified in differential meat color lines, that can be used for further animal selection strategy.

Efficiency to Discovery Transgenic Loci in GM Rice Using Next Generation Sequencing Whole Genome Re-sequencing

  • Park, Doori;Kim, Dongin;Jang, Green;Lim, Jongsung;Shin, Yun-Ji;Kim, Jina;Seo, Mi-Seong;Park, Su-Hyun;Kim, Ju-Kon;Kwon, Tae-Ho;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.13 no.3
    • /
    • pp.81-85
    • /
    • 2015
  • Molecular characterization technology in genetically modified organisms, in addition to how transgenic biotechnologies are developed now require full transparency to assess the risk to living modified and non-modified organisms. Next generation sequencing (NGS) methodology is suggested as an effective means in genome characterization and detection of transgenic insertion locations. In the present study, we applied NGS to insert transgenic loci, specifically the epidermal growth factor (EGF) in genetically modified rice cells. A total of 29.3 Gb (${\sim}72{\times}coverage$) was sequenced with a $2{\times}150bp$ paired end method by Illumina HiSeq2500, which was consecutively mapped to the rice genome and T-vector sequence. The compatible pairs of reads were successfully mapped to 10 loci on the rice chromosome and vector sequences were validated to the insertion location by polymerase chain reaction (PCR) amplification. The EGF transgenic site was confirmed only on chromosome 4 by PCR. Results of this study demonstrated the success of NGS data to characterize the rice genome. Bioinformatics analyses must be developed in association with NGS data to identify highly accurate transgenic sites.

Metagenomic analysis of viral genes integrated in whole genome sequencing data of Thai patients with Brugada syndrome

  • Suwalak Chitcharoen;Chureerat Phokaew;John Mauleekoonphairoj;Apichai Khongphatthanayothin;Boosamas Sutjaporn;Pharawee Wandee;Yong Poovorawan;Koonlawee Nademanee;Sunchai Payungporn
    • Genomics & Informatics
    • /
    • v.20 no.4
    • /
    • pp.44.1-44.13
    • /
    • 2022
  • Brugada syndrome (BS) is an autosomal dominant inheritance cardiac arrhythmia disorder associated with sudden death in young adults. Thailand has the highest prevalence of BS worldwide, and over 60% of patients with BS still have unclear disease etiology. Here, we performed a new viral metagenome analysis pipeline called VIRIN and validated it with whole genome sequencing (WGS) data of HeLa cell lines and hepatocellular carcinoma. Then the VIRIN pipeline was applied to identify viral integration positions from unmapped WGS data of Thai males, including 100 BS patients (case) and 100 controls. Even though the sample preparation had no viral enrichment step, we can identify several virus genes from our analysis pipeline. The predominance of human endogenous retrovirus K (HERV-K) viruses was found in both cases and controls by blastn and blastx analysis. This study is the first report on the full-length HERV-K assembled genomes in the Thai population. Furthermore, the HERV-K integration breakpoint positions were validated and compared between the case and control datasets. Interestingly, Brugada cases contained HERV-K integration breakpoints at promoters five times more often than controls. Overall, the highlight of this study is the BS-specific HERV-K breakpoint positions that were found at the gene coding region "NBPF11" (n = 9), "NBPF12" (n = 8) and long non-coding RNA (lncRNA) "PCAT14" (n = 4) region. The genes and the lncRNA have been reported to be associated with congenital heart and arterial diseases. These findings provide another aspect of the BS etiology associated with viral genome integrations within the human genome.

Analysis of whole genome sequencing and virulence factors of Vibrio vulnificus 1908-10 isolated from sea water at Gadeok island coast

  • Hee-kyung Oh;Nameun Kim;Do-Hyung Kim;Hye-Young Shin;Eun-Woo Lee;Sung-Hwan Eom;Young-Mog Kim
    • Fisheries and Aquatic Sciences
    • /
    • v.26 no.9
    • /
    • pp.558-568
    • /
    • 2023
  • Vibrio vulnificus is an aquatic bacterium causing septicemia and wound infection in humans. To understand this pathogen at the genomic level, it was performed whole genome sequencing of a cefoxitin-resistant strain, V. vulnificus 1908-10 possessing virulence-related genes (vvhA, viuB, and vcgC) isolated from Gadeok island coastal seawater in South Korea. The genome of V. vulnificus 1908-10 consisted of two circular contigs and no plasmid. The total genome size was estimated to be 5,018,425 bp with a guanine-cytosine (GC) content of 46.9%. We found 119 tRNA and 34 rRNA genes respectively in the genome, along with 4,352 predicted protein sequences. Virulence factor (VF) analysis further revealed that V. vulnificus 1908-10 possess various virulence genes in classes of adherence, antiphagocytosis, chemotaxis and motility, iron uptake, quorum sensing, secretion system, and toxin. In the comparison of the presence/absence of virulence genes, V. vulnificus 1908-10 had fur, hlyU, luxS, ompU, pilA, pilF, rtxA, rtxC, and vvhA. Of the 30 V. vulnificus comparative strains, 80% of the C-genotype strains have all of these genes, whereas 40% of the E-genotype strains have all of them. In particular, pilA were identified in 80% of the C-type strains and 40% of the E-type strains, showing more difference than other genes. Therefore, V. vulnificus 1908-10 had similar VF characteristics to those of type C strains. Multifunctional-autoprocessing repeats-in-toxin (MARTX) toxin of V. vulnificus 1908-10 contained 8 A-type repeats (GXXGXXXXXG), 25 B.1-type repeats (TXVGXGXX), 18 B2-type repeats (GGXGXDXXX), and 7 C-type repeats (GGXGXDXXX). The National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST) showed that the RtxA protein of V. vulnificus 1908-10 had the effector domain in the order of cross-liking domain (ACD)-C58_PaToxP-like domain- α/β hydrolase-C58_PaToxP-like domain.