• Title/Summary/Keyword: whole genome sequencing

Search Result 244, Processing Time 0.034 seconds

An Optimized Strategy for Genome Assembly of Sanger/pyrosequencing Hybrid Data using Available Software

  • Jeong, Hae-Young;Kim, Ji-Hyun F.
    • Genomics & Informatics
    • /
    • v.6 no.2
    • /
    • pp.87-90
    • /
    • 2008
  • During the last four years, the pyrosequencing-based 454 platform has rapidly displaced the traditional Sanger sequencing method due to its high throughput and cost effectiveness. Meanwhile, the Sanger sequencing methodology still provides the longest reads, and paired-end sequencing that is based on that chemistry offers an opportunity to ensure accurate assembly results. In this report, we describe an optimized approach for hybrid de novo genome assembly using pyrosequencing data and varying amounts of Sanger-type reads. 454 platform-derived contigs can be used as single non-breakable virtual reads or converted to simpler contigs that consist of editable, overlapping pseudoreads. These modified contigs maintain their integrity at the first jumpstarting assembly stage and are edited by fragmenting and rejoining. Pre-existing assembly software then can be applied for mixed assembly with 454-derived data and Sanger reads. An effective method for identifying genomic differences between reference and sample sequences in whole-genome resequencing procedures also is suggested.

Exome and genome sequencing for diagnosing patients with suspected rare genetic disease

  • Go Hun Seo;Hane Lee
    • Journal of Genetic Medicine
    • /
    • v.20 no.2
    • /
    • pp.31-38
    • /
    • 2023
  • Rare diseases, even though defined as fewer than 20,000 in South Korea, with over 8,000 rare Mendelian disorders having been identified, they collectively impact 6-8% of the global population. Many of the rare diseases pose significant challenges to patients, patients' families, and the healthcare system. The diagnostic journey for rare disease patients is often lengthy and arduous, hampered by the genetic diversity and phenotypic complexity of these conditions. With the advent of next-generation sequencing technology and clinical implementation of exome sequencing (ES) and genome sequencing (GS), the diagnostic rate for rare diseases is 25-50% depending on the disease category. It is also allowing more rapid new gene-disease association discovery and equipping us to practice precision medicine by offering tailored medical management plans, early intervention, family planning options. However, a substantial number of patients remain undiagnosed, and it could be due to several factors. Some may not have genetic disorders. Some may have disease-causing variants that are not detectable or interpretable by ES and GS. It's also possible that some patient might have a disease-causing variant in a gene that hasn't yet been linked to a disease. For patients who remain undiagnosed, reanalysis of existing data has shown promises in providing new molecular diagnoses achieved by new gene-disease associations, new variant discovery, and variant reclassification, leading to a 5-10% increase in the diagnostic rate. More advanced approach such as long-read sequencing, transcriptome sequencing and integration of multi-omics data may provide potential values in uncovering elusive genetic causes.

Whole Genome Sequencing and Gene Prediction of Cynodon transvaalensis

  • Sol Ji Lee;Chang soo Kim
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.237-237
    • /
    • 2022
  • Cynodon transvaalensis belongs to the warm-season grasses and is one of the economically and ecologically important crops. Cynodon species with high heterozygosity are difficult to assemble, so genome research has not been actively conducted. In this study, hybrid assembly was performed by sequencing with Illumina and PacBio. As a result of the assembly, the number of scaffolds and the length of N50 were 1,392, 928 kb, respectively. The completeness of the assembly was confirmed by BSUCO at 98.3%. In addition, as a result of estimating the size of the assembled genome by K-mer analysis (k=25), it was approximately ~413 Mb. A total of 37,060 cds sequences were annotated in the assembled genome, and their functions were identified through blast. After that, we try to complete the assembled genome into a pseudochromosome-level genome through Hi-C technology. These results will not only help to understand the complex genome composition of african bermudagrass, but also provide a resource for genomic and evolutionary studies of grass and other plant species.

  • PDF

Whole-exome sequencing analysis in a case of primary congenital glaucoma due to the partial uniparental isodisomy

  • Zavarzadeh, Parisima Ghaffarian;Bonyadi, Morteza;Abedi, Zahra
    • Genomics & Informatics
    • /
    • v.20 no.3
    • /
    • pp.28.1-28.7
    • /
    • 2022
  • We described a clinical, laboratory, and genetic presentation of a pathogenic variant of the CYP1B1 gene through a report of a case of primary congenital glaucoma and a trio analysis of this candidate variant in the family with the Sanger sequencing method and eventually completed our study with the secondary/incidental findings. This study reports a rare case of primary congenital glaucoma, an 8-year-old female child with a negative family history of glaucoma and uncontrolled intraocular pressure. This case's whole-exome sequencing data analysis presents a homozygous pathogenic single nucleotide variant in the CYP1B1 gene (NM_000104:exon3:c.G1103A:p.R368H). At the same time, this pathogenic variant was obtained as a heterozygous state in her unaffected father but not her mother. The diagnosis was made based on molecular findings of whole-exome sequencing data analysis. Therefore, the clinical reports and bioinformatics findings supported the relation between the candidate pathogenic variant and the disease. However, it should not be forgotten that primary congenital glaucoma is not peculiar to the CYP1B1 gene. Since the chance of developing autosomal recessive disorders with low allele frequency and unrelated parents is extraordinary in offspring. However, further data analysis of whole-exome sequencing and Sanger sequencing method were applied to obtain the type of mutation and how it was carried to the offspring.

Whole Mitochondrial Genome Sequence of an Indian Plasmodium falciparum Field Isolate

  • Tyagi, Suchi;Pande, Veena;Das, Aparup
    • Parasites, Hosts and Diseases
    • /
    • v.52 no.1
    • /
    • pp.99-103
    • /
    • 2014
  • Mitochondrial genome sequence of malaria parasites has served as a potential marker for inferring evolutionary history of the Plasmodium genus. In Plasmodium falciparum, the mitochondrial genome sequences from around the globe have provided important evolutionary understanding, but no Indian sequence has yet been utilized. We have sequenced the whole mitochondrial genome of a single P. falciparum field isolate from India using novel primers and compared with the 3D7 reference sequence and 1 previously reported Indian sequence. While the 2 Indian sequences were highly divergent from each other, the presently sequenced isolate was highly similar to the reference 3D7 strain.

Application of Whole Exome Sequencing to Identify Disease-Causing Variants in Inherited Human Diseases

  • Goh, Gerald;Choi, Murim
    • Genomics & Informatics
    • /
    • v.10 no.4
    • /
    • pp.214-219
    • /
    • 2012
  • The recent advent of next-generation sequencing technologies has dramatically changed the nature of biomedical research. Human genetics is no exception-it has never been easier to interrogate human patient genomes at the nucleotide level to identify disease-associated variants. To further facilitate the efficiency of this approach, whole exome sequencing (WES) was first developed in 2009. Over the past three years, multiple groups have demonstrated the power of WES through robust disease-associated variant discoveries across a diverse spectrum of human diseases. Here, we review the application of WES to different types of inherited human diseases and discuss analytical challenges and possible solutions, with the aim of providing a practical guide for the effective use of this technology.

Analysis of genome variants in dwarf soybean lines obtained in F6 derived from cross of normal parents (cultivated and wild soybean)

  • Roy, Neha Samir;Ban, Yong-Wook;Yoo, Hana;Ramekar, Rahul Vasudeo;Cheong, Eun Ju;Park, Nam-Il;Na, Jong Kuk;Park, Kyong-Cheul;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • v.19 no.2
    • /
    • pp.19.1-19.9
    • /
    • 2021
  • Plant height is an important component of plant architecture and significantly affects crop breeding practices and yield. We studied DNA variations derived from F5 recombinant inbred lines (RILs) with 96.8% homozygous genotypes. Here, we report DNA variations between the normal and dwarf members of four lines harvested from a single seed parent in an F6 RIL population derived from a cross between Glycine max var. Peking and Glycine soja IT182936. Whole genome sequencing was carried out, and the DNA variations in the whole genome were compared between the normal and dwarf samples. We found a large number of DNA variations in both the dwarf and semi-dwarf lines, with one single nucleotide polymorphism (SNP) per at least 3.68 kb in the dwarf lines and 1 SNP per 11.13 kb of the whole genome. This value is 2.18 times higher than the expected DNA variation in the F6 population. A total of 186 SNPs and 241 SNPs were discovered in the coding regions of the dwarf lines 1282 and 1303, respectively, and we discovered 33 homogeneous nonsynonymous SNPs that occurred at the same loci in each set of dwarf and normal soybean. Of them, five SNPs were in the same positions between lines 1282 and 1303. Our results provide important information for improving our understanding of the genetics of soybean plant height and crop breeding. These polymorphisms could be useful genetic resources for plant breeders, geneticists, and biologists for future molecular biology and breeding projects.

Development and Application of High-density SNP Arrays in Genomic Studies of Domestic Animals

  • Fan, Bin;Du, Zhi-Qiang;Gorbach, Danielle M.;Rothschild, Max F.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.23 no.7
    • /
    • pp.833-847
    • /
    • 2010
  • In the past decade, there have been many advances in whole-genome sequencing in domestic animals, as well as the development of "next-generation" sequencing technologies and high-throughput genotyping platforms. Consequently, these advances have led to the creation of the high-density SNP array as a state-of-the-art tool for genetics and genomics analyses of domestic animals. The emergence and utilization of SNP arrays will have significant impacts not only on the scale, speed, and expense of SNP genotyping, but also on theoretical and applied studies of quantitative genetics, population genetics and molecular evolution. The most promising applications in agriculture could be genome-wide association studies (GWAS) and genomic selection for the improvement of economically important traits. However, some challenges still face these applications, such as incorporating linkage disequilibrium (LD) information from HapMap projects, data storage, and especially appropriate statistical analyses on the high-dimensional, structured genomics data. More efforts are still needed to make better use of the high-density SNP arrays in both academic studies and industrial applications.

Challenges of Genome Wide Sequencing Technologies in Prenatal Medicine (산전 진단에서의 염기 서열 분석 방법의 의의)

  • Kang, Ji-Un
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.2
    • /
    • pp.762-769
    • /
    • 2022
  • Genetic testing in prenatal diagnosis is a precious tool providing valuable information in clinical management and parental decision-making. For the last year, cytogenetic testing methods, such as G-banding karyotype analysis, fluorescent in situ hybridization, chromosomal microarray, and gene panels have evolved to become part of routine laboratory testing. However, the limitations of each of these methods demonstrate the need for a revolutionary technology that can alleviate the need for multiple technologies. The recent introduction of new genomic technologies based on next-generation sequencing has changed the current practice of prenatal testing. The promise of these innovations lies in the fast and cost-effective generation of genome-scale sequence data with exquisite resolution and accuracy for prenatal diagnosis. Here, we review the current state of sequencing-based pediatric diagnostics, associated challenges, as well as future prospects.

Identification of genomic diversity and selection signatures in Luxi cattle using whole-genome sequencing data

  • Mingyue Hu;Lulu Shi;Wenfeng Yi;Feng Li;Shouqing Yan
    • Animal Bioscience
    • /
    • v.37 no.3
    • /
    • pp.461-470
    • /
    • 2024
  • Objective: The objective of this study was to investigate the genetic diversity, population structure and whole-genome selection signatures of Luxi cattle to reveal its genomic characteristics in terms of meat and carcass traits, skeletal muscle development, body size, and other traits. Methods: To further analyze the genomic characteristics of Luxi cattle, this study sequenced the whole-genome of 16 individuals from the core conservation farm in Shandong region, and collected 174 published genomes of cattle for conjoint analysis. Furthermore, three different statistics (pi, Fst, and XP-EHH) were used to detect potential positive selection signatures related to selection in Luxi cattle. Moreover, gene ontology and Kyoto encyclopedia of genes and genomes pathway enrichment analyses were performed to reveal the potential biological function of candidate genes harbored in selected regions. Results: The results showed that Luxi cattle had high genomic diversity and low inbreeding levels. Using three complementary methods (pi, Fst, and XP-EHH) to detect the signatures of selection in the Luxi cattle genome, there were 2,941, 2,221 and 1,304 potentially selected genes identified, respectively. Furthermore, there were 45 genes annotated in common overlapping genomic regions covered 0.723 Mb, including PLAG1 zinc finger (PLAG1), dedicator of cytokinesis 3 (DOCK3), ephrin A2 (EFNA2), DAZ associated protein 1 (DAZAP1), Ral GTPase activating protein catalytic subunit alpha 1 (RALGAPA1), mediator complex subunit 13 (MED13), and decaprenyl diphosphate synthase subunit 2 (PDSS2), most of which were enriched in pathways related to muscle growth and differentiation and immunity. Conclusion: In this study, we provided a series of genes associated with important economic traits were found in positive selection regions, and a scientific basis for the scientific conservation and genetic improvement of Luxi cattle.