• Title/Summary/Keyword: whole genome resequencing

Development of InDel markers to identify Capsicum disease resistance using whole genome resequencing

  • Karna, Sandeep;Ahn, Yul-Kyun
    • Journal of Plant Biotechnology / v.45 no.3 / pp.228-235 / 2018
  • In this study, two pepper varieties, PRH1 (powdery mildew resistance line) and Saengryeg (powdery mildew resistance line), were resequenced using next-generation sequencing technology in order to develop InDel markers. The genome-wide discovery of InDel variation was performed by comparing the whole-genome resequencing data of the two pepper varieties to the Capsicum annuum cv. CM334 reference genome. A total of 334,236 and 318,256 InDels were identified in PRH1 and Saengryeg, respectively. The greatest number of homozygous InDels was discovered on chromosome 1 in PRH1 (24,954) and on chromosome 10 in Saengryeg (29,552). Among these homozygous InDels, 19,094 and 4,885 InDels were distributed in the genic regions of PRH1 and Saengryeg, respectively, and 198,570 and 183,468 InDels were distributed in the intergenic regions. We identified 197,821 polymorphic InDels between PRH1 and Saengryeg. A total of 11,697 primer sets were generated, resulting in the discovery of four polymorphic InDel markers. These new markers will be used to identify disease resistance genotypes in breeding populations. Our results therefore advance whole genome resequencing and add genetic resource datasets for pepper breeding research.

Identification of the Marker Genes Related With Chronic Mitral Valve Disease in Dogs

  • Yoon, Byung-Gook;Lee, Dong-Soo;Seo, Kyoung-Won;Song, Kun-Ho
    • Journal of Veterinary Clinics / v.36 no.4 / pp.190-195 / 2019
  • We aimed to identify genomic variations as well as the marker genes related to chronic mitral valve disease (CMVD) in Canis lupus familiaris using whole genome resequencing, which provides valuable resources for further study. Two ten-year-old female English cocker spaniels (Canis lupus familiaris) were used for this study: one control and one that had been diagnosed with CMVD. For the whole genome resequencing, muscle samples from the left ventricular wall were collected from each dog. Whole genome resequencing was performed with the HiSeq DNA Shotgun library and the HiSeq™ 2000 platform. From the results, we identified 5 million and 6 million variants in gene expression in the control and the CMVD-diagnosed subject, respectively. We then selected the top 1,000 genes for each of the SNP, INS, and DEL mutation types; 675 of these genes overlapped across every mutation type between the control and the CMVD-diagnosed patient. Interestingly, in both groups, the intron variant (91.16 and 91.18%) and the upstream variant (3.10 and 3.08%) were the most common variant types. Among the 675 overlapping genes, the gene ontology term for intracellular signal transduction was highly represented in INS, DEL, and SNPs (35, 33, and 31, respectively). In this study, we found that the COL and CDH gene families could be key molecules in identifying the difference in gene expression between control and CMVD-diagnosed dogs. We believe further studies will prove the importance of variants in key molecule expression and that these data will serve as a valuable foundation for the study of canine CMVD.

Whole Genome Resequencing of Heugu (Korean Black Cattle) for the Genome-Wide SNP Discovery

  • Choi, Jung-Woo;Chung, Won-Hyong;Lee, Kyung-Tai;Choi, Jae-Won;Jung, Kyoung-Sub;Cho, Yongmin;Kim, Namshin;Kim, Tae-Hun
    • Food Science of Animal Resources / v.33 no.6 / pp.715-722 / 2013
  • Heugu (Korean Black Cattle) is one of the indigenous cattle breeds in Korea; however, there has been a severe lack of genomic studies on the breed. In this study, we report the first whole genome resequencing of Heugu at higher sequence coverage using the Illumina HiSeq 2000 platform. More than 153.6 gigabase pairs of sequence were obtained, of which 97% of the reads were mapped to the bovine reference sequence assembly (UMD 3.1). The number of non-redundantly mapped sequence reads corresponds to approximately 28.9-fold coverage across the genome. From these data, we identified a total of over six million single nucleotide polymorphisms (SNPs), of which 29.4% were found to be novel using the single nucleotide polymorphism database build 137. Extensive annotation was performed on all the detected SNPs, showing that most SNPs were located in intergenic regions (70.7%), which corresponds well with previous studies. Of the total SNPs, we identified substantial numbers of non-synonymous SNPs (13,979) in 5,999 genes, which could potentially affect meat quality traits in cattle. These results provide genome-wide SNPs that can serve as useful genetic tools and as candidates in searches for phenotype-altering DNA differences implicated in meat quality traits in cattle. The importance of this study is further underscored by its being the first whole genome sequencing of this valuable local genetic resource, which can be used in further genomic comparison studies with diverse cattle breeds.

Whole genome re-sequencing and development of SSR markers in oriental melon

  • Song, Woon-Ho;Chung, Sang-Min
    • Journal of Plant Biotechnology / v.46 no.2 / pp.71-78 / 2019
  • The objective of this study was to resequence the genome of 'Danta PR' using NGS (next-generation sequencing) technology in order to develop polymorphic markers between Chinese oriental melon, 'Hyangseo 1' and Korean oriental melon. From the resequencing data, which covered the genome about 81-fold, 104,357 SSR motifs and InDels and 1,092,436 SNPs were identified. 299 SSR and 307 InDel markers were chosen to cover each chromosome with 25 markers. These markers were subsequently used to identify genotypes of the 'Danta PR' BC1 (F1 x 'Danta PR') population, and a genetic linkage map was constructed. The SSRs, InDels, and SNPs identified in this study would be useful as a breeding tool to develop new oriental melon varieties.

Identification of Causal and/or Rare Genetic Variants for Complex Traits by Targeted Resequencing in Population-based Cohorts

  • Kim, Yun-Kyoung;Hong, Chang-Bum;Cho, Yoon-Shin
    • Genomics & Informatics / v.8 no.3 / pp.131-137 / 2010
  • Genome-wide association studies (GWASs) have greatly contributed to the identification of common variants responsible for numerous complex traits. There are, however, unavoidable limitations in detecting causal and/or rare variants for traits in this approach, which depends on an LD-based tagging SNP microarray chip. In an effort to detect potential causal and/or rare variants for complex traits, such as type 2 diabetes (T2D) and triglycerides (TGs), we conducted a targeted resequencing of loci identified by the Korea Association REsource (KARE) GWAS. The target regions for resequencing comprised whole exons, exon-intron boundaries, and regulatory regions of genes that appeared within 1 Mb of the GWA signal boundary. From 124 individuals selected in population-based cohorts, a total of 0.7 Mb of target regions was captured by the NimbleGen sequence capture 385K array. Subsequent sequencing, carried out by the Roche 454 Genome Sequencer FLX, generated about 110,000 sequence reads per individual. Mapping of sequence reads to the human reference genome was performed using the SSAHA2 program. An average of 62.2% of total reads was mapped to targets, with an average of 22-fold coverage. A total of 5,983 SNPs (average 846 SNPs per individual) were called and annotated by GATK software, with 96.5% accuracy as estimated by comparison with Affymetrix 5.0 genotype data from identical individuals. About 51% of total SNPs were singletons that can be considered possible rare variants in the population. Among SNPs that appeared in exons, which account for about 20% of total SNPs, 304 nonsynonymous singletons were tested with PolyPhen to predict the protein damage caused by mutation. In total, we were able to detect 9 and 6 potentially functional rare SNPs for T2D and triglycerides, respectively, which calls for a further step of replication genotyping in independent populations to prove their bona fide relevance to the traits.

An Optimized Strategy for Genome Assembly of Sanger/pyrosequencing Hybrid Data using Available Software

  • Jeong, Hae-Young;Kim, Ji-Hyun F.
    • Genomics & Informatics / v.6 no.2 / pp.87-90 / 2008
  • During the last four years, the pyrosequencing-based 454 platform has rapidly displaced the traditional Sanger sequencing method due to its high throughput and cost effectiveness. Meanwhile, the Sanger sequencing methodology still provides the longest reads, and paired-end sequencing that is based on that chemistry offers an opportunity to ensure accurate assembly results. In this report, we describe an optimized approach for hybrid de novo genome assembly using pyrosequencing data and varying amounts of Sanger-type reads. 454 platform-derived contigs can be used as single non-breakable virtual reads or converted to simpler contigs that consist of editable, overlapping pseudoreads. These modified contigs maintain their integrity at the first jumpstarting assembly stage and are edited by fragmenting and rejoining. Pre-existing assembly software then can be applied for mixed assembly with 454-derived data and Sanger reads. An effective method for identifying genomic differences between reference and sample sequences in whole-genome resequencing procedures also is suggested.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics / v.19 no.3 / pp.33.1-33.10 / 2021
  • Eucalyptus is one of the major plantation species with a wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have a broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86) and one individual each from E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. The average number of SSR loci identified was 95,513, and the density of SSRs per Mb ranged from 155.08 in EC17 to 157.39 in EG9. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR-containing sequences. A selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels, and transcriptional regulators was carried out. These variations will find their utility in genome-wide association studies as well as in understanding the molecular mechanisms involved in key economic traits. The genomic resources generated in this study would provide an impetus to integrate genomics in marker-trait associations and breeding of tropical eucalypts.

Prediction of Genes Related to Positive Selection Using Whole-Genome Resequencing in Three Commercial Pig Breeds

  • Kim, HyoYoung;Caetano-Anolles, Kelsey;Seo, Minseok;Kwon, Young-jun;Cho, Seoae;Seo, Kangseok;Kim, Heebal
    • Genomics & Informatics / v.13 no.4 / pp.137-145 / 2015
  • Selective sweeps can cause genetic differentiation across populations, which allows for the identification of possible causative regions/genes underlying important traits. The pig has experienced a long history of allele frequency changes through artificial selection in the domestication process. We obtained an average of 329,482,871 sequence reads for 24 pigs from three pig breeds: Yorkshire (n = 5), Landrace (n = 13), and Duroc (n = 6). An average read depth of 11.7 was obtained using whole-genome resequencing on an Illumina HiSeq2000 platform. In this study, cross-population extended haplotype homozygosity and cross-population composite likelihood ratio tests were implemented to detect genes experiencing positive selection in the genome-wide resequencing data generated from three commercial pig breeds. In our results, 26, 7, and 14 genes from Yorkshire, Landrace, and Duroc, respectively, were detected by the two kinds of statistical tests. Significant evidence for positive selection was identified on genes ST6GALNAC2 and EPHX1 in Yorkshire, PARK2 in Landrace, and BMP6, SLA-DQA1, and PRKG1 in Duroc. These genes are reportedly relevant to lactation, reproduction, meat quality, and growth traits. To understand how these single nucleotide polymorphisms (SNPs) related to positive selection affect protein function, we analyzed the effect of non-synonymous SNPs. Three SNPs (rs324509622, rs80931851, and rs80937718) in the SLA-DQA1 gene were significant in the enrichment tests, indicating strong evidence for positive selection in Duroc. Our analyses identified genes under positive selection for lactation, reproduction, and meat-quality and growth traits in Yorkshire, Landrace, and Duroc, respectively.

Whole-genome resequencing reveals domestication and signatures of selection in Ujimqin, Sunit, and Wu Ranke Mongolian sheep breeds

  • Wang, Hanning;Zhong, Liang;Dong, Yanbing;Meng, Lingbo;Ji, Cheng;Luo, Hui;Fu, Mengrong;Qi, Zhi;Mi, Lan
    • Animal Bioscience / v.35 no.9 / pp.1303-1313 / 2022
  • Objective: The current study aimed to perform whole-genome resequencing of Chinese indigenous Mongolian sheep breeds, including the Ujimqin, Sunit, and Wu Ranke sheep breeds (UJMQ, SNT, WRK), and to analyze in depth the genetic variation, population structure, domestication, and selection for domestication traits among these Mongolian sheep breeds. Methods: Blood samples were collected from a total of 60 individuals comprising 20 WRK, 20 UJMQ, and 20 SNT. For genome sequencing, about 1.5 μg of genomic DNA was used for library construction with an insert size of about 350 bp. Paired-end sequencing was performed on the Illumina NovaSeq platform, with a read length of 150 bp at each end. We then investigated the domestication and signatures of selection in these sheep breeds. Results: According to the population and demographic analyses, the WRK and SNT populations were very similar to each other and differed from the UJMQ population. A genome-wide association study identified 468 and 779 significant loci from SNT vs UJMQ and UJMQ vs WRK, respectively. However, only 3 loci were identified from SNT vs WRK. Genomic comparison and selective sweep analysis among these sheep breeds suggested that genes associated with regulation of secretion, metabolic pathways including estrogen metabolism and amino acid metabolism, and neuron development have undergone strong selection during domestication. Conclusion: Our findings will facilitate the understanding of the domestication of Chinese indigenous Mongolian sheep breeds and of selection for complex traits, and provide a valuable genomic resource for future studies of sheep and other domestic animal breeding.

Discovery of Gene Sources for Economic Traits in Hanwoo by Whole-genome Resequencing

  • Shin, Younhee;Jung, Ho-jin;Jung, Myunghee;Yoo, Seungil;Subramaniyam, Sathiyamoorthy;Markkandan, Kesavan;Kang, Jun-Mo;Rai, Rajani;Park, Junhyung;Kim, Jong-Joo
    • Asian-Australasian Journal of Animal Sciences / v.29 no.9 / pp.1353-1362 / 2016
  • Hanwoo, a native Korean cattle breed (Bos taurus coreana), has great economic value due to its high meat quality. The breed also has genetic variations that are associated with production traits such as health, disease resistance, reproduction, and growth, as well as carcass quality. In this study, next-generation sequencing technologies and the availability of an appropriate reference genome were applied to discover a large number of single nucleotide polymorphisms (SNPs) in ten Hanwoo bulls. Analysis of whole-genome resequencing generated a total of 26.5 Gb of data, of which 594,716,859 and 592,990,750 reads covered 98.73% and 93.79% of the bovine reference genomes UMD 3.1 and Btau 4.6.1, respectively. In total, 2,473,884 and 2,402,997 putative SNPs were discovered, of which 1,095,922 (44.3%) and 982,674 (40.9%) were novel SNPs against UMD 3.1 and Btau 4.6.1, respectively. Among the SNPs, the 46,301 (UMD 3.1) and 28,613 (Btau 4.6.1) SNPs that were identified as Hanwoo-specific were located in functional genes that may be involved in the mechanisms of milk production, tenderness, juiciness, and marbling of Hanwoo beef and yellow hair. Most of the Hanwoo-specific SNPs were identified in the promoter region, suggesting that the SNPs influence differential expression of the regulated genes related to the relevant traits. In particular, the non-synonymous (ns) SNPs found in CORIN, which is a negative regulator of Agouti, might be causal variants that determine the yellow hair of Hanwoo. Our results will provide abundant genetic sources of variation to characterize Hanwoo genetics and for subsequent breeding.