• 제목/요약/키워드: whole genome resequencing

검색결과 26건 처리시간 0.029초

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells
    • /
    • 제37권1호
    • /
    • pp.36-42
    • /
    • 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, to do discovery of genome-wide SNPs, most methods require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools that improved its sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the workflow of SNP calling was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which are 60 times more than 71,637 of the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicated that the sufficient genome-wide SNP markers and very sensitive SNP calling method allow for application of marker assisted breeding and genome-wide association studies.

Survey of the Applications of NGS to Whole-Genome Sequencing and Expression Profiling

  • Lim, Jong-Sung;Choi, Beom-Soon;Lee, Jeong-Soo;Shin, Chan-Seok;Yang, Tae-Jin;Rhee, Jae-Sung;Lee, Jae-Seong;Choi, Ik-Young
    • Genomics & Informatics
    • /
    • 제10권1호
    • /
    • pp.1-8
    • /
    • 2012
  • Recently, the technologies of DNA sequence variation and gene expression profiling have been used widely as approaches in the expertise of genome biology and genetics. The application to genome study has been particularly developed with the introduction of the nextgeneration DNA sequencer (NGS) Roche/454 and Illumina/ Solexa systems, along with bioinformation analysis technologies of whole-genome $de$ $novo$ assembly, expression profiling, DNA variation discovery, and genotyping. Both massive whole-genome shotgun paired-end sequencing and mate paired-end sequencing data are important steps for constructing $de$ $novo$ assembly of novel genome sequencing data. It is necessary to have DNA sequence information from a multiplatform NGS with at least $2{\times}$ and $30{\times}$ depth sequence of genome coverage using Roche/454 and Illumina/Solexa, respectively, for effective an way of de novo assembly. Massive shortlength reading data from the Illumina/Solexa system is enough to discover DNA variation, resulting in reducing the cost of DNA sequencing. Whole-genome expression profile data are useful to approach genome system biology with quantification of expressed RNAs from a wholegenome transcriptome, depending on the tissue samples. The hybrid mRNA sequences from Rohce/454 and Illumina/Solexa are more powerful to find novel genes through $de$ $novo$ assembly in any whole-genome sequenced species. The $20{\times}$ and $50{\times}$ coverage of the estimated transcriptome sequences using Roche/454 and Illumina/Solexa, respectively, is effective to create novel expressed reference sequences. However, only an average $30{\times}$ coverage of a transcriptome with short read sequences of Illumina/Solexa is enough to check expression quantification, compared to the reference expressed sequence tag sequence.

Evolutionary Analyses of Hanwoo (Korean Cattle)-Specific Single-Nucleotide Polymorphisms and Genes Using Whole-Genome Resequencing Data of a Hanwoo Population

  • Lee, Daehwan;Cho, Minah;Hong, Woon-young;Lim, Dajeong;Kim, Hyung-Chul;Cho, Yong-Min;Jeong, Jin-Young;Choi, Bong-Hwan;Ko, Younhee;Kim, Jaebum
    • Molecules and Cells
    • /
    • 제39권9호
    • /
    • pp.692-698
    • /
    • 2016
  • Advances in next generation sequencing (NGS) technologies have enabled population-level studies for many animals to unravel the relationships between genotypic differences and traits of specific populations. The objective of this study was to perform evolutionary analysis of single nucleotide polymorphisms (SNP) in genes of Korean native cattle Hanwoo in comparison to SNP data from four other cattle breeds (Jersey, Simmental, Angus, and Holstein) and four related species (pig, horse, human, and mouse) obtained from public databases through NGS-based resequencing. We analyzed population structures and differentiation levels for the five cattle breeds and estimated species-specific SNPs with their origins and phylogenetic relationships among species. In addition, we identified Hanwoo-specific genes and proteins, and determined distinct changes in protein-protein interactions among five species (cattle, pig, horse, human, mouse) in the STRING network database by additionally considering indirect protein interactions. We found that the Hanwoo population was clearly different from the other four cattle populations. There were Hanwoo-specific genes related to its meat trait. Protein interaction rewiring analysis also confirmed that there were Hanwoo-specific protein-protein interactions that might have contributed to its unique meat quality.

Whole-Genome Resequencing Analysis of Hanwoo and Yanbian Cattle to Identify Genome-Wide SNPs and Signatures of Selection

  • Choi, Jung-Woo;Choi, Bong-Hwan;Lee, Seung-Hwan;Lee, Seung-Soo;Kim, Hyeong-Cheol;Yu, Dayeong;Chung, Won-Hyong;Lee, Kyung-Tai;Chai, Han-Ha;Cho, Yong-Min;Lim, Dajeong
    • Molecules and Cells
    • /
    • 제38권5호
    • /
    • pp.466-473
    • /
    • 2015
  • Over the last 30 years, Hanwoo has been selectively bred to improve economically important traits. Hanwoo is currently the representative Korean native beef cattle breed, and it is believed that it shared an ancestor with a Chinese breed, Yanbian cattle, until the last century. However, these two breeds have experienced different selection pressures during recent decades. Here, we whole-genome sequenced 10 animals each of Hanwoo and Yanbian cattle (20 total) using the Illumina HiSeq 2000 sequencer. A total of approximately 3.12 and 3.07 billion sequence reads were mapped to the bovine reference sequence assembly (UMD 3.1) at an average of approximately 10.71- and 10.53-fold coverage for Hanwoo and Yanbian cattle, respectively. A total of 17,936,399 single nucleotide polymorphisms (SNPs) were yielded, of which 22.3% were found to be novel. By annotating the SNPs, we further retrieved numerous nonsynonymous SNPs that may be associated with traits of interest in cattle. Furthermore, we performed whole-genome screening to detect signatures of selection throughout the genome. We located several promising selective sweeps that are potentially responsible for economically important traits in cattle; the PPP1R12A gene is an example of a gene that potentially affects intramuscular fat content. These discoveries provide valuable genomic information regarding potential genomic markers that could predict traits of interest for breeding programs of these cattle breeds.

NGS 기술 활용 돌연변이체 해석 및 연구현황 (Current status and prospects to identify mutations responsible for mutant phenotypes by using NGS technology)

  • 정유진;류호진;조용구;강권규
    • Journal of Plant Biotechnology
    • /
    • 제43권4호
    • /
    • pp.411-416
    • /
    • 2016
  • NGS 기술은 전체 게놈 시퀀싱 및 reference 게놈에 alignment에 의해 돌연변이 표현형에 관련된 돌연변이 식별에 이용한다. 그러나 품종 및 계통들을 resequence 하였을 경우 기존의 reference 게놈에 구조적 변이가 보이며, reference와 맞지 않는 게놈지역에서 돌연변이들은 단순한 alignment로 찾을 수 없다. 본 리뷰에서는 NGS 기술을 이용하여 돌연변이체로부터 변이 관련 유전자를 식별하는 MutMap, MutMap-Gap 및 MutMap+ 방법을 기술하였고 지금까지의 연구현황에 대해 기술하였다. 아울러 이들 방법은 nucleotide-binding site-leucine rich repeat (NBS-LRR) 그룹들의 병 저항성 유전자와 같이 구조적 변이를 가진 유전자를 분리하는 등 유용성에 대해 고찰하였다.

In silico approaches to identify the functional and structural effects of non-synonymous SNPs in selective sweeps of the Berkshire pig genome

  • Shin, Donghyun;Oh, Jae-Don;Won, Kyeong-Hye;Song, Ki-Duk
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제31권8호
    • /
    • pp.1150-1159
    • /
    • 2018
  • Objective: Non-synonymous single nucleotide polymorphisms (nsSNPs) were identified in Berkshire selective sweep regions and then were investigated to discover genetic nsSNP mechanisms that were potentially associated with Berkshire domestication and meat quality. We further used bioinformatics tools to predict damaging amino-acid substitutions in Berkshire-related nsSNPs. Methods: nsSNPs were examined in whole genome resequencing data of 110 pigs, including 14 Berkshire pigs, generated using the Illumina Hiseq2000 platform to identify variations that might affect meat quality in Berkshire pigs. Results: Total 65,550 nsSNPs were identified in the mapped regions; among these, 319 were found in Berkshire selective-sweep regions reported in a previous study. Genes encompassing these nsSNPs were involved in lipid metabolism, intramuscular fatty-acid deposition, and muscle development. The effects of amino acid change by nsSNPs on protein functions were predicted using sorting intolerant from tolerant and polymorphism phenotyping V2 to reveal their potential roles in biological processes that may correlate with the unique Berkshire meat-quality traits. Conclusion: Our nsSNP findings confirmed the history of Berkshire pigs and illustrated the effects of domestication on generic-variation patterns. Our novel findings, which are generally consistent with those of previous studies, facilitated a better understanding of Berkshire domestication. In summary, we extensively investigated the relationship between genomic composition and phenotypic traits by scanning for nsSNPs in large-scale whole-genome sequencing data.

Genomic analysis reveals selection signatures of the Wannan Black pig during domestication and breeding

  • Zhang, Wei;Yang, Min;Wang, Yuanlang;Wu, Xudong;Zhang, Xiaodong;Ding, Yueyun;Yin, Zongjun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제33권5호
    • /
    • pp.712-721
    • /
    • 2020
  • Objective: The Wannan Black pig is a typical Chinese indigenous, disease-resistant pig breed with high fertility, and a crude-feed tolerance that has been bred by artificial selection in the south of Anhui province for a long time. However, genome variation, genetic relationships with other pig breeds, and domestication, remain poorly understood. Here, we focus on elucidating the genetic characteristics of the Wannan Black pig and identifying selection signatures during domestication and breeding. Methods: We identified the whole-genome variation in the Wannan Black pig and performed population admixture analyses to determine genetic relationships with other domesticated pig breeds and wild boars. Then, we identified the selection signatures between the Wannan Black pig and Asian wild boars in 100-kb windows sliding in 10 kb steps by using two approaches: the fixation index (FST) and π ratios. Results: Resequencing the Wannan Black pig genome yielded 501.52 G of raw data. After calling single-nucleotide variants (SNVs) and insertions/deletions (InDels), we identified 21,316,754 SNVs and 5,067,206 InDels (2,898,582 inserts and 2,168,624 deletions). Additionally, we found genes associated with growth, immunity, and digestive functions. Conclusion: Our findings help in explaining the unique genetic and phenotypic characteristics of Wannan Black pigs, which in turn can be informative for future breeding programs of Wannan Black pigs.

Genetic diversity and selection of Tibetan sheep breeds revealed by whole-genome resequencing

  • Dehong Tian;Buying Han;Xue Li;Dehui Liu;Baicheng Zhou;Chunchuan Zhao;Nan Zhang;Lei Wang;Quanbang Pei;Kai Zhao
    • Animal Bioscience
    • /
    • 제36권7호
    • /
    • pp.991-1002
    • /
    • 2023
  • Objective: This study aimed to elucidate the underlying gene regions responsible for productive, phenotypic or adaptive traits in different ecological types of Tibetan sheep and the discovery of important genes encoding valuable traits. Methods: We used whole-genome resequencing to explore the genetic relationships, phylogenetic tree, and population genetic structure analysis. In addition, we identified 28 representative Tibetan sheep single-nucleotide polymorphisms (SNPs) and genomic selective sweep regions with different traits in Tibetan sheep by fixation index (Fst) and the nucleotide diversity (θπ) ratio. Results: The genetic relationships analysis showed that each breed partitioned into its own clades and had close genetic relationships. We also identified many potential breed-specific selective sweep regions, including genes associated with hypoxic adaptability (MTOR, TRHDE, PDK1, PTPN9, TMTC2, SOX9, EPAS1, PDGFD, SOCS3, TGFBR3), coat color (MITF, MC1R, ERCC2, TCF25, ITCH, TYR, RALY, KIT), wool traits (COL4A2, ERC2, NOTCH2, ROCK1, FGF5, SOX9), and horn phenotypes (RXFP2). In particular, a horn-related gene, RXFP2, showed the four most significantly associated SNP loci (g. 29481646 A>G, g. 29469024 T>C, g. 29462010 C>T, g. 29461968 C>T) and haplotypes. Conclusion: This finding demonstrates the potential for genetic markers in future molecular breeding programs to improve selection for horn phenotypes. The results will facilitate the understanding of the genetic basis of production and adaptive unique traits in Chinese indigenous Tibetan sheep taxa and offer a reference for the molecular breeding of Tibetan sheep.

Genome analysis of Yucatan miniature pigs to assess their potential as biomedical model animals

  • Kwon, Dae-Jin;Lee, Yeong-Sup;Shin, Donghyun;Won, Kyeong-Hye;Song, Ki-Duk
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제32권2호
    • /
    • pp.290-296
    • /
    • 2019
  • Objective: Pigs share many physiological, anatomical and genomic similarities with humans, which make them suitable models for biomedical researches. Understanding the genetic status of Yucatan miniature pigs (YMPs) and their association with human diseases will help to assess their potential as biomedical model animals. This study was performed to identify non-synonymous single nucleotide polymorphisms (nsSNPs) in selective sweep regions of the genome of YMPs and present the genetic nsSNP distributions that are potentially associated with disease occurrence in humans. Methods: nsSNPs in whole genome resequencing data from 12 YMPs were identified and annotated to predict their possible effects on protein function. Sorting intolerant from tolerant (SIFT) and polymorphism phenotyping v2 analyses were used, and gene ontology (GO) network and Kyoto encyclopedia of genes and genomes (KEGG) pathway analyses were performed. Results: The results showed that 8,462 genes, encompassing 72,067 nsSNPs were identified, and 118 nsSNPs in 46 genes were predicted as deleterious. GO network analysis classified 13 genes into 5 GO terms (p<0.05) that were associated with kidney development and metabolic processes. Seven genes encompassing nsSNPs were classified into the term associated with Alzheimer's disease by referencing the genetic association database. The KEGG pathway analysis identified only one significantly enriched pathway (p<0.05), hsa04080: Neuroactive ligand-receptor interaction, among the transcripts. Conclusion: The number of deleterious nsSNPs in YMPs was identified and then these variants-containing genes in YMPs data were adopted as the putative human diseases-related genes. The results revealed that many genes encompassing nsSNPs in YMPs were related to the various human genes which are potentially associated with kidney development and metabolic processes as well as human disease occurrence.

감귤 유전체 연구 동향 및 전망 (Current status and prospects of citrus genomics)

  • 김호방;임상현;김재준;박영철;윤수현;송관정
    • Journal of Plant Biotechnology
    • /
    • 제42권4호
    • /
    • pp.326-335
    • /
    • 2015
  • 감귤은 전 세계적으로 가장 많이 생산되는 주요 과수작물이고 비타민 C와 구연산 및 감귤 고유의 플라보노이드를 비롯한 다양한 기능성 성분으로 인해 건강 기능성 식품 소재로도 각광받고 있다. 그러나 긴 유년기와 배우체 불임, 주심배 발생 및 고도의 유전적 잡종성 등 감귤 특유의 생식생물학적 특성으로 인해 교배를 통한 전통 육종의 품종개발에 있어서는 가장 어려운 작물에 속한다. 지구 온난화, 소비자 욕구 변화 등으로 인해 고품질 감귤의 안정적 생산과 품종 다양화를 위한 체계적 육종 프로그램의 도입이 시급한 실정이다. 감귤에서도 분자 육종 프로그램을 통한 품종 육성을 위해 세계적으로 가장 많이 재배되는 스위트 오렌지와 클레멘타인 만다린에 대한 고품질 표준 유전체 정보가 최근에 확보되었다. 표준유전체 서열을 기반으로 다양한 품종 및 교배집단들에 대한 유전체 해독, 비교유전체 분석, GBS 등을 통해 형질연관 마커 발굴, 유전자 기능 연구 등이 이루어질 것으로 전망된다. 아울러 다양한 전사체 분석이 이루어지고 있으며, 유전자 기능 및 유전자 co-expression 네트워크의 이해를 증진할 수 있을 것이다. 유전체 및 전사체 분석을 통해 확보한 대규모 SNP, InDel 및 SSR의 다형성 분자마커 big data를 이용한 고밀도 연관 및 물리 지도 작성이 이루어지고 있고, 궁극적으로 통합지도 작성이 이루어지게 될 것이다. 이를 통해 가까운 장래에 감귤 특이 주요 농업형질 연관 유전자의 정확도 높은 map-based 클로닝 및 빠르고 효율적인 분자표지 선발육종이 이루어질 것이다.