• Title/Abstract/Keywords: Whole genome association study

64 search results (processing time: 0.032 s)

A Whole Genome Association Study to Detect Single Nucleotide Polymorphisms for Carcass Traits in Hanwoo Populations

  • Lee, Y.-M.;Han, C.-M.;Li, Yi;Lee, J.-J.;Kim, L.H.;Kim, J.-H.;Kim, D.-I.;Lee, S.-S.;Park, B.-L.;Shin, H.-D.;Kim, K.-S.;Kim, N.-S.;Kim, Jong-Joo
    • Asian-Australasian Journal of Animal Sciences
    • /
    • Vol. 23, No. 4
    • /
    • pp.417-424
    • /
    • 2010
  • The purpose of this study was to detect significant SNPs for carcass quality traits using high-density SNP chips in Hanwoo populations. Carcass data for 289 steers sired by 30 Korean proven sires were collected from two regions: the Hanwoo Improvement Center of the National Agricultural Cooperative Federation in Seosan, Chungnam province, and commercial farms in Gyeongbuk province. The steers in Seosan were born between spring and fall of 2006, and those in Gyeongbuk between the falls of 2004 and 2005. The former were slaughtered at approximately 24 months, while the latter were fed six months longer before slaughter. Among the 55,074 SNPs on the Illumina bovine 50K chip, 32,756 available SNPs were selected for the whole genome association study. After adjusting for the effects of sire, region and slaughter age, phenotypes were regressed on each SNP using a simple linear regression model. The significance threshold for each SNP test was a point-wise p value of 0.1% from the F distribution. Among the significant SNPs for a trait, the best set of SNP markers was selected using a stepwise regression procedure, with inclusion in and exclusion from the model determined at the p<0.001 level. A total of 118 SNPs were detected: 15, 20, 22, 28, 20, and 13 SNPs for final weight before slaughter, carcass weight, backfat thickness, weight index, longissimus dorsi muscle area, and marbling score, respectively. Among the significant SNPs, the best set of 44 SNPs was determined by stepwise regression, with 7, 9, 6, 9, 7, and 6 SNPs for the respective traits. Each set of SNPs per trait explained 20-40% of the phenotypic variance. The number of SNPs detected per trait was modest, suggesting that additional phenotype and genotype data are required to gain power to detect trait-related SNPs and to estimate SNP effects accurately. These SNP markers could be applied to commercial Hanwoo populations via marker-assisted selection to verify the SNP effects and to improve genetic potential in successive generations of the Hanwoo populations.
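
The single-SNP test described in this abstract (simple linear regression with a 0.1% point-wise threshold from the F distribution) can be sketched roughly as follows. This is an illustrative sketch, not the authors' code: the function names are hypothetical, phenotypes are assumed to be already adjusted for sire, region and slaughter age, genotypes are assumed to be coded as allele counts (0/1/2), and the p-value is obtained by numerically integrating the t density (using F(1, n-2) = t²) only to keep the example free of third-party libraries.

```python
import math

def t_pdf(t, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(t * t / df))

def f_test_p_value(f_stat, df_resid, steps=2000):
    """Upper-tail p-value of F(1, df_resid), via F = t^2 and Simpson's rule."""
    t = math.sqrt(f_stat)
    h = t / steps  # steps must be even for Simpson's rule
    area = t_pdf(0.0, df_resid) + t_pdf(t, df_resid)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * t_pdf(i * h, df_resid)
    area *= h / 3  # integral of the density from 0 to t
    return max(0.0, 1.0 - 2.0 * area)

def single_snp_regression(genotypes, phenotypes):
    """Regress adjusted phenotypes on allele counts; return (slope, F, p)."""
    n = len(genotypes)
    mean_x = sum(genotypes) / n
    mean_y = sum(phenotypes) / n
    sxx = sum((x - mean_x) ** 2 for x in genotypes)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(genotypes, phenotypes))
    syy = sum((y - mean_y) ** 2 for y in phenotypes)
    slope = sxy / sxx
    ss_model = slope * sxy          # regression sum of squares
    ss_resid = syy - ss_model       # residual sum of squares
    f_stat = ss_model / (ss_resid / (n - 2))
    return slope, f_stat, f_test_p_value(f_stat, n - 2)

# Toy data (made up for illustration): one SNP, six animals.
toy_genotypes = [0, 1, 2, 0, 1, 2]
toy_phenotypes = [1.0, 2.1, 2.9, 0.9, 2.0, 3.1]
slope, f_stat, p_value = single_snp_regression(toy_genotypes, toy_phenotypes)
is_significant = p_value < 0.001    # 0.1% point-wise threshold
```

In practice a statistics library would supply the F-distribution tail probability directly; the manual integration here is only a stand-in.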

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells
    • /
    • Vol. 37, No. 1
    • /
    • pp.36-42
    • /
    • 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, most methods for discovering genome-wide SNPs require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools with improved sensitivity. We analyzed 90 Gb of raw next-generation sequencing data from two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the SNP calling workflow was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, 60 times more than the 71,637 identified from the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicate that sufficient genome-wide SNP markers and a very sensitive SNP calling method allow for the application of marker-assisted breeding and genome-wide association studies.

Whole-genome association and genome partitioning revealed variants and explained heritability for total number of teats in a Yorkshire pig population

  • Uzzaman, Md. Rasel;Park, Jong-Eun;Lee, Kyung-Tai;Cho, Eun-Seok;Choi, Bong-Hwan;Kim, Tae-Hun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • Vol. 31, No. 4
    • /
    • pp.473-479
    • /
    • 2018
  • Objective: The study was designed to perform a genome-wide association (GWA) analysis and genome partitioning using Illumina's PorcineSNP60 BeadChip in order to identify variants and determine the explained heritability for the total number of teats in Yorkshire pigs. Methods: After screening with the criteria minor allele frequency (MAF) $\leq 0.01$ and Hardy-Weinberg equilibrium (HWE) $\leq 0.000001$, a pair-wise genomic relationship matrix was produced using 42,953 single nucleotide polymorphisms (SNPs). A genome-wide mixed linear model-based association analysis (MLMA) was conducted. To estimate the heritability explained by genome- or chromosome-wide SNPs, genetic relatedness estimation through a maximum likelihood approach was used. Results: The MLMA analysis and false discovery rate p-values identified three significant SNPs on two chromosomes (rs81476910 and rs81405825 on SSC8; rs81332615 on SSC13) for total number of teats. In addition, we estimated that 30% of the variance in the trait could be explained by all of the common SNPs on the autosomes. The largest heritability estimates obtained by partitioning the genome were $0.22 \pm 0.05$, $0.16 \pm 0.05$, $0.10 \pm 0.03$ and $0.08 \pm 0.03$ on SSC7, SSC13, SSC1, and SSC8, respectively. Of these, SSC7 explained the largest share of the estimated heritability and also carried a SNP (rs80805264) identified by GWA at an empirical p value of 2.35E-05. Interestingly, rs80805264 was found near a quantitative trait locus (QTL) on SSC7 for teat number identified in a recent study. Moreover, all other significant SNPs were found within and/or close to QTLs related to ovary weight, total number born alive and age at puberty in pigs. Conclusion: The SNPs we identified unquestionably represent some of the important QTL regions as well as genes of interest in the genome for various physiological functions responsible for reproduction in pigs.
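
The SNP screening step mentioned in the Methods (MAF and HWE criteria) might look roughly like the sketch below. Several things here are assumptions rather than details from the abstract: the thresholds are read as exclusion cutoffs (SNPs with MAF ≤ 0.01 or HWE p ≤ 0.000001 are dropped), the HWE test is a one-degree-of-freedom chi-square goodness-of-fit test (the abstract does not say which test was used), and the function names are hypothetical.

```python
import math

def maf_and_hwe(n_aa, n_ab, n_bb):
    """Return (minor allele frequency, HWE chi-square p-value) from genotype counts."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1 - p
    maf = min(p, q)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expected count (monomorphic SNPs).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return maf, p_value

def passes_qc(n_aa, n_ab, n_bb, maf_cutoff=0.01, hwe_cutoff=1e-6):
    """Keep a SNP only if it exceeds both assumed exclusion cutoffs."""
    maf, hwe_p = maf_and_hwe(n_aa, n_ab, n_bb)
    return maf > maf_cutoff and hwe_p > hwe_cutoff

# Toy genotype counts (made up): (AA, AB, BB) per SNP.
balanced = passes_qc(25, 50, 25)        # in HWE, common alleles
no_hets = passes_qc(50, 0, 50)          # strong HWE departure
monomorphic = passes_qc(100, 0, 0)      # MAF of zero
```

Dedicated tools typically use an exact HWE test, which behaves better than the chi-square approximation when genotype counts are small.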

Identify Major Gene-Gene Interaction Effects Using SNPHarvester

  • 이제영;김동철
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 16, No. 6
    • /
    • pp.915-923
    • /
    • 2009
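  • Genome-wide association studies search among a vast number of genes for those related to human disease. Applying existing methods for finding disease-related genes directly to the selection of superior genes from so many candidates has the drawback that the computation becomes complex, costly, and time-consuming. SNPHarvester was therefore developed as a method for finding major gene groups among a large number of genes. In this study, we applied SNPHarvester not to human disease but to several economic traits of Hanwoo (Korean cattle), finding superior gene groups among 17 SNPs, and we used a decision tree to identify the superior genotypes within those SNP groups that can improve the economic traits of Hanwoo.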

Identification of SNPs Related to 19 Phenotypic Traits Using Genome-wide Association Study (GWAS) Approach in Korean Wheat Mini-core Collection

  • Yuna Kang;Yeonjun Sung;Seonghyeon Kim;Changsoo Kim
    • Korean Society of Crop Science: Conference Proceedings
    • /
    • Korean Society of Crop Science 2020 Spring Conference
    • /
    • pp.120-120
    • /
    • 2020
  • A Korean wheat core collection of 616 wheat accessions was established based on simple sequence repeat (SSR) markers. SNP genotyping of the entire genome was then performed on these accessions using a DNA chip array to clarify their whole genome SNP profiles. A total of 35,143 SNPs were found, and we re-established a mini-core collection with 247 accessions. Population diversity and phylogenetic analysis revealed genetic diversity and relationships within the mini-core set. In addition, a genome-wide association study (GWAS) was performed on 19 phenotypic traits; ear type, awn length, culm length, ear length, awn color, seed coat color, culm color, ear color, loading, leaf length, leaf width, seeding stand, cold damage, weight, auricle, plant type, heading stage, maturation period, upright habit, and degree of flag leaf. The GWAS was performed using the fixed and random model circulating probability unification (FarmCPU) method, which identified 14 to 258 SNP loci related to the 19 phenotypic traits. Our study indicates that this Korean wheat mini-core collection is a set of germplasm useful for basic and applied research aimed at understanding and exploiting the genetic diversity of Korean wheat varieties.


Short Reads Phasing to Construct Haplotypes in Genomic Regions That Are Associated with Body Mass Index in Korean Individuals

  • Lee, Kichan;Han, Seonggyun;Tark, Yeonjeong;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • Vol. 12, No. 4
    • /
    • pp.165-170
    • /
    • 2014
  • Genome-wide association (GWA) studies have found many important genetic variants that affect various traits. Since these studies are useful for investigating untyped but causal variants using linkage disequilibrium (LD), it would be useful to explore the haplotypes of single-nucleotide polymorphisms (SNPs) within the same LD block as significant associations, based on high-density variants from population references. Here, we tried to build a catalog of haplotypes affecting body mass index (BMI) through an integrative analysis of previously published whole-genome next-generation sequencing (NGS) data of 7 representative Korean individuals and previously known Korean GWA signals. We selected 435 SNPs that were significantly associated with BMI in the GWA analysis and searched 53 LD ranges near those SNPs. With the NGS data, the haplotypes were phased within the LD ranges. A total of 44 possible haplotype blocks for Korean BMI were cataloged. Although the current result is based on little data, this study provides new insights that may help to identify important haplotypes for traits and low variants near significant SNPs. Furthermore, we can build a more comprehensive catalog as a larger dataset becomes available.

Identification of Causal and/or Rare Genetic Variants for Complex Traits by Targeted Resequencing in Population-based Cohorts

  • Kim, Yun-Kyoung;Hong, Chang-Bum;Cho, Yoon-Shin
    • Genomics & Informatics
    • /
    • Vol. 8, No. 3
    • /
    • pp.131-137
    • /
    • 2010
  • Genome-wide association studies (GWASs) have greatly contributed to the identification of common variants responsible for numerous complex traits. There are, however, unavoidable limitations in detecting causal and/or rare variants for traits with this approach, which depends on an LD-based tagging SNP microarray chip. In an effort to detect potential causal and/or rare variants for complex traits, such as type 2 diabetes (T2D) and triglycerides (TGs), we conducted targeted resequencing of loci identified by the Korea Association REsource (KARE) GWAS. The target regions for resequencing comprised whole exons, exon-intron boundaries, and regulatory regions of genes that appeared within 1 Mb of the GWA signal boundary. From 124 individuals selected from population-based cohorts, a total of 0.7 Mb of target regions was captured by the NimbleGen sequence capture 385K array. Subsequent sequencing, carried out on the Roche 454 Genome Sequencer FLX, generated about 110,000 sequence reads per individual. Sequence reads were mapped to the human reference genome using the SSAHA2 program. An average of 62.2% of total reads mapped to targets, with an average 22-fold coverage. A total of 5,983 SNPs (average 846 SNPs per individual) were called and annotated by GATK software, with 96.5% accuracy as estimated by comparison with Affymetrix 5.0 genotype data from the same individuals. About 51% of total SNPs were singletons that can be considered possible rare variants in the population. Among SNPs that appeared in exons, which account for about 20% of total SNPs, 304 nonsynonymous singletons were tested with PolyPhen to predict the protein damage caused by mutation. In total, we were able to detect 9 and 6 potentially functional rare SNPs for T2D and triglycerides, respectively, calling for a further step of replication genotyping in independent populations to prove their bona fide relevance to the traits.
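
The accuracy figure in this abstract comes from comparing sequencing-based SNP calls with array genotypes for the same individuals. One plausible reading is a simple genotype concordance rate, sketched below. The abstract does not give the exact definition used, so the function name, the site identifiers, and the choice to compare only sites called on both platforms are assumptions.

```python
def genotype_concordance(seq_calls, array_calls):
    """Fraction of sites called on both platforms whose genotypes agree.

    Each argument maps a site ID to a two-letter genotype such as "AG";
    allele order is ignored, so "AG" and "GA" count as a match.
    """
    shared = [site for site in seq_calls
              if seq_calls[site] is not None
              and array_calls.get(site) is not None]
    if not shared:
        raise ValueError("no sites genotyped on both platforms")
    matches = sum(1 for site in shared
                  if sorted(seq_calls[site]) == sorted(array_calls[site]))
    return matches / len(shared)

# Toy calls for one individual (site IDs and genotypes are made up).
seq = {"site1": "AG", "site2": "CC", "site3": "TT", "site4": "AA"}
array = {"site1": "GA", "site2": "CC", "site3": "CT", "site5": "GG"}
concordance = genotype_concordance(seq, array)  # 2 of 3 shared sites agree
```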

Whole-genome sequence association study identifies cyclin dependent kinase 8 as a key gene for the number of mummified piglets

  • Pingxian, Wu;Dejuan, Chen;Kai, Wang;Shujie, Wang;Yihui, Liu;Anan, Jiang;Weihang, Xiao;Yanzhi, Jiang;Li, Zhu;Xu, Xu;Xiaotian, Qiu;Xuewei, Li;Guoqing, Tang
    • Animal Bioscience
    • /
    • Vol. 36, No. 1
    • /
    • pp.29-42
    • /
    • 2023
  • Objective: Pigs, an ideal biomedical model for human diseases, suffer from about 50% early embryonic and fetal death, a major cause of fertility loss worldwide. However, identifying the causal variant remains a huge challenge. This study aimed to detect single nucleotide polymorphisms (SNPs) and candidate genes for the number of mummified (NM) piglets using imputed whole-genome sequence (WGS) data and to validate the potential candidate genes. Methods: The WGS data were imputed from genotyping-by-sequencing (GBS) data using a multi-breed reference population. We performed genome-wide association studies (GWAS) for NM piglets at birth in a Landrace pig population. A total of 300 Landrace pigs were genotyped by GBS. The whole-genome variants were imputed, and 4,252,858 SNPs were obtained. Various molecular experiments were conducted to determine how the genes affected NM in pigs. Results: A strong GWAS peak located on SSC11: 0.10 to 7.11 Mbp (Top SNP, SSC11:1,889,658 bp; p = 9.98E-13) was identified in the cyclin dependent kinase 8 (CDK8) gene, which plays a crucial role in embryonic retardation and lethality. Based on the molecular experiments, we found that Y-box binding protein 1 (YBX1) was a crucial transcription factor for CDK8, which mediated the effect of CDK8 on the proliferation of porcine ovarian granulosa cells via the transforming growth factor beta/small mother against decapentaplegic signaling pathway and, as a consequence, affected embryo quality, indicating that this pathway may contribute to mummified fetuses in pigs. Conclusion: A powerful imputation-based association study was performed to identify genes associated with NM in pigs. CDK8 was suggested as a functional gene for the proliferation of porcine ovarian granulosa cells, but further studies are required to determine causative mutations and the effect of the loci on NM in pigs.

A Genome Wide Association Study on Age at First Calving Using High Density Single Nucleotide Polymorphism Chips in Hanwoo (Bos taurus coreanae)

  • Hyeong, K.E.;Iqbal, A.;Kim, Jong-Joo
    • Asian-Australasian Journal of Animal Sciences
    • /
    • Vol. 27, No. 10
    • /
    • pp.1406-1410
    • /
    • 2014
  • Age at first calving is an important trait for achieving earlier reproductive performance. To detect quantitative trait loci (QTL) for reproductive traits, a genome wide association study was conducted on 96 Hanwoo cows that were born between 2008 and 2010 from 13 sires on a local farm (Juk-Am Hanwoo farm, Suncheon, Korea) and genotyped with the Illumina 50K bovine single nucleotide polymorphism (SNP) chip. Phenotypes were regressed on additive and dominance effects for each SNP using a simple linear regression model after the effects of birth-year-month and polygenes were considered. A forward regression procedure was applied to determine the best set of SNPs for age at first calving. A total of 15 QTL were detected at the comparison-wise 0.001 level. Two QTL with strong statistical evidence were found at 128.9 Mb and 111.1 Mb on bovine chromosomes (BTA) 2 and 7, respectively, each of which accounted for 22% of the phenotypic variance. Also, five significant SNPs were detected on BTAs 10, 16, 20, 26, and 29. Multiple QTL were found on BTAs 1, 2, 7, and 14. The significant QTL may be applied via marker-assisted selection to increase the rate of genetic gain for the trait, after validation tests in other Hanwoo cow populations.
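
The additive and dominance effects mentioned in this abstract can be illustrated with a small sketch. It assumes the common coding in which the additive covariate is -1/0/1 and the dominance covariate is 0/1/0 for the three genotypes; the abstract does not state the coding used. Under that coding, a model with an intercept plus both covariates is saturated over the three genotype classes, so its least-squares estimates reduce to the functions of genotype means shown here. The function name is hypothetical, and the adjustment for birth-year-month and polygenic effects is left out.

```python
def additive_dominance_effects(genotypes, phenotypes):
    """Estimate additive (a) and dominance (d) effects for one SNP.

    genotypes: allele counts 0, 1, 2 (AA, AB, BB); all three must occur.
    Returns (a, d) with a = (mean_BB - mean_AA) / 2 and
    d = mean_AB - (mean_AA + mean_BB) / 2.
    """
    groups = {0: [], 1: [], 2: []}
    for g, y in zip(genotypes, phenotypes):
        groups[g].append(y)
    if any(len(values) == 0 for values in groups.values()):
        raise ValueError("all three genotype classes must be observed")
    mean = {g: sum(values) / len(values) for g, values in groups.items()}
    additive = (mean[2] - mean[0]) / 2
    dominance = mean[1] - (mean[0] + mean[2]) / 2
    return additive, dominance

# Toy data (made up): genotype means are 11, 16 and 19.
toy_genotypes = [0, 0, 1, 1, 2, 2]
toy_phenotypes = [10.0, 12.0, 15.0, 17.0, 18.0, 20.0]
a_hat, d_hat = additive_dominance_effects(toy_genotypes, toy_phenotypes)
```

A positive dominance estimate, as in the toy data, means heterozygotes lie above the midpoint of the two homozygote means.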

Whole-genome resequencing reveals domestication and signatures of selection in Ujimqin, Sunit, and Wu Ranke Mongolian sheep breeds

  • Wang, Hanning;Zhong, Liang;Dong, Yanbing;Meng, Lingbo;Ji, Cheng;Luo, Hui;Fu, Mengrong;Qi, Zhi;Mi, Lan
    • Animal Bioscience
    • /
    • Vol. 35, No. 9
    • /
    • pp.1303-1313
    • /
    • 2022
  • Objective: The current study aimed to perform whole-genome resequencing of Chinese indigenous Mongolian sheep breeds, including the Ujimqin, Sunit, and Wu Ranke breeds (UJMQ, SNT, WRK), and to analyze in depth the genetic variation, population structure, domestication, and selection for domestication traits among these breeds. Methods: Blood samples were collected from a total of 60 individuals comprising 20 WRK, 20 UJMQ, and 20 SNT. For genome sequencing, about 1.5 µg of genomic DNA was used for library construction with an insert size of about 350 bp. Paired-end sequencing was performed on the Illumina NovaSeq platform, with a read length of 150 bp at each end. We then investigated the domestication and signatures of selection in these sheep breeds. Results: According to the population and demographic analyses, the WRK and SNT populations were very similar to each other and different from the UJMQ population. Genome wide association analysis identified 468 and 779 significant loci from SNT vs UJMQ and UJMQ vs WRK, respectively. However, only 3 loci were identified from SNT vs WRK. Genomic comparison and selective sweep analysis among these sheep breeds suggested that genes associated with regulation of secretion, metabolic pathways including estrogen metabolism and amino acid metabolism, and neuron development have undergone strong selection during domestication. Conclusion: Our findings will facilitate the understanding of domestication and selection for complex traits in Chinese indigenous Mongolian sheep breeds and provide a valuable genomic resource for future studies of sheep and other domestic animal breeding.