• Title/Summary/Keyword: Genome Wide Association Study

Search Result 286, Processing Time 0.021 seconds

Genome-wide Survey of Copy Number Variants Associated with Blood Pressure and Body Mass Index in a Korean Population

  • Moon, Sang-Hoon;Kim, Young-Jin;Kim, Yun-Kyoung;Kim, Dong-Joon;Lee, Ji-Young;Go, Min-Jin;Shin, Young-Ah;Hong, Chang-Bum;Kim, Bong-Jo
    • Genomics & Informatics
    • /
    • v.9 no.4
    • /
    • pp.152-160
    • /
    • 2011
  • Hypertension is the major factor of most death and high blood pressure (BP) can lead to stroke, myocardial infarction and cardiac failure. Moreover, hypertension is strongly correlated with body mass index (BMI). Although the exact causes of hypertension are still unclear, some of genetic loci were discovered from genome-wide association study (GWAS). Therefore, it is essential to study genetic variation for finding more genetic factor affecting hypertension. The purpose of our study is to conduct a CNV association study for hypertension-related traits, BP and BMI, in Korean individuals. We identified 2,206 CNV regions from 3,274 community-based Korean participants using the Affymetrix Genome-Wide Human SNP Array 6.0 platform and performed a logistic regression analysis of CNVs with two hypertension-related traits, BP and BMI. Moreover, the 4,692 participants in an independent cohort were selected for respective replication analyses. GWAS of CNV identified two loci encompassing previously known hypertension-related genes: LPA (lipoprotein) on 6q26, and JAK2 (Janus kinase 2) on 9p24, with suggestive p-values (0.0334 for LPA and 0.0305 for JAK2 ). These two positive findings, however, were not evaluated in the replication stage. Our result confirmed the conclusion of CNV study from the WTCCC suggesting weak association with common diseases. This is the first study of CNV association study with BP and BMI in Korean population and it provides a state of CNV association study with common human diseases using SNP array.

Whole-genome association and genome partitioning revealed variants and explained heritability for total number of teats in a Yorkshire pig population

  • Uzzaman, Md. Rasel;Park, Jong-Eun;Lee, Kyung-Tai;Cho, Eun-Seok;Choi, Bong-Hwan;Kim, Tae-Hun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.4
    • /
    • pp.473-479
    • /
    • 2018
  • Objective: The study was designed to perform a genome-wide association (GWA) and partitioning of genome using Illumina's PorcineSNP60 Beadchip in order to identify variants and determine the explained heritability for the total number of teats in Yorkshire pig. Methods: After screening with the following criteria: minor allele frequency, $MAF{\leq}0.01$; Hardy-Weinberg equilibrium, $HWE{\leq}0.000001$, a pair-wise genomic relationship matrix was produced using 42,953 single nucleotide polymorphisms (SNPs). A genome-wide mixed linear model-based association analysis (MLMA) was conducted. And for estimating the explained heritability with genome- or chromosome-wide SNPs the genetic relatedness estimation through maximum likelihood approach was used in our study. Results: The MLMA analysis and false discovery rate p-values identified three significant SNPs on two different chromosomes (rs81476910 and rs81405825 on SSC8; rs81332615 on SSC13) for total number of teats. Besides, we estimated that 30% of variance could be explained by all of the common SNPs on the autosomal chromosomes for the trait. The maximum amount of heritability obtained by partitioning the genome were $0.22{\pm}0.05$, $0.16{\pm}0.05$, $0.10{\pm}0.03$ and $0.08{\pm}0.03$ on SSC7, SSC13, SSC1, and SSC8, respectively. Of them, SSC7 explained the amount of estimated heritability along with a SNP (rs80805264) identified by genome-wide association studies at the empirical p value significance level of 2.35E-05 in our study. Interestingly, rs80805264 was found in a nearby quantitative trait loci (QTL) on SSC7 for the teat number trait as identified in a recent study. Moreover, all other significant SNPs were found within and/or close to some QTLs related to ovary weight, total number of born alive and age at puberty in pigs. Conclusion: The SNPs we identified unquestionably represent some of the important QTL regions as well as genes of interest in the genome for various physiological functions responsible for reproduction in pigs.

Genome-Wide SNP Calling Using Next Generation Sequencing Data in Tomato

  • Kim, Ji-Eun;Oh, Sang-Keun;Lee, Jeong-Hee;Lee, Bo-Mi;Jo, Sung-Hwan
    • Molecules and Cells
    • /
    • v.37 no.1
    • /
    • pp.36-42
    • /
    • 2014
  • The tomato (Solanum lycopersicum L.) is a model plant for genome research in Solanaceae, as well as for studying crop breeding. Genome-wide single nucleotide polymorphisms (SNPs) are a valuable resource in genetic research and breeding. However, to do discovery of genome-wide SNPs, most methods require expensive high-depth sequencing. Here, we describe a method for SNP calling using a modified version of SAMtools that improved its sensitivity. We analyzed 90 Gb of raw sequence data from next-generation sequencing of two resequencing and seven transcriptome data sets from several tomato accessions. Our study identified 4,812,432 non-redundant SNPs. Moreover, the workflow of SNP calling was improved by aligning the reference genome with its own raw data. Using this approach, 131,785 SNPs were discovered from transcriptome data of seven accessions. In addition, 4,680,647 SNPs were identified from the genome of S. pimpinellifolium, which are 60 times more than 71,637 of the PI212816 transcriptome. SNP distribution was compared between the whole genome and transcriptome of S. pimpinellifolium. Moreover, we surveyed the location of SNPs within genic and intergenic regions. Our results indicated that the sufficient genome-wide SNP markers and very sensitive SNP calling method allow for application of marker assisted breeding and genome-wide association studies.

Beta-Meta: a meta-analysis application considering heterogeneity among genome-wide association studies

  • Gyungbu Kim;Yoonsuk Lee;Jeong Ho Park;Dongmin Kim;Wonseok Lee
    • Genomics & Informatics
    • /
    • v.20 no.4
    • /
    • pp.49.1-49.7
    • /
    • 2022
  • Many packages for a meta-analysis of genome-wide association studies (GWAS) have been developed to discover genetic variants. Although variations across studies must be considered, there are not many currently-accessible packages that estimate between-study heterogeneity. Thus, we propose a python based application called Beta-Meta which can easily process a meta-analysis by automatically selecting between a fixed effects and a random effects model based on heterogeneity. Beta-Meta implements flexible input data manipulation to allow multiple meta-analyses of different genotype-phenotype associations in a single process. It provides a step-by-step meta-analysis of GWAS for each association in the following order: heterogeneity test, two different calculations of an effect size and a p-value based on heterogeneity, and the Benjamini-Hochberg p-value adjustment. These methods enable users to validate the results of individual studies with greater statistical power and better estimation precision. We elaborate on these and illustrate them with examples from several studies of infertility-related disorders.

Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

  • Choi, Sungkyoung;Bae, Sunghwan;Park, Taesung
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.138-148
    • /
    • 2016
  • The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the "large p and small n" problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN). We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC) for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes.

Genome-association analysis of Korean Holstein milk traits using genomic estimated breeding value

  • Shin, Donghyun;Lee, Chul;Park, Kyoung-Do;Kim, Heebal;Cho, Kwang-hyeon
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.30 no.3
    • /
    • pp.309-319
    • /
    • 2017
  • Objective: Holsteins are known as the world's highest-milk producing dairy cattle. The purpose of this study was to identify genetic regions strongly associated with milk traits (milk production, fat, and protein) using Korean Holstein data. Methods: This study was performed using single nucleotide polymorphism (SNP) chip data (Illumina BovineSNP50 Beadchip) of 911 Korean Holstein individuals. We inferred each genomic estimated breeding values based on best linear unbiased prediction (BLUP) and ridge regression using BLUPF90 and R. We then performed a genome-wide association study and identified genetic regions related to milk traits. Results: We identified 9, 6, and 17 significant genetic regions related to milk production, fat and protein, respectively. These genes are newly reported in the genetic association with milk traits of Holstein. Conclusion: This study complements a recent Holstein genome-wide association studies that identified other SNPs and genes as the most significant variants. These results will help to expand the knowledge of the polygenic nature of milk production in Holsteins.

Efficient Strategy to Identify Gene-Gene Interactions and Its Application to Type 2 Diabetes

  • Li, Donghe;Wo, Sungho
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.160-165
    • /
    • 2016
  • Over the past decade, the detection of gene-gene interactions has become more and more popular in the field of genome-wide association studies (GWASs). The goal of the GWAS is to identify genetic susceptibility to complex diseases by assaying and analyzing hundreds of thousands of single-nucleotide polymorphisms. However, such tests are computationally demanding and methodologically challenging. Recently, a simple but powerful method, named "BOolean Operation-based Screening and Testing" (BOOST), was proposed for genome-wide gene-gene interaction analyses. BOOST was designed with a Boolean representation of genotype data and is approximately equivalent to the log-linear model. It is extremely fast, and genome-wide gene-gene interaction analyses can be completed within a few hours. However, BOOST can not adjust for covariate effects, and its type-1 error control is not correct. Thus, we considered two-step approaches for gene-gene interaction analyses. First, we selected gene-gene interactions with BOOST and applied logistic regression with covariate adjustments to select gene-gene interactions. We applied the two-step approach to type 2 diabetes (T2D) in the Korea Association Resource (KARE) cohort and identified some promising pairs of single-nucleotide polymorphisms associated with T2D.

A replication study of genome-wide CNV association for hepatic biomarkers identifies nine genes associated with liver function

  • Kim, Hyo-Young;Byun, Mi-Jeong;Kim, Hee-Bal
    • BMB Reports
    • /
    • v.44 no.9
    • /
    • pp.578-583
    • /
    • 2011
  • Aspartate aminotransferase (AST) and alanine aminotransferase (ALT) are biochemical markers used to test for liver diseases. Copy number variation (CNV) plays an important role in determining complex traits and is an emerging area in the study various diseases. We performed a genome-wide association study with liver function biomarkers AST and ALT in 407 unrelated Koreans. We assayed the genome-wide variations on an Affymetrix Genome-Wide 6.0 array, and CNVs were analyzed using HelixTree. Using single linear regression, 32 and 42 CNVs showed significance for AST and ALT, respectively (P value < 0.05). We compared CNV-based genes between the current study (KARE2; AST-140, ALT-172) and KARE1 (AST-1885, ALT-773) using NetBox. Results showed 9 genes (CIDEB, DFFA, PSMA3, PSMC5, PSMC6, PSMD12, PSMF1, SDC4, and SIAH1) were overlapped for AST, but no overlapped genes were found for ALT. Functional gene annotation analysis shown the proteasome pathway, Wnt signaling pathway, programmed cell death, and protein binding.

Genome-Wide Association Studies of the Korea Association REsource (KARE) Consortium

  • Hong, Kyung-Won;Kim, Hyung-Lae;Oh, Berm-Seok
    • Genomics & Informatics
    • /
    • v.8 no.3
    • /
    • pp.101-102
    • /
    • 2010
  • During the last decade, large community cohorts have been established by the Korea National Institutes of Health (KNIH), and enormous epidemiological and clinical data have been accumulated. Using these information and samples in the cohorts, KNIH set out to do a large-scale genome-wide association study (GWAS) in 2007, and the Korea Association REsource (KARE) consortium was launched to analyze the data to identify the underlying genetic risk factors of diseases and diverse health indexes, such as blood pressure, obesity, bone density, and blood biochemical traits. The consortium consisted of 6 research divisions, formed by 25 principal investigators in 19 organizations, including 18 universities, 2 institutes, and 1 company. Each division focused on one of the following subjects: the identification of genetic factors, the statistical analysis of gene-gene interactions, the genetic epidemiology of gene-environment interactions, copy number variation, the bioinformatics related to a GWAS, and a GWAS of nutrigenomics. In this special issue, the study results of the KARE consortium are provided as 9 articles. We hope that this special issue might encourage the genomics community to share data and scientists, including clinicians, to analyze the valuable Korean data of KARE.

Lack of Replication of Genetic Association with Body Mass Index Detected by Genome-wide Association Study

  • Lee, Hae-In;Kim, Jae-Jung;Park, Tae-Sung;Kim, Kyung-A;Lee, Jong-Eun;Cho, Yoon-Shin;Lee, Jong-Young;Han, Bok-Ghee;Lee, Jong-Keuk
    • Genomics & Informatics
    • /
    • v.9 no.2
    • /
    • pp.59-63
    • /
    • 2011
  • Obesity provokes many serious human diseases, including various cardiovascular diseases and diabetes. Body mass index (BMI) is a highly heritable trait that is broadly used to diagnose obesity. To identify genetic loci associated with obesity in Asians, we conducted a genome-wide association study (GWAS) of a population of Korean adults (n=6,742, age 40~60 years) and detected six BMI risk loci (TNR, FAM124B, RGS12, NFE2L3, MC4R and FTO) having p< $1{\times}10^{-5}$. However, in the replication study, only melanocortin 4 receptor gene (MC4R) (rs9946888, p=$4.58{\times}10^{-7}$) was replicated with marginal significance (p<0.05) in the second cohort (n=5,102, age 40~60 years). This study indicates that each locus associated with BMI has very weak genetic effect.