• Title/Summary/Keyword: Genome-wide Association (GWA)

Search Result 16, Processing Time 0.025 seconds

Joint Identification of Multiple Genetic Variants of Obesity in a Korean Genome-wide Association Study

  • Oh, So-Hee;Cho, Seo-Ae;Park, Tae-Sung
    • Genomics & Informatics
    • /
    • v.8 no.3
    • /
    • pp.142-149
    • /
    • 2010
  • In recent years, genome-wide association (GWA) studies have successfully led to many discoveries of genetic variants affecting common complex traits, including height, blood pressure, and diabetes. Although GWA studies have made much progress in finding single nucleotide polymorphisms (SNPs) associated with many complex traits, such SNPs have been shown to explain only a very small proportion of the underlying genetic variance of complex traits. This is partly due to that fact that most current GWA studies have relied on single-marker approaches that identify single genetic factors individually and have limitations in considering the joint effects of multiple genetic factors on complex traits. Joint identification of multiple genetic factors would be more powerful and provide a better prediction of complex traits, since it utilizes combined information across variants. Recently, a new statistical method for joint identification of genetic variants for common complex traits via the elastic-net regularization method was proposed. In this study, we applied this joint identification approach to a large-scale GWA dataset (i.e., 8842 samples and 327,872 SNPs) in order to identify genetic variants of obesity for the Korean population. In addition, in order to test for the biological significance of the jointly identified SNPs, gene ontology and pathway enrichment analyses were further conducted.

Post-GWAS Strategies

  • Kim, Sang-Soo;Bhak, Jong
    • Genomics & Informatics
    • /
    • v.9 no.1
    • /
    • pp.1-4
    • /
    • 2011
  • Genome-wide association (GWA) studies are the method of choice for discovering loci associated with common diseases. More than a thousand GWA studies have reported successful identification of statistically significant association signals in human genomes for a variety of complex diseases. In this review, I discuss some of the issues related to the future of GWA studies and their biomedical applications.

Short Reads Phasing to Construct Haplotypes in Genomic Regions That Are Associated with Body Mass Index in Korean Individuals

  • Lee, Kichan;Han, Seonggyun;Tark, Yeonjeong;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • v.12 no.4
    • /
    • pp.165-170
    • /
    • 2014
  • Genome-wide association (GWA) studies have found many important genetic variants that affect various traits. Since these studies are useful to investigate untyped but causal variants using linkage disequilibrium (LD), it would be useful to explore the haplotypes of single-nucleotide polymorphisms (SNPs) within the same LD block of significant associations based on high-density variants from population references. Here, we tried to make a haplotype catalog affecting body mass index (BMI) through an integrative analysis of previously published whole-genome next-generation sequencing (NGS) data of 7 representative Korean individuals and previously known Korean GWA signals. We selected 435 SNPs that were significantly associated with BMI from the GWA analysis and searched 53 LD ranges nearby those SNPs. With the NGS data, the haplotypes were phased within the LDs. A total of 44 possible haplotype blocks for Korean BMI were cataloged. Although the current result constitutes little data, this study provides new insights that may help to identify important haplotypes for traits and low variants nearby significant SNPs. Furthermore, we can build a more comprehensive catalog as a larger dataset becomes available.

Relevance Epistasis Network of Gastritis for Intra-chromosomes in the Korea Associated Resource (KARE) Cohort Study

  • Jeong, Hyun-hwan;Sohn, Kyung-Ah
    • Genomics & Informatics
    • /
    • v.12 no.4
    • /
    • pp.216-224
    • /
    • 2014
  • Gastritis is a common but a serious disease with a potential risk of developing carcinoma. Helicobacter pylori infection is reported as the most common cause of gastritis, but other genetic and genomic factors exist, especially single-nucleotide polymorphisms (SNPs). Association studies between SNPs and gastritis disease are important, but results on epistatic interactions from multiple SNPs are rarely found in previous genome-wide association (GWA) studies. In this study, we performed computational GWA case-control studies for gastritis in Korea Associated Resource (KARE) data. By transforming the resulting SNP epistasis network into a gene-gene epistasis network, we also identified potential gene-gene interaction factors that affect the susceptibility to gastritis.

A Scheme for Filtering SNPs Imputed in 8,842 Korean Individuals Based on the International HapMap Project Data

  • Lee, Ki-Chan;Kim, Sang-Soo
    • Genomics & Informatics
    • /
    • v.7 no.2
    • /
    • pp.136-140
    • /
    • 2009
  • Genome-wide association (GWA) studies may benefit from the inclusion of imputed SNPs into their dataset. Due to its predictive nature, the imputation process is typically not perfect. Thus, it would be desirable to develop a scheme for filtering out the imputed SNPs by maximizing the concordance with the observed genotypes. We report such a scheme, which is based on the combination of several parameters that are calculated by PLINK, a popular GWA analysis software program. We imputed the genotypes of 8,842 Korean individuals, based on approximately 2 million SNP genotypes of the CHB+JPT panel in the International HapMap Project Phase II data, complementing the 352k SNPs in the original Affymetrix 5.0 dataset. A total of 333,418 SNPs were found in both datasets, with a median concordance rate of 98.7%. The concordance rates were calculated at different ranges of parameters, such as the number of proxy SNPs (NPRX), the fraction of successfully imputed individuals (IMPUTED), and the information content (INFO). The poor concordance that was observed at the lower values of the parameters allowed us to develop an optimal combination of the cutoffs (IMPUTED${\geq}$0.9 and INFO${\geq}$0.9). A total of 1,026,596 SNPs passed the cutoff, of which 94,364 were found in both datasets and had 99.4% median concordance. This study illustrates a conservative scheme for filtering imputed SNPs that would be useful in GWA studies.

Gene-set based genome-wide association analysis for the speed of sound in two skeletal sites of Korean women

  • Kwon, Ji-Sun;Kim, Sangsoo
    • BMB Reports
    • /
    • v.47 no.6
    • /
    • pp.348-353
    • /
    • 2014
  • The speed of sound (SOS) value is an indicator of bone mineral density (BMD). Previous genome-wide association (GWA) studies have identified a number of genes, whose variations may affect BMD levels. However, their biological implications have been elusive. We re-analyzed the GWA study dataset for the SOS values in skeletal sites of 4,659 Korean women, using a gene-set analysis software, GSA-SNP. We identified 10 common representative GO terms, and 17 candidate genes between these two traits (PGS < 0.05). Implication of these GO terms and genes in the bone mechanism is well supported by the literature survey. Interestingly, the significance levels of some member genes were inversely related, in several gene-sets that were shared between two skeletal sites. This implies that biological process, rather than SNP or gene, is the substantial unit of genetic association for SOS in bone. In conclusion, our findings may provide new insights into the biological mechanisms for BMD.

Identify Major Gene-Gene Interaction Effects Using SNPHarvester (SNPHarvester를 활용한 주요 유전자 상호작용 효과 감명)

  • Lee, Jea-Young;Kim, Dong-Chul
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.6
    • /
    • pp.915-923
    • /
    • 2009
  • The gene which is related in the disease of the human has been searched among numerous genes in GWA(Genome-Wide Association) research. However, most current statistical methods used to detect gene-gene interactions in disease association studies cannot be easily applied to handle the whole genome association study(GWAS) due to heavy computing. Therefore SNPHarvester is developed to find the main gene group among numerous genes. This research finds the superior gene groups which are related with the economic traits of the Korean beef cattle, not that of human, among sets of SNPs by using SNPHarvester, and also finds the superior genotypes which can enhance various qualities of Korean beef among SNP groups.

Genome-wide association study of carcass weight in commercial Hanwoo cattle

  • Edea, Zewdu;Jeoung, Yeong Ho;Shin, Sung-Sub;Ku, Jaeul;Seo, Sungbo;Kim, Il-Hoi;Kim, Sang-Wook;Kim, Kwan-Suk
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.3
    • /
    • pp.327-334
    • /
    • 2018
  • Objective: The objective of the present study was to validate genes and genomic regions associated with carcass weight using a low-density single nucleotide polymorphism (SNP) Chip in Hanwoo cattle breed. Methods: Commercial Hanwoo steers (n = 220) were genotyped with 20K GeneSeek genomic profiler BeadChip. After applying the quality control of criteria of a call rate ${\geq}90%$ and minor allele frequency (MAF) ${\geq}0.01$, a total of 15,235 autosomal SNPs were left for genome-wide association (GWA) analysis. The GWA tests were performed using single-locus mixed linear model. Age at slaughter was fitted as fixed effect and sire included as a covariate. The level of genome-wide significance was set at $3.28{\times}10^{-6}$ (0.05/15,235), corresponding to Bonferroni correction for 15,235 multiple independent tests. Results: By employing EMMAX approach which is based on a mixed linear model and accounts for population stratification and relatedness, we identified 17 and 16 loci significantly (p<0.001) associated with carcass weight for the additive and dominant models, respectively. The second most significant (p = 0.000049) SNP (ARS-BFGL-NGS-28234) on bovine chromosome 4 (BTA4) at 21 Mb had an allele substitution effect of 43.45 kg. Some of the identified regions on BTA2, 6, 14, 22, and 24 were previously reported to be associated with quantitative trait loci for carcass weight in several beef cattle breeds. Conclusion: This is the first genome-wide association study using SNP chips on commercial Hanwoo steers, and some of the loci newly identified in this study may help to better DNA markers that determine increased beef production in commercial Hanwoo cattle. Further studies using a larger sample size will allow confirmation of the candidates identified in this study.

Whole-genome association and genome partitioning revealed variants and explained heritability for total number of teats in a Yorkshire pig population

  • Uzzaman, Md. Rasel;Park, Jong-Eun;Lee, Kyung-Tai;Cho, Eun-Seok;Choi, Bong-Hwan;Kim, Tae-Hun
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.4
    • /
    • pp.473-479
    • /
    • 2018
  • Objective: The study was designed to perform a genome-wide association (GWA) and partitioning of genome using Illumina's PorcineSNP60 Beadchip in order to identify variants and determine the explained heritability for the total number of teats in Yorkshire pig. Methods: After screening with the following criteria: minor allele frequency, $MAF{\leq}0.01$; Hardy-Weinberg equilibrium, $HWE{\leq}0.000001$, a pair-wise genomic relationship matrix was produced using 42,953 single nucleotide polymorphisms (SNPs). A genome-wide mixed linear model-based association analysis (MLMA) was conducted. And for estimating the explained heritability with genome- or chromosome-wide SNPs the genetic relatedness estimation through maximum likelihood approach was used in our study. Results: The MLMA analysis and false discovery rate p-values identified three significant SNPs on two different chromosomes (rs81476910 and rs81405825 on SSC8; rs81332615 on SSC13) for total number of teats. Besides, we estimated that 30% of variance could be explained by all of the common SNPs on the autosomal chromosomes for the trait. The maximum amount of heritability obtained by partitioning the genome were $0.22{\pm}0.05$, $0.16{\pm}0.05$, $0.10{\pm}0.03$ and $0.08{\pm}0.03$ on SSC7, SSC13, SSC1, and SSC8, respectively. Of them, SSC7 explained the amount of estimated heritability along with a SNP (rs80805264) identified by genome-wide association studies at the empirical p value significance level of 2.35E-05 in our study. Interestingly, rs80805264 was found in a nearby quantitative trait loci (QTL) on SSC7 for the teat number trait as identified in a recent study. Moreover, all other significant SNPs were found within and/or close to some QTLs related to ovary weight, total number of born alive and age at puberty in pigs. Conclusion: The SNPs we identified unquestionably represent some of the important QTL regions as well as genes of interest in the genome for various physiological functions responsible for reproduction in pigs.

Identification of Causal and/or Rare Genetic Variants for Complex Traits by Targeted Resequencing in Population-based Cohorts

  • Kim, Yun-Kyoung;Hong, Chang-Bum;Cho, Yoon-Shin
    • Genomics & Informatics
    • /
    • v.8 no.3
    • /
    • pp.131-137
    • /
    • 2010
  • Genome-wide association studies (GWASs) have greatly contributed to the identification of common variants responsible for numerous complex traits. There are, however, unavoidable limitations in detecting causal and/or rare variants for traits in this approach, which depends on an LD-based tagging SNP microarray chip. In an effort to detect potential casual and/or rare variants for complex traits, such as type 2 diabetes (T2D) and triglycerides (TGs), we conducted a targeted resequencing of loci identified by the Korea Association REsource (KARE) GWAS. The target regions for resequencing comprised whole exons, exon-intron boundaries, and regulatory regions of genes that appeared within 1 Mb of the GWA signal boundary. From 124 individuals selected in population-based cohorts, a total of 0.7 Mb target regions were captured by the NimbleGen sequence capture 385K array. Subsequent sequencing, carried out by the Roche 454 Genome Sequencer FLX, generated about 110,000 sequence reads per individual. Mapping of sequence reads to the human reference genome was performed using the SSAHA2 program. An average of 62.2% of total reads was mapped to targets with an average 22X-fold coverage. A total of 5,983 SNPs (average 846 SNPs per individual) were called and annotated by GATK software, with 96.5% accuracy that was estimated by comparison with Affymetrix 5.0 genotyped data in identical individuals. About 51% of total SNPs were singletons that can be considered possible rare variants in the population. Among SNPs that appeared in exons, which occupies about 20% of total SNPs, 304 nonsynonymous singletons were tested with Polyphen to predict the protein damage caused by mutation. In total, we were able to detect 9 and 6 potentially functional rare SNPs for T2D and triglycerides, respectively, evoking a further step of replication genotyping in independent populations to prove their bona fide relevance to traits.