• Title/Summary/Keyword: genome-wide association studies

Search Result 180, Processing Time 0.028 seconds

Novel Genetic Variants Associated with Lumbar Spondylosis in Koreans : A Genome-Wide Association Study

  • Kim, Hyun Ah;Heo, Seong Gu;Park, Ji Wan;Jung, Young Ok
    • Journal of Korean Neurosurgical Society
    • /
    • v.61 no.1
    • /
    • pp.66-74
    • /
    • 2018
  • Objective : The aim of this study was to identify the susceptibility genes responsible for lumbar spondylosis (LS) in Korean patients. Methods : Data from 1427 subjects were made available for radiographic grading and genome wide association studies (GWAS) analysis. Lateral lumbar spine radiographs were obtained and the various degrees of degenerative change were semi-quantitatively scored. A pilot GWAS was performed using the AffymetrixGenome-Wide Human single-nucleotide polymorphisms (SNPs), 500K array. A total of 352228 SNPs were analyzed and the association between the SNPs and case-control status was analyzed by stepwise logistic regression analyses. Results : The top 100 SNPs with a cutoff p-value of less than $3.7{\times}10^{-4}$ were selected for joint space narrowing, while a cutoff p-value of $6.0{\times}10^{-4}$ was applied to osteophytes and the Kellgren-Lawrence (K-L) osteoarthritis grade. The SNPs with the strongest effect on disc space narrowing, osteophytes, and K-L grade were serine incorporator 1 (rs155467, odds ratio [OR]=17.58, $p=1.6{\times}10^{-4}$), stromal interaction molecule 2 (STIM1, rs210781, OR=5.53, $p=5{\times}10^{-4}$), and transient receptor potential cation channel, subfamily C (rs11224760, OR=3.99, $p=4.8{\times}10^{-4}$), respectively. Leucine-rich repeat-containing G protein-coupled receptor 4 was significantly associated with both disc space narrowing and osteophytes (rs1979400, OR=2.01, $p=1.1{\times}10^{-4}$ for disc space narrowing, OR=1.79, $p=3{\times}10^{-4}$ for osteophytes), while zinc finger and BTB domain containing 7C was significantly and negatively associated with both osteophytes and a K-L grade >2 (rs12457004,OR=0.25, $p=5.8{\times}10^{-4}$ and OR=0.27, $p=5.3{\times}10^{-4}$, respectively). Conclusion : We identified SNPs that potentially contribute to the pathogenesis of LS. This is the first report of a GWAS in an Asian population.

Genome Wide Association Studies Using Multiple-lactation Breeding Value in Holsteins

  • Cho, Kwang-Hyun;Oh, Jae-Don;Kim, Hee-Bal;Park, Kyung-Do;Lee, Joon-Ho
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.28 no.3
    • /
    • pp.328-333
    • /
    • 2015
  • A genome wide association study was conducted using estimated breeding value (EBV) for milk production traits from 1st to 4th lactation. Significant single nucleotide polymorphism (SNP) markers were selected for each trait and the differences were compared by lactation. DNA samples were taken from 456 animals with EBV which are Holstein proven bulls whose semen is being sold or the daughters of old proven bulls whose semen is no longer being sold in Korea. High density genome wide SNP genotype was investigated and the significance of markers associated with traits was tested using the breeding value estimated by a multiple lactation model as a dependent variant. As the result of significance comparisons by lactations, several differences were found between the first lactation and subsequent lactations (from second to 4th lactation). A similar trend was noted in mean deviation and correlation of the estimated effects by lactation. Since there was a difference in the genes associated with EBV for each trait between first and subsequent lactations, a multi-lactation model in which lactation is considered as a different trait is genetically useful. Also, significant markers in all lactations and common markers for different traits were detected, which can be used as markers for quantitative trait loci exploration and marker assisted selection in milk production traits.

Genome-wide association study for intramuscular fat content in Chinese Lulai black pigs

  • Wang, Yanping;Ning, Chao;Wang, Cheng;Guo, Jianfeng;Wang, Jiying;Wu, Ying
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.32 no.5
    • /
    • pp.607-613
    • /
    • 2019
  • Objective: Intramuscular fat (IMF) content plays an important role in meat quality. Identification of single nucleotide polymorphisms (SNPs) and genes related to pig IMF, especially using pig populations with high IMF content variation, can help to establish novel molecular breeding tools for optimizing IMF in pork and unveil the mechanisms that underlie fat metabolism. Methods: We collected muscle samples of 453 Chinese Lulai black pigs, measured IMF content by Soxhlet petroleum-ether extraction method, and genotyped genome-wide SNPs using GeneSeek Genomic Profiler Porcine HD BeadChip. Then a genome-wide association study was performed using a linear mixed model implemented in the GEMMA software. Results: A total of 43 SNPs were identified to be significantly associated with IMF content by the cutoff p<0.001. Among these significant SNPs, the greatest number of SNPs (n = 19) were detected on Chr.9, and two linkage disequilibrium blocks were formed among them. Additionally, 17 significant SNPs are mapped to previously reported quantitative trait loci (QTLs) of IMF and confirmed previous QTLs studies. Forty-two annotated genes centering these significant SNPs were obtained from Ensembl database. Overrepresentation test of pathways and gene ontology (GO) terms revealed some enriched reactome pathways and GO terms, which mainly involved regulation of basic material transport, energy metabolic process and signaling pathway. Conclusion: These findings improve our understanding of the genetic architecture of IMF content in pork and facilitate the follow-up study of fine-mapping genes that influence fat deposition in muscle.

Genome-wide association studies to identify quantitative trait loci and positional candidate genes affecting meat quality-related traits in pigs

  • Jae-Bong Lee;Ji-Hoon Lim;Hee-Bok Park
    • Journal of Animal Science and Technology
    • /
    • v.65 no.6
    • /
    • pp.1194-1204
    • /
    • 2023
  • Meat quality comprises a set of key traits such as pH, meat color, water-holding capacity, tenderness and marbling. These traits are complex because they are affected by multiple genetic and environmental factors. The aim of this study was to investigate the molecular genetic basis underlying nine meat quality-related traits in a Yorkshire pig population using a genome-wide association study (GWAS) and subsequent biological pathway analysis. In total, 45,926 single nucleotide polymorphism (SNP) markers from 543 pigs were selected for the GWAS after quality control. Data were analyzed using a genome-wide efficient mixed model association (GEMMA) method. This linear mixed model-based approach identified two quantitative trait loci (QTLs) for meat color (b*) on chromosome 2 (SSC2) and one QTL for shear force on chromosome 8 (SSC8). These QTLs acted additively on the two phenotypes and explained 3.92%-4.57% of the phenotypic variance of the traits of interest. The genes encoding HAUS8 on SSC2 and an lncRNA on SSC8 were identified as positional candidate genes for these QTLs. The results of the biological pathway analysis revealed that positional candidate genes for meat color (b*) were enriched in pathways related to muscle development, muscle growth, intramuscular adipocyte differentiation, and lipid accumulation in muscle, whereas positional candidate genes for shear force were overrepresented in pathways related to cell growth, cell differentiation, and fatty acids synthesis. Further verification of these identified SNPs and genes in other independent populations could provide valuable information for understanding the variations in pork quality-related traits.

Genome-wide association studies on collagen contents trait for meat quality in Hanwoo

  • KyeongHye Won;Dohyun Kim;Inho Hwang;Hak-Kyo Lee;Jae-Don Oh
    • Journal of Animal Science and Technology
    • /
    • v.65 no.2
    • /
    • pp.311-323
    • /
    • 2023
  • Beef consumers valued meat quality traits such as texture, tenderness, juiciness, flavor, and meat color that determining consumers' purchasing decision. Most research on meat quality has focused on marbling, a key characteristic related to meat eating quality. However, other important traits such as meat texture, tenderness, and color have not much studied in cattle. Among these traits, meat tenderness and texture of cattle are among the most important factors affecting quality evaluation of consumers. Collagen is the main component of connective tissues.It greatly affects meat tenderness. The objective of this study was to determine significant variants and candidate genes associated with collagen contents trait (total collagen) through genome-wide association studies (GWAS). Phenotypic and genomic data from 135 Hanwoo were used. The BLUPF90 family program and GRAMMAR method for GWAS were applied in this study. A total of 73 potential single nucleotide polymorphisms (SNPs) showed significant associations with collagen content. They were located in or near 108 candidate genes. TMEM135 and ME3 genes were identified to have the most significant SNPs associated with collagen contents trait. Data indicated that these genes were related to collagen. Biological processes and pathways for the prediction of biological functions of candidate genes were confirmed. We found that candidate genes were involved in positive regulation of CREB transcription factor activity and actin cytoskeleton related to tenderness and texture of beef. Three genes (CRTC3, MYO1C and MYLK4) belonging to these biological functions were related to tenderness. These results provide a basis for improving genomic characteristics of Hanwoo for the production of tender beef. Furthermore, they could be used they could be used as an index to select desired traits for consumers.

Comparison of Two Meta-Analysis Methods: Inverse-Variance-Weighted Average and Weighted Sum of Z-Scores

  • Lee, Cue Hyunkyu;Cook, Seungho;Lee, Ji Sung;Han, Buhm
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.173-180
    • /
    • 2016
  • The meta-analysis has become a widely used tool for many applications in bioinformatics, including genome-wide association studies. A commonly used approach for meta-analysis is the fixed effects model approach, for which there are two popular methods: the inverse variance-weighted average method and weighted sum of z-scores method. Although previous studies have shown that the two methods perform similarly, their characteristics and their relationship have not been thoroughly investigated. In this paper, we investigate the optimal characteristics of the two methods and show the connection between the two methods. We demonstrate that the each method is optimized for a unique goal, which gives us insight into the optimal weights for the weighted sum of z-scores method. We examine the connection between the two methods both analytically and empirically and show that their resulting statistics become equivalent under certain assumptions. Finally, we apply both methods to the Wellcome Trust Case Control Consortium data and demonstrate that the two methods can give distinct results in certain study designs.

A Scheme for Filtering SNPs Imputed in 8,842 Korean Individuals Based on the International HapMap Project Data

  • Lee, Ki-Chan;Kim, Sang-Soo
    • Genomics & Informatics
    • /
    • v.7 no.2
    • /
    • pp.136-140
    • /
    • 2009
  • Genome-wide association (GWA) studies may benefit from the inclusion of imputed SNPs into their dataset. Due to its predictive nature, the imputation process is typically not perfect. Thus, it would be desirable to develop a scheme for filtering out the imputed SNPs by maximizing the concordance with the observed genotypes. We report such a scheme, which is based on the combination of several parameters that are calculated by PLINK, a popular GWA analysis software program. We imputed the genotypes of 8,842 Korean individuals, based on approximately 2 million SNP genotypes of the CHB+JPT panel in the International HapMap Project Phase II data, complementing the 352k SNPs in the original Affymetrix 5.0 dataset. A total of 333,418 SNPs were found in both datasets, with a median concordance rate of 98.7%. The concordance rates were calculated at different ranges of parameters, such as the number of proxy SNPs (NPRX), the fraction of successfully imputed individuals (IMPUTED), and the information content (INFO). The poor concordance that was observed at the lower values of the parameters allowed us to develop an optimal combination of the cutoffs (IMPUTED${\geq}$0.9 and INFO${\geq}$0.9). A total of 1,026,596 SNPs passed the cutoff, of which 94,364 were found in both datasets and had 99.4% median concordance. This study illustrates a conservative scheme for filtering imputed SNPs that would be useful in GWA studies.

Identification of Causal and/or Rare Genetic Variants for Complex Traits by Targeted Resequencing in Population-based Cohorts

  • Kim, Yun-Kyoung;Hong, Chang-Bum;Cho, Yoon-Shin
    • Genomics & Informatics
    • /
    • v.8 no.3
    • /
    • pp.131-137
    • /
    • 2010
  • Genome-wide association studies (GWASs) have greatly contributed to the identification of common variants responsible for numerous complex traits. There are, however, unavoidable limitations in detecting causal and/or rare variants for traits in this approach, which depends on an LD-based tagging SNP microarray chip. In an effort to detect potential casual and/or rare variants for complex traits, such as type 2 diabetes (T2D) and triglycerides (TGs), we conducted a targeted resequencing of loci identified by the Korea Association REsource (KARE) GWAS. The target regions for resequencing comprised whole exons, exon-intron boundaries, and regulatory regions of genes that appeared within 1 Mb of the GWA signal boundary. From 124 individuals selected in population-based cohorts, a total of 0.7 Mb target regions were captured by the NimbleGen sequence capture 385K array. Subsequent sequencing, carried out by the Roche 454 Genome Sequencer FLX, generated about 110,000 sequence reads per individual. Mapping of sequence reads to the human reference genome was performed using the SSAHA2 program. An average of 62.2% of total reads was mapped to targets with an average 22X-fold coverage. A total of 5,983 SNPs (average 846 SNPs per individual) were called and annotated by GATK software, with 96.5% accuracy that was estimated by comparison with Affymetrix 5.0 genotyped data in identical individuals. About 51% of total SNPs were singletons that can be considered possible rare variants in the population. Among SNPs that appeared in exons, which occupies about 20% of total SNPs, 304 nonsynonymous singletons were tested with Polyphen to predict the protein damage caused by mutation. In total, we were able to detect 9 and 6 potentially functional rare SNPs for T2D and triglycerides, respectively, evoking a further step of replication genotyping in independent populations to prove their bona fide relevance to traits.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics
    • /
    • v.19 no.3
    • /
    • pp.33.1-33.10
    • /
    • 2021
  • Eucalyptus is one of the major plantation species with wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86), one individual each from E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. Average number of SSR loci identified was 95,513 and the density of SSRs per Mb was from 157.39 in EG9 to 155.08 in EC17. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR containing sequences. Selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels and transcriptional regulators were carried out. These variations will find their utility in genome-wide association studies as well as understanding of molecular mechanisms involved in key economic traits. The genomic resources generated in this study would provide an impetus to integrate genomics in marker-trait associations and breeding of tropical eucalypts.

Statistical models and computational tools for predicting complex traits and diseases

  • Chung, Wonil
    • Genomics & Informatics
    • /
    • v.19 no.4
    • /
    • pp.36.1-36.11
    • /
    • 2021
  • Predicting individual traits and diseases from genetic variants is critical to fulfilling the promise of personalized medicine. The genetic variants from genome-wide association studies (GWAS), including variants well below GWAS significance, can be aggregated into highly significant predictions across a wide range of complex traits and diseases. The recent arrival of large-sample public biobanks enables highly accurate polygenic predictions based on genetic variants across the whole genome. Various statistical methodologies and diverse computational tools have been introduced and developed to computed the polygenic risk score (PRS) more accurately. However, many researchers utilize PRS tools without a thorough understanding of the underlying model and how to specify the parameters for the best performance. It is advantageous to study the statistical models implemented in computational tools for PRS estimation and the formulas of parameters to be specified. Here, we review a variety of recent statistical methodologies and computational tools for PRS computation.