• Title/Summary/Keyword: Genome wide association study

Genome-wide association studies to identify quantitative trait loci and positional candidate genes affecting meat quality-related traits in pigs

  • Jae-Bong Lee;Ji-Hoon Lim;Hee-Bok Park
    • Journal of Animal Science and Technology / v.65 no.6 / pp.1194-1204 / 2023
  • Meat quality comprises a set of key traits such as pH, meat color, water-holding capacity, tenderness, and marbling. These traits are complex because they are affected by multiple genetic and environmental factors. The aim of this study was to investigate the molecular genetic basis underlying nine meat quality-related traits in a Yorkshire pig population using a genome-wide association study (GWAS) and subsequent biological pathway analysis. In total, 45,926 single nucleotide polymorphism (SNP) markers from 543 pigs were selected for the GWAS after quality control. Data were analyzed using the genome-wide efficient mixed model association (GEMMA) method. This linear mixed model-based approach identified two quantitative trait loci (QTLs) for meat color (b*) on chromosome 2 (SSC2) and one QTL for shear force on chromosome 8 (SSC8). These QTLs acted additively on the two phenotypes and explained 3.92%-4.57% of the phenotypic variance of the traits of interest. The genes encoding HAUS8 on SSC2 and an lncRNA on SSC8 were identified as positional candidate genes for these QTLs. The biological pathway analysis revealed that positional candidate genes for meat color (b*) were enriched in pathways related to muscle development, muscle growth, intramuscular adipocyte differentiation, and lipid accumulation in muscle, whereas positional candidate genes for shear force were overrepresented in pathways related to cell growth, cell differentiation, and fatty acid synthesis. Further verification of these SNPs and genes in independent populations could provide valuable information for understanding the variation in pork quality-related traits.

HisCoM-GGI: Software for Hierarchical Structural Component Analysis of Gene-Gene Interactions

  • Choi, Sungkyoung;Lee, Sungyoung;Park, Taesung
    • Genomics & Informatics / v.16 no.4 / pp.38.1-38.3 / 2018
  • Gene-gene interaction (GGI) analysis is known to play an important role in explaining missing heritability. Many previous studies have proposed software to analyze GGI, but most methods focus on a binary phenotype in a case-control design. In this study, we developed "Hierarchical structural CoMponent analysis of Gene-Gene Interactions" (HisCoM-GGI) software for GGI analysis with a continuous phenotype. The HisCoM-GGI method considers hierarchical structural relationships between genes and single nucleotide polymorphisms (SNPs), enabling both gene-level and SNP-level interaction analysis in a single model. Furthermore, this software accepts various types of genomic data and supports data management and multithreading to improve the efficiency of genome-wide association study data analysis. We expect that HisCoM-GGI software will make genetic interaction analysis more accessible to researchers and provide a more effective way to understand the biological mechanisms of complex diseases.

Validation and genetic heritability estimation of known type 2 diabetes related variants in the Korean population

  • Jang, Hye-Mi;Hwang, Mi Yeong;Kim, Bong-Jo;Kim, Young Jin
    • Genomics & Informatics / v.19 no.4 / pp.37.1-37.7 / 2021
  • Genome-wide association studies (GWASs) have facilitated the discovery of countless disease-associated variants. However, GWASs have mostly been conducted in European ancestry samples. Recent studies have reported that these European-based association results may reduce disease prediction accuracy when applied to non-Europeans. Therefore, previously reported variants should be validated in non-European populations to establish reliable scientific evidence for precision medicine. In this study, we validated known associations with type 2 diabetes (T2D) and related metabolic traits in 125,850 samples from a Korean population genotyped by the Korea Biobank Array (KBA). At the end of December 2020, there were 8,823 variants associated with glycemic traits, lipids, liver enzymes, and T2D in the GWAS catalog. Considering the availability of imputed datasets in the KBA genome data, publicly available East Asian T2D summary statistics, and the linkage disequilibrium among the variants (r² < 0.2), 2,900 independent variants were selected for further analysis. Among these, 1,837 variants (63.3%) were statistically significant (p ≤ 0.05). Most of the non-replicated variants (n = 1,063) showed insufficient statistical power and decreased minor allele frequencies compared with the replicated variants. Moreover, most of the known variants showed <10% genetic heritability. These results could provide valuable scientific evidence for future study designs, the current power of GWASs, and future applications in precision medicine in the Korean population.

Genome-wide association study of carcass weight in commercial Hanwoo cattle

  • Edea, Zewdu;Jeoung, Yeong Ho;Shin, Sung-Sub;Ku, Jaeul;Seo, Sungbo;Kim, Il-Hoi;Kim, Sang-Wook;Kim, Kwan-Suk
    • Asian-Australasian Journal of Animal Sciences / v.31 no.3 / pp.327-334 / 2018
  • Objective: The objective of the present study was to validate genes and genomic regions associated with carcass weight using a low-density single nucleotide polymorphism (SNP) chip in the Hanwoo cattle breed. Methods: Commercial Hanwoo steers (n = 220) were genotyped with the 20K GeneSeek genomic profiler BeadChip. After applying quality control criteria of call rate ≥90% and minor allele frequency (MAF) ≥0.01, a total of 15,235 autosomal SNPs remained for genome-wide association (GWA) analysis. The GWA tests were performed using a single-locus mixed linear model. Age at slaughter was fitted as a fixed effect and sire was included as a covariate. The level of genome-wide significance was set at 3.28 × 10⁻⁶ (0.05/15,235), corresponding to Bonferroni correction for 15,235 multiple independent tests. Results: By employing the EMMAX approach, which is based on a mixed linear model and accounts for population stratification and relatedness, we identified 17 and 16 loci significantly (p < 0.001) associated with carcass weight for the additive and dominant models, respectively. The second most significant (p = 0.000049) SNP (ARS-BFGL-NGS-28234) on bovine chromosome 4 (BTA4) at 21 Mb had an allele substitution effect of 43.45 kg. Some of the identified regions on BTA2, 6, 14, 22, and 24 were previously reported to be associated with quantitative trait loci for carcass weight in several beef cattle breeds. Conclusion: This is the first genome-wide association study using SNP chips on commercial Hanwoo steers, and some of the loci newly identified in this study may help to develop better DNA markers that determine increased beef production in commercial Hanwoo cattle. Further studies using a larger sample size will allow confirmation of the candidates identified in this study.
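
The Bonferroni threshold quoted in this abstract can be reproduced with a few lines of Python. This is only a minimal sketch of the arithmetic; the function name `bonferroni_threshold` is hypothetical and is not taken from the study or from any GWAS package.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance level under Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Values taken from the abstract: family-wise alpha 0.05, 15,235 SNPs tested.
threshold = bonferroni_threshold(0.05, 15235)
print(f"{threshold:.2e}")  # 3.28e-06
```

The correction treats all 15,235 tests as independent, which is conservative when neighbouring SNPs are in linkage disequilibrium.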

A Pilot Genome-wide Association Study of Breast Cancer Susceptibility Loci in Indonesia

  • Haryono, Samuel J;Datasena, I Gusti Bagus;Santosa, Wahyu Budi;Mulyarahardja, Raymond;Sari, Kartika
    • Asian Pacific Journal of Cancer Prevention / v.16 no.6 / pp.2231-2235 / 2015
  • Genome-wide association studies (GWASs) provide a systematic approach for revealing novel genetic susceptibility loci for breast cancer. However, genetic association studies have hitherto been primarily conducted in women of European ancestry. Therefore, we performed a pilot GWAS with a single nucleotide polymorphism (SNP) array 5.0 platform from Affymetrix® that contains 443,813 SNPs to search for new genetic risk factors in 89 breast cancer cases and 46 healthy women of Indonesian ancestry. The case-control association of the GWAS finding set was evaluated using PLINK. The strengths of allelic and genotypic associations were assessed using logistic regression analysis and reported as odds ratios (ORs) and P values; P values less than 1.00 × 10⁻⁸ and 5.00 × 10⁻⁵ were required for significant association and suggestive association, respectively. After analyzing 292,887 SNPs, we recognized 11 chromosome loci that possessed suggestive associations with breast cancer risk. Of these, however, there were only four chromosome loci with identified genes: chromosome 2p12 with the CTNNA2 gene [odds ratio (OR)=1.20, 95% confidence interval (CI)=1.13-1.33, P=1.08 × 10⁻⁷]; chromosome 18p11.2 with the SOGA2 gene (OR=1.32, 95%CI=1.17-1.44, P=6.88 × 10⁻⁶); chromosome 5q14.1 with the SSBP2 gene (OR=1.22, 95%CI=1.11-1.34, P=4.00 × 10⁻⁵); and chromosome 9q31.1 with the TEX10 gene (OR=1.24, 95%CI=1.12-1.35, P=4.68 × 10⁻⁵). This study identified 11 chromosome loci which exhibited suggestive associations with the risk of breast cancer among Indonesian women.

Genome wide association test to identify QTL for dressing percentage in Hanwoo (Korean title: Search for quantitative trait loci related to dressing percentage in Hanwoo through genome-wide association analysis)

  • Lee, Seung Hwan;Lim, Dajeong;Dang, Chang Gwan;Chang, Sun Sik;Kim, Hyeong Cheul;Jeon, Gi Jun;Yeon, Seong Hum;Jang, Gul Won;Park, Eung Woo;Oh, Jae Don;Lee, Hak Kyo;Lee, Jun Heon;Kang, Hee Sul;Yoon, Duhak
    • Korean Journal of Agricultural Science / v.40 no.2 / pp.155-162 / 2013
  • A genome-wide association study was performed on data from 266 Hanwoo (Korean cattle) steers derived from 66 sires using a bovine 10K mapping chip. SNPs were excluded from the analysis if they failed in over 5% of the genotypes, had median GC scores below 0.6, had GC scores under 0.6 in less than 90% of the samples, deviated in heterozygosity more than 3 standard deviations from the other SNPs, or were out of Hardy-Weinberg equilibrium at a cutoff p-value of 10⁻¹⁵. Unmapped SNPs and SNPs on sex chromosomes were also excluded. A total of 4,522 SNPs were included in the analysis. To test for association between SNPs and QTL, a GWAS under five genetic models (additive, dominant, overdominant, recessive, and codominant) was implemented in this study. Three SNPs (rs29018694, ss46526851, and rs29018222) at a threshold of p < 1.11 × 10⁻⁵ were detected on BTA12 and BTA21 for dressing percentage under the codominant and recessive genetic models. The G allele of rs29018694 had a 4.9% higher dressing percentage than the A allele, while the T allele of ss46526851 had a 2.57% higher dressing percentage than the C allele. Therefore, rs29018694 showed a bigger effect than the other two SNPs (ss46526851 and rs29018222) in this study. In conclusion, this study identifies three loci with moderate effects and many loci with infinitesimally small effects across the genome in Hanwoo.
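
The five genetic models named in this abstract differ only in how a genotype is turned into a predictor. The sketch below uses the conventional textbook codings for a biallelic SNP; the abstract does not state which coding or software the authors used, so the function `encode_genotype` and its argument names are assumptions made for illustration.

```python
def encode_genotype(n_minor: int, model: str):
    """Encode a genotype (0, 1, or 2 copies of the minor allele) under a genetic model.

    Conventional codings, assumed here rather than taken from the study:
      additive     -> allele count (0, 1, 2)
      dominant     -> 1 if at least one minor allele is present
      recessive    -> 1 only for the minor-allele homozygote
      overdominant -> 1 only for the heterozygote
      codominant   -> two indicators (heterozygote, minor homozygote)
    """
    if n_minor not in (0, 1, 2):
        raise ValueError("n_minor must be 0, 1, or 2")
    if model == "additive":
        return n_minor
    if model == "dominant":
        return int(n_minor >= 1)
    if model == "recessive":
        return int(n_minor == 2)
    if model == "overdominant":
        return int(n_minor == 1)
    if model == "codominant":
        return (int(n_minor == 1), int(n_minor == 2))
    raise ValueError(f"unknown model: {model}")


# Encode the three possible genotypes under each model.
for model in ("additive", "dominant", "recessive", "overdominant", "codominant"):
    print(model, [encode_genotype(g, model) for g in (0, 1, 2)])
```

The codominant model uses two indicator variables, so it spends one more degree of freedom than the other four models.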

Review of Biological Network Data and Its Applications

  • Yu, Donghyeon;Kim, MinSoo;Xiao, Guanghua;Hwang, Tae Hyun
    • Genomics & Informatics / v.11 no.4 / pp.200-210 / 2013
  • Studying biological networks, such as protein-protein interactions, is key to understanding complex biological activities. Various types of large-scale biological datasets have been collected and analyzed with high-throughput technologies, including DNA microarray, next-generation sequencing, and the two-hybrid screening system, for this purpose. In this review, we focus on network-based approaches that help in understanding biological systems and identifying biological functions. Accordingly, this paper covers two major topics in network biology: reconstruction of gene regulatory networks and network-based applications, including protein function prediction, disease gene prioritization, and network-based genome-wide association study.

Construction of an Analysis System Using Digital Breeding Technology for the Selection of Capsicum annuum

  • Donghyun Jeon;Sehyun Choi;Yuna Kang;Changsoo Kim
    • Proceedings of the Korean Society of Crop Science Conference / 2022.10a / pp.233-233 / 2022
  • As the world's population grows and food needs diversify, the demand for horticultural crops with beneficial traits is increasing. To meet this demand, it is necessary to develop suitable cultivars and breeding methods. Breeding methods have changed over time. With the recent development of sequencing technology, the concept of genomic selection (GS) has emerged, as large-scale genome information can now be used. GS shows good predictive ability even for quantitative traits by using many markers, overcoming the limitations of marker-assisted selection (MAS). Moreover, GS using machine learning (ML) and deep learning (DL) has been studied recently. In this study, we aim to build a system that selects phenotype-related markers using the genomic information of a pepper population and trains a genomic selection model to select individuals from the validation population. We plan to establish an optimal genome-wide association analysis model by comparing and analyzing five models. We will then validate molecular markers by applying linkage markers discovered through genome-wide association analysis to breeding populations. Finally, we plan to establish an optimal genomic selection model by comparing and analyzing 12 genomic selection models. We will then apply the genomic selection model trained on the learning group to the breeding group to verify the prediction accuracy and identify a prediction model.

Chromosome-specific polymorphic SSR markers in tropical eucalypt species using low coverage whole genome sequences: systematic characterization and validation

  • Patturaj, Maheswari;Munusamy, Aiswarya;Kannan, Nithishkumar;Kandasamy, Ulaganathan;Ramasamy, Yasodha
    • Genomics & Informatics / v.19 no.3 / pp.33.1-33.10 / 2021
  • Eucalyptus is one of the major plantation species with a wide variety of industrial uses. Polymorphic and informative simple sequence repeats (SSRs) have a broad range of applications in genetic analysis. In this study, two individuals of Eucalyptus tereticornis (ET217 and ET86) and one individual each of E. camaldulensis (EC17) and E. grandis (EG9) were subjected to whole genome resequencing. Low coverage (10×) genome sequencing was used to find polymorphic SSRs between the individuals. The average number of SSR loci identified was 95,513 and the density of SSRs per Mb ranged from 155.08 in EC17 to 157.39 in EG9. Among all the SSRs detected, the most abundant repeat motifs were di-nucleotide (59.6%-62.5%), followed by tri- (23.7%-27.2%), tetra- (5.2%-5.6%), penta- (5.0%-5.3%), and hexa-nucleotide (2.7%-2.9%). The predominant SSR motif units were AG/CT and AAG/TTC. Computational genome analysis predicted the SSR length variations between the individuals and identified the gene functions of SSR-containing sequences. A selected subset of polymorphic markers was validated in a full-sib family of eucalypts. Additionally, genome-wide characterization of single nucleotide polymorphisms, InDels, and transcriptional regulators was carried out. These variations will find their utility in genome-wide association studies as well as in understanding the molecular mechanisms involved in key economic traits. The genomic resources generated in this study would provide an impetus to integrate genomics in marker-trait associations and breeding of tropical eucalypts.
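
To make the di- to hexa-nucleotide SSR classes in this abstract concrete, the following is a minimal regex-based sketch of SSR detection. The abstract does not name the tool or the thresholds used in the study, so the function `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are hypothetical choices made purely for illustration.

```python
import re

# Hypothetical minimum repeat counts per motif length (not taken from the study).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq: str, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for perfect tandem repeats in seq."""
    seq = seq.upper()
    hits = []
    for k, n in min_repeats.items():
        # A k-mer followed by at least n-1 further copies of itself.
        pattern = re.compile(rf"([ACGT]{{{k}}})\1{{{n - 1},}}")
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. AGAG = AG x 2).
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)


example = "TTT" + "AG" * 8 + "CCC" + "AAG" * 5 + "GT"
print(find_ssrs(example))  # [(3, 'AG', 8), (22, 'AAG', 5)]
```

Dividing the number of hits by the assembled sequence length in Mb gives a density figure comparable in kind to the per-Mb values reported above.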

Impact of type 2 diabetes variants identified through genome-wide association studies in early-onset type 2 diabetes from South Indian population

  • Liju, Samuel;Chidambaram, Manickam;Mohan, Viswanathan;Radha, Venkatesan
    • Genomics & Informatics / v.18 no.3 / pp.27.1-27.12 / 2020
  • The prevalence of early-onset type 2 diabetes (EOT2D) is increasing in Asian countries. Genome-wide association studies performed in European and various other populations have identified associations of numerous variants with type 2 diabetes in adults. However, the genetic component of EOT2D, which is still unexplored, could have similarities with late-onset type 2 diabetes. In the present study, we aimed to identify the association of variants with EOT2D in a South Indian population. Twenty-five variants from 18 gene loci were genotyped in 1,188 EOT2D and 1,183 normal glucose tolerant subjects using the MassARRAY technology. We confirm the association of the HHEX variant rs1111875 with EOT2D in this South Indian population and also the association of CDKN2A/2B (rs7020996) and TCF7L2 (rs4506565) with EOT2D. Logistic regression analyses of the TCF7L2 variant rs4506565 (A/T) showed that the heterozygous and homozygous carriers of allele 'T' have odds ratios of 1.47 (95% confidence interval [CI], 1.17 to 1.83; p = 0.001) and 1.65 (95% CI, 1.18 to 2.28; p = 0.006), respectively, relative to the AA homozygote. For the HHEX variant rs1111875 (T/C), heterozygous and homozygous carriers of allele 'C' have odds ratios of 1.13 (95% CI, 0.91 to 1.42; p = 0.27) and 1.58 (95% CI, 1.17 to 2.12; p = 0.003), respectively, relative to the TT homozygote. For the CDKN2A/2B variant rs7020996, the heterozygous and homozygous carriers of allele 'C' were protected, with odds ratios of 0.65 (95% CI, 0.51 to 0.83; p = 0.0004) and 0.62 (95% CI, 0.27 to 1.39; p = 0.24), respectively, relative to the TT homozygote. This is the first study to report on the association of the HHEX variant rs1111875 with EOT2D in this population.
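
An odds ratio and its 95% confidence interval carry enough information to approximate the underlying Wald test, which is a quick way to sanity-check reported p-values. The sketch below assumes the interval is a symmetric Wald interval on the log-odds scale, as is usual for logistic regression; the function names are hypothetical, and the result is only approximate because the published figures are rounded.

```python
import math

Z_95 = 1.959964  # two-sided 95% critical value of the standard normal


def se_from_ci(lower: float, upper: float) -> float:
    """Standard error of log(OR), recovered from a 95% Wald confidence interval."""
    return (math.log(upper) - math.log(lower)) / (2 * Z_95)


def wald_test(odds_ratio: float, lower: float, upper: float):
    """Return (z, two-sided p) implied by an odds ratio and its 95% CI."""
    z = math.log(odds_ratio) / se_from_ci(lower, upper)
    p = math.erfc(abs(z) / math.sqrt(2))  # P(|Z| > |z|) for a standard normal
    return z, p


# TCF7L2 rs4506565 heterozygotes from the abstract: OR 1.47, 95% CI 1.17 to 1.83.
z, p = wald_test(1.47, 1.17, 1.83)
print(f"z = {z:.2f}, p = {p:.4f}")
```

For this variant the implied p-value is of the same order as the reported p = 0.001; small differences are expected from rounding of the odds ratio and interval limits.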