DOI QR코드

DOI QR Code

HisCoM-PCA: software for hierarchical structural component analysis for pathway analysis based using principal component analysis

  • Jiang, Nan (Interdisciplinary Program in Bioinformatics, Seoul National University) ;
  • Lee, Sungyoung (Center for Precision Medicine, Seoul National University Hospital) ;
  • Park, Taesung (Interdisciplinary Program in Bioinformatics, Seoul National University)
  • Received : 2020.02.28
  • Accepted : 2020.03.11
  • Published : 2020.03.31

Abstract

In genome-wide association studies, pathway-based analysis has been widely performed to enhance interpretation of single-nucleotide polymorphism association results. We proposed a novel method of hierarchical structural component model (HisCoM) for pathway analysis of common variants (HisCoM for pathway analysis of common variants [HisCoM-PCA]) which was used to identify pathways associated with traits. HisCoM-PCA is based on principal component analysis (PCA) for dimensional reduction of single nucleotide polymorphisms in each gene, and the HisCoM for pathway analysis. In this study, we developed a HisCoM-PCA software for the hierarchical pathway analysis of common variants. HisCoM-PCA software has several features. Various principle component scores selection criteria in PCA step can be specified by users who want to summarize common variants at each gene-level by different threshold values. In addition, multiple public pathway databases and customized pathway information can be used to perform pathway analysis. We expect that HisCoM-PCA software will be useful for users to perform powerful pathway analysis.

Keywords

References

  1. Xue A, Wu Y, Zhu Z, Zhang F, Kemper KE, Zheng Z, et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat Commun 2018;9:2941. https://doi.org/10.1038/s41467-018-04951-w
  2. Prasad RB, Groop L. Genetics of type 2 diabetes-pitfalls and possibilities. Genes (Basel) 2015;6:87-123. https://doi.org/10.3390/genes6010087
  3. Lee S, Choi S, Kim YJ, Kim BJ; T2d-Genes Consortium, Hwang H, et al. Pathway-based approach using hierarchical components of collapsed rare variants. Bioinformatics 2016;32:i586-i594.
  4. Choi S, Lee S, Kim Y, Hwang H, Park T. HisCoM-GGI: Hierarchical structural component analysis of gene-gene interactions. J Bioinform Comput Biol 2018;16:1840026. https://doi.org/10.1142/s0219720018400267
  5. Jiang N, Lee S, Park T. Hierarchical structural component model for pathway analysis of common variants. BMC Med Genomics 2020;13(Suppl 3):26. https://doi.org/10.1186/s12920-019-0650-0
  6. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res 2004;32:D277-D280. https://doi.org/10.1093/nar/gkh063
  7. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005;102:15545-15550. https://doi.org/10.1073/pnas.0506580102
  8. Liberzon A, Birger C, Thorvaldsdottir H, Ghandi M, Mesirov JP, Tamayo P. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell Syst 2015;1:417-425. https://doi.org/10.1016/j.cels.2015.12.004