DOI QR코드

DOI QR Code

Multi-block Analysis of Genomic Data Using Generalized Canonical Correlation Analysis

  • Received : 2018.12.21
  • Accepted : 2018.12.26
  • Published : 2018.12.31

Abstract

Recently, there have been many studies in medicine related to genetic analysis. Many genetic studies have been performed to find genes associated with complex diseases. To find out how genes are related to disease, we need to understand not only the simple relationship of genotypes but also the way they are related to phenotype. Multi-block data, which is a summation form of variable sets, is used for enhancing the analysis of the relationships of different blocks. By identifying relationships through a multi-block data form, we can understand the association between the blocks in comprehending the correlation between them. Several statistical analysis methods have been developed to understand the relationship between multi-block data. In this paper, we will use generalized canonical correlation methodology to analyze multi-block data from the Korean Association Resource project, which has a combination of single nucleotide polymorphism blocks, phenotype blocks, and disease blocks.

Keywords

References

  1. Liu L, Li Y, Tollefsbol TO. Gene-environment interactions and epigenetic basis of human diseases. Curr Issues Mol Biol 2008;10:25-36.
  2. Tang CS, Ferreira MA. A gene-based test of association using canonical correlation analysis. Bioinformatics 2012;28:845-850. https://doi.org/10.1093/bioinformatics/bts051
  3. Kang M, Kim DC, Liu C, Gao J. Multiblock discriminant analysis for integrative genomic study. Biomed Res Int 2015;2015:783592.
  4. Meng C, Zeleznik OA, Thallinger GG, Kuster B, Gholami AM, Culhane AC. Dimension reduction techniques for the integrative analysis of multi-omics data. Brief Bioinform 2016;17:628-641. https://doi.org/10.1093/bib/bbv108
  5. Briki F, Genest D. Canonical analysis of correlated atomic motions in DNA from molecular dynamics simulation. Biophys Chem 1994;52:35-43. https://doi.org/10.1016/0301-4622(94)00063-8
  6. Eslami A, Qannari EM, Kohler A, Bougeard S. Multivariate analysis of multiblock and multigroup data. Chemometr Intell Lab Syst 2014;133:63-69. https://doi.org/10.1016/j.chemolab.2014.01.016
  7. Bersanelli M, Mosca E, Remondini D, Giampieri E, Sala C, Castellani G, et al. Methods for the integration of multi-omics data: mathematical aspects. BMC Bioinformatics 2016;17 Suppl 2:15. https://doi.org/10.1186/s12859-015-0857-9
  8. Bock C, Farlik M, Sheffield NC. Multi-omics of single cells: strategies and applications. Trends Biotechnol 2016;34:605-608. https://doi.org/10.1016/j.tibtech.2016.04.004
  9. Vilanova C, Porcar M. Are multi-omics enough? Nat Microbiol 2016;1:16101. https://doi.org/10.1038/nmicrobiol.2016.101
  10. Hardle W, Simar L. Canonical correlation analysis. In: Applied Multivariate Statistical Analysis (Johnson RA, Wichern DW, eds.). Berlin: Springer Berlin Heidelberg, 2003. pp. 361-372.
  11. Tenenhaus M, Tenenhaus A, Groenen PJ. Regularized generalized canonical correlation analysis. Psychometrika 2011;76:257-284. https://doi.org/10.1007/s11336-011-9206-8
  12. Tenenhaus A, Tenenhaus M. Regularized generalized canonical correlation analysis for multiblock or multigroup data analysis. Eur J Oper Res 2014;238:391-403. https://doi.org/10.1016/j.ejor.2014.01.008
  13. Tenenhaus M, Tenenhaus A, Groenen PJ. Regularized generalized canonical correlation analysis: a framework for sequential multiblock component methods. Psychometrika 2017;82:737-777. https://doi.org/10.1007/s11336-017-9573-x
  14. Cho YS, Go MJ, Kim YJ, Heo JY, Oh JH, Ban HJ, et al. A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat Genet 2009;41:527-534. https://doi.org/10.1038/ng.357
  15. Yu W, Kwon MS, Park T. Multivariate quantitative multifactor dimensionality reduction for detecting gene-gene interactions. Hum Hered 2015;79:168-181. https://doi.org/10.1159/000377723
  16. Jousilahti P, Vartiainen E, Tuomilehto J, Puska P. Sex, age, cardiovascular risk factors, and coronary heart disease: a prospective follow-up study of 14 786 middle-aged men and women in Finland. Circulation 1999;99:1165-1172. https://doi.org/10.1161/01.CIR.99.9.1165
  17. Go MJ, Hwang JY, Kim YJ, Hee Oh J, Kim YJ, Kwak SH, et al. New susceptibility loci in MYL2, C12orf51 and OAS1 associated with 1-h plasma glucose as predisposing risk factors for type 2 diabetes in the Korean population. J Hum Genet 2013;58:362-365. https://doi.org/10.1038/jhg.2013.14
  18. Huh MH. Quantification Analysis of Multivariate Data. Seoul: Freedom Academy, 1999. pp. 76-86.