Browse > Article
http://dx.doi.org/10.5808/GI.2016.14.4.173

Comparison of Two Meta-Analysis Methods: Inverse-Variance-Weighted Average and Weighted Sum of Z-Scores  

Lee, Cue Hyunkyu (Asan Institute for Life Sciences, Asan Medical Center)
Cook, Seungho (Asan Institute for Life Sciences, Asan Medical Center)
Lee, Ji Sung (Asan Institute for Life Sciences, Asan Medical Center)
Han, Buhm (Asan Institute for Life Sciences, Asan Medical Center)
Abstract
The meta-analysis has become a widely used tool for many applications in bioinformatics, including genome-wide association studies. A commonly used approach for meta-analysis is the fixed effects model approach, for which there are two popular methods: the inverse variance-weighted average method and weighted sum of z-scores method. Although previous studies have shown that the two methods perform similarly, their characteristics and their relationship have not been thoroughly investigated. In this paper, we investigate the optimal characteristics of the two methods and show the connection between the two methods. We demonstrate that the each method is optimized for a unique goal, which gives us insight into the optimal weights for the weighted sum of z-scores method. We examine the connection between the two methods both analytically and empirically and show that their resulting statistics become equivalent under certain assumptions. Finally, we apply both methods to the Wellcome Trust Case Control Consortium data and demonstrate that the two methods can give distinct results in certain study designs.
Keywords
fixed effects model; genome-wide association study; inverse variance-weighted average; meta-analysis; optimality; weighted sum of z-scores;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Evangelou E, Ioannidis JP. Meta-analysis methods for genome-wide association studies and beyond. Nat Rev Genet 2013;14:379-389.   DOI
2 de Bakker PI, Ferreira MA, Jia X, Neale BM, Raychaudhuri S, Voight BF. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet 2008; 17:R122-R128.   DOI
3 Zeggini E, Ioannidis JP. Meta-analysis in genome-wide association studies. Pharmacogenomics 2009;10:191-201.   DOI
4 Cantor RM, Lange K, Sinsheimer JS. Prioritizing GWAS results: a review of statistical methods and recommendations for their application. Am J Hum Genet 2010;86:6-22.   DOI
5 Borenstein M, Hedges LV, Higgins JP, Rothstein HR. A basic introduction to fixed-effect and random-effects models for meta-analysis. Res Synth Methods 2010;1:97-111.   DOI
6 Zeggini E, Weedon MN, Lindgren CM, Frayling TM, Elliott KS, Lango H, et al. Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science 2007;316:1336-1341.   DOI
7 Traylor M, Makela KM, Kilarski LL, Holliday EG, Devan WJ, Nalls MA, et al. A novel MMP12 locus is associated with large artery atherosclerotic stroke using a genome-wide age-at-onset informed approach. PLoS Genet 2014;10:e1004469.   DOI
8 Lee MN, Ye C, Villani AC, Raj T, Li W, Eisenhaure TM, et al. Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science 2014;343:1246980.   DOI
9 Zaitlen N, Eskin E. Imputation aware meta-analysis of genome-wide association studies. Genet Epidemiol 2010;34:537-542.   DOI
10 Raj T, Rothamel K, Mostafavi S, Ye C, Lee MN, Replogle JM, et al. Polarization of the effects of autoimmune and neurodegenerative risk alleles in leukocytes. Science 2014;344: 519-523.   DOI
11 Zeggini E, Scott LJ, Saxena R, Voight BF, Marchini JL, Hu T, et al. Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet 2008;40:638-645.   DOI
12 Furlotte NA, Kang EY, Van Nas A, Farber CR, Lusis AJ, Eskin E. Increasing association mapping power and resolution in mouse genetic studies through the use of meta-analysis for structured populations. Genetics 2012;191:959-967.   DOI
13 Fleiss JL. The statistical basis of meta-analysis. Stat Methods Med Res 1993;2:121-145.   DOI
14 Nalls MA, Pankratz N, Lill CM, Do CB, Hernandez DG, Saad M, et al. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson's disease. Nat Genet 2014;46:989-993.   DOI
15 Goodkind M, Eickhoff SB, Oathes DJ, Jiang Y, Chang A, Jones-Hagata LB, et al. Identification of a common neurobiological substrate for mental illness. JAMA Psychiatry 2015;72:305-315.   DOI
16 Sul JH, Han B, Ye C, Choi T, Eskin E. Effectively identifying eQTLs from multiple tissues by combining mixed model and meta-analytic approaches. PLoS Genet 2013;9:e1003491.   DOI
17 Fisher RA. Statistical Methods for Research Workers. Edinburgh: Oliver and Boyd, 1925.
18 Han B, Eskin E. Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. Am J Hum Genet 2011;88:586-598.   DOI
19 Liptak T. On the combination of independent events. Magyar Tud Akad Mat Kutato Int Kozl 1958;3:171-197.
20 Zaykin DV. Optimally weighted Z-test is a powerful method for combining probabilities in meta-analysis. J Evol Biol 2011;24:1836-1841.   DOI
21 Zhou B, Shi J, Whittemore AS. Optimal methods for metaanalysis of genome-wide association studies. Genet Epidemiol 2011;35:581-591.   DOI
22 Cochran WG. The combination of estimates from different experiments. Biometrics 1954;10:101-129.   DOI
23 Mantel N, Haenszel W. Statistical aspects of the analysis of data from retrospective studies of disease. J Natl Cancer Inst 1959;22:719-748.
24 Won S, Morris N, Lu Q, Elston RC. Choosing an optimal method to combine P-values. Stat Med 2009;28:1537-1553.   DOI
25 Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 2007;447:661-678.   DOI
26 Birch MW. The detection of partial association, I: the 2 $\times$ 2 case. J R Stat Soc Series B 1964;26:313-324.
27 Pereira TV, Patsopoulos NA, Salanti G, Ioannidis JP. Discovery properties of genome-wide association signals from cumulatively combined data sets. Am J Epidemiol 2009;170:1197-1206.   DOI
28 Greene WH. Econometric Analysis. Harlow: Pearson Education, 2011.