Browse > Article
http://dx.doi.org/10.7465/jkdi.2013.24.5.1063

Multiple testing and its applications in high-dimension  

Jang, Woncheol (Department of Statistics, Seoul National University)
Publication Information
Journal of the Korean Data and Information Science Society / v.24, no.5, 2013 , pp. 1063-1076 More about this Journal
Abstract
The power of modern technology is opening a new era of big data. The size of the datasets affords us the opportunity to answer many open scientific questions but also presents some interesting challenges. High-dimensional data such as microarray are common in big data. In this paper, we give an overview of recent development of multiple testing including global and simultaneous testing and its applications to high-dimensional data.
Keywords
False discovery rate; global test; high-dimensional data; multiple test; simultaneous test;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Arias-Castro, E., Candes, E. J. and Plan, Y. (2011). Global testing under sparse alternatives: Anova, multiple comparisons and the higher criticism. Annals of Statistics, 39, 2533-2556.   DOI
2 Benjamini, Y., Drai, D., Elmer, G., Kafkafi, N. and Golani, I (2001). Controlling the false discovery rate in behavior genetics research. Behavioural Brain Research, 125, 279-284.   DOI   ScienceOn
3 Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society B, 57, 289-300.
4 Benjamini, Y. and Yekutieli, D. (2001). The control of the false discovery rate in multiple testing under dependency. Annals of Statistics, 29, 1165-1188.   DOI   ScienceOn
5 Donoho, D. L. and Jin, J. (2004). Higher criticism for detecting sparse heterogeneous mixtures, Annals of Statistics, 32, 962-994.   DOI
6 Efron, B. (2010). Large-scale inferece: Empirical Bayes methods for estimation, testing, and prediction, Cambridge University Press, Cambridge.
7 Fan, J. (1996). Test of significance based on wavelet thresholding and Neyman's truncation. Journal of the American Statistical Association, 91, 674-688.   DOI   ScienceOn
8 Fan, J. and Lin, S.-K.(1998). Test of significance when data are curves. Journal of the American Statistical Association, 93, 1007-1021.   DOI   ScienceOn
9 Genovese, C. R., Lazar, N. A. and Nichols, T. (2002). Thresholding of statistical maps in functional neuroimaging using false discovery rate. Neuroimage, 15, 870-878.   DOI   ScienceOn
10 Genovese, C. R. and Wasserman, L. A. (2004). A stochastic process approach to false discovery control. Annals of Statistics, 32, 1035-1061.   DOI
11 Jin, J. (2008). Proportion of non-zero normal means: Universal oracle equivalences and uniformly consistent estimators. Journal of the Royal Statistical Society B, 70, 461-493.   DOI   ScienceOn
12 Lindsay, B. G., Kettenring, J. and Siegmund, D. O. (2004). A report on the future of statistics. Statistical Science, 19, 387-413.   DOI   ScienceOn
13 Park, C., Ahn, J., Hendry, M. and Jang, W. (2011). Analysis of long period variable stars with nonparametric tests for trend detection. Journal of the American Statistical Association, 106, 832-845.   DOI   ScienceOn
14 Patti, M. E., Butte, A. J., Crunkhorn, S., Cusi, K., Berria, R., Kashyap, S., Miyazaki, Y., Kohane, I., Costello, M., Saccone, R., Landaker, E. J., Goldfine, A. B., Mun, E., DeFronzo, R., Finlayson, J., Kahn, C. R. and Mandarino, L. J. (2003). Coordinate reduction of genes of oxidative metabolism in humans with insulin resistance and diabetes: Potential role of PGC1 and NRF1. Proceedings of the National Academy Sciences of USA, 100, 8466-8471.
15 Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A., Bender, D., Maller, J., Sklar, P., de Bakker, P. I., Daly, M. J. and Sham, P. C. (2007). PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics, 81, 559-575.   DOI   ScienceOn
16 Storey, J. D. (2002). A direct approach to false discovery rates. Journal of the Royal Statistical Society B, 100, 9440-9445.
17 Storey, J. and Tibshirani, R. (2003). Statistical significance for genomewide studies. Proceedings of the National Academy Sciences of USA, 98, 5116-5121.
18 Tusher, V. G., Tibshirani, R. and Chu, G. (2001). Significance analysis of micorarrays applied to the ionizing radiation response. Proceedings of the National Academy Sciences of USA, 98, 5116-5121.   DOI   ScienceOn
19 Wilkinson, L. (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54, 594-604.   DOI
20 Weisberg, S. P., McCann, D., Desai, M., Rosenbaum, M., Leibel, R. L. and Ferrante, A. W. (2003). Obesity is associated with macrophage accumulation in adipose tissue. Journal of Clinical Investigation, 112, 1796-1808.   DOI
21 Wit, E. (2010). Comments on Discovering the false discovery rate by Benjamini. Journal of the Royal Statistical Society B, 72, 410-411.