INVITED PAPER MULTIVARIATE ANALYSIS FOR THE CASE WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE

  • Fujikoshi, Yasunori (Department of Statistics, Graduate School of Science, Hiroshima University)
  • Published : 2004.03.01

Abstract

This paper is concerned with statistical methods for multivariate data when the number p of variables is large compared to the sample size n. Such data appear typically in analysis of DNA microarrays, curve data, financial data, etc. However, there is little statistical theory for high dimensional data. On the other hand, there are some asymptotic results under the assumption that both and p tend to $\infty$, in some ratio p/n ${\rightarrow}$c. The results suggest that the new asymptotic results are more useful and insightful than the classical large sample asymptotics. The main purpose of this paper is to review some asymptotic results for high dimensional statistics as well as classical statistics under a high dimensional asymptotic framework.

Keywords

References

  1. An Introduction to Multivariate Statistical Analysis(2nd ed.) Anderson,T.W.
  2. Statistica Sinica v.9 Methodologies in spectral analysis of large-dimensional random matrices, a review Bai,Z.D.
  3. Theory of Probability and Its Applications v.32 On the limiting empirical distribution function of the eigenvalues of a multivariate F matrix Bai,Z.D.;Yin,Y.Q.;Krishnaiah,P.R.
  4. Soviet Mathcmatics Doklady v.11 Representation of statistics of discriminant analysis and asymptotic expansion when space dimensions are comparable with sample size Deev,A.D.
  5. The Annals of Mathematical Statistics v.29 A high dimensional two sample significance test Dempster,A.P.
  6. Biometrics v.16 A significance test for the separation of two highly multivariate small samples Dempster,A.P.
  7. Journal of the American Statistical Association v.84 Regularized discriminant analysis Friedman,J.H.
  8. Journal of Multivariate Analysis v.73 Error bounds for asymptotic approximations of the linear discriminant function when the sample sizes and dimensionality are large Fujikoshi,Y.
  9. Technical report No. 03-17 Asymptotic results in canonical discriminant analysis when the dimension is large compared to the sample size Fujikoshi,Y.;Himeno,T.;Wakaki,H.
  10. Technical report No. 03-18 Asymptotic results of a high dimensional MANOVA test and power comparison when the dimension is large compared to the sample size Fujikoshi,Y.;Himeno,T.;Wakaki,H.
  11. Random Operators and Stochastic Equations v.6 Asymptotic approximations for EPMC's of the linear and the quadratic discriminant functions when the samples sizes and the dimension are large Fujikoshi,Y.;Seo,T.
  12. The Annals of Probability v.8 A limit theorem for the norm of random matrices Geman,S.
  13. The Annals of Statistics v.29 On the distribution of the largest eigenvalue in principal component analysis Johnstone,I.M.
  14. Journal of Multivariate Analysis v.12 Some limit theorems for eigenvalues of a sample covariance matrix Jonsson,D.
  15. Biometrics v.24 On expected probabilities of misclassification in discriminant analysis, Necessary sample size, and a relation with the multiple correlation coefficient Lachenbruch,P.A.
  16. The Annals of Statistics v.30 Some hypothesis tests for the covariance matrix when the dimension is large compared to the sample size Ledoit,O.;Wolf,M.
  17. Journal of Multivariate Analysis v.62 Linear discrimination with adaptive ridge classification rules Loh.W.L.
  18. Mathematics of the USSR Sbornik v.1 Distribution of eigenvalues of some sets of random matrices Marcenko,V.A.;Pastur,L.A.
  19. Biometrika v.67 A normal approximation for the distribution of the likelihood ratio statistic in multivariate analysis of variance Mudholkar,G.S.;Trivedi,M.C.
  20. Proceedings of Academy of Sciences of the USSR, Tech. Cyber. v.4 On the amount of a priori information in designing the classification algorithm Raudys,S.
  21. Journal of Multivariate Analysis v.46 Asymptotic expansion of the misclassification probabilities of D and A-criteria for discrimination from two high-dimensional populations using the theory of large-dimensional random matrices Saranadasa,H.
  22. Multivariate analysis with fewer observations than the dimension Srivastava,M.S.
  23. Journal of Multivariate Analysis v.31 Saddlepoint method for obtaining tail probability of Wilks' likelihood ratio test Srivastava,M.S.;Yau,W.K.
  24. Communications in Statistics-Theory and Methods Asymptotic expansion of the null distribution of LA statistic for multivariate linear hypothesis when the dimension is large Tonda,T.;Fujikoshi,Y.
  25. Reported in the Meeting of Japan Statistical Society Asymptotic expansions for likelihood ratio test statistics on covariance matrices when the sample size and the dimension are large Wakaki,H.
  26. Asymptotic expansions of the distributions of MANOVA test statistics when the dimension is large Wakaki,H.;Fujikoshi,Y.;Ulyanov,V.
  27. Pattern Recognition v.23 A comparson of asymptotic error rate expansions for the sample linear discriminant function Wyman,F.J.;Young,D.M.;Turner,D.W.
  28. Journal of Multivariate Analysis v.20 Limiting spectral distribution for a class of randon matrices Yin,Y.Q.
  29. Probability Theory and Related Fields v.78 On the limit of the largest eignvalue of the large dimensional sample covariance matrix Yin,Y.Q.;Bai,Z.D.;Krishnaiah,P.R.