DOI QR코드

DOI QR Code

Recent developments of constructing adjacency matrix in network analysis

  • Hong, Younghee (Department of Statistics, Pusan National University) ;
  • Kim, Choongrak (Department of Statistics, Pusan National University)
  • Received : 2014.05.28
  • Accepted : 2014.07.02
  • Published : 2014.09.30

Abstract

In this paper, we review recent developments in network analysis using the graph theory, and introduce ongoing research area with relevant theoretical results. In specific, we introduce basic notations in graph, and conditional and marginal approach in constructing the adjacency matrix. Also, we introduce the Marcenko-Pastur law, the Tracy-Widom law, the white Wishart distribution, and the spiked distribution. Finally, we mention the relationship between degrees and eigenvalues for the detection of hubs in a network.

Keywords

References

  1. Bickel, P. J. and Levina, E. (2008). Covariance regularization by thresholding. The Annals of Statistics, 36, 2577-2604. https://doi.org/10.1214/08-AOS600
  2. Bien, J. and Tibshirani, R. J. (2011). Sparse estimation of a covariance matrix. Biometrika, 98 , 807-820. https://doi.org/10.1093/biomet/asr054
  3. Birnbaum, A., Johnstone, I. M., Nadler, B. and Paul, D.(2013). Minimax bounds for sparse PCA with noisy high-dimensional data. The Annals of Statistics, 41, 1055-1084. https://doi.org/10.1214/12-AOS1014
  4. Bonacich, P. (1987). Power and centrality: A family of measures. The American Journal of Sociology, 92, 1170-1182. https://doi.org/10.1086/228631
  5. Butte, A. J., Tamayo, P., Slonim, D., Golub, T. R. and Kohane, I. S.(2000). Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks. Proceedings of the National Academy of Sciences, 97, 12182-12186. https://doi.org/10.1073/pnas.220392197
  6. Cai, T. T. and Liu, W. (2011). A direct estimation approach to sparse linear discriminant analysis. Journal of the American Statistical Association, 106, 1566-1577. https://doi.org/10.1198/jasa.2011.tm11199
  7. Cai, T. T. and Yuan, M. (2012). Adaptive covariance matrix estimation through block thresholding. The Annals of Statistics, 40, 2014-2042. https://doi.org/10.1214/12-AOS999
  8. Cai, T. T. and Zhou, H. (2012). Minimax estimation of large covariance matrices under L1 norm (with discussion). Statistica Sinica, 22, 1319-1378.
  9. Chandrasekaran, V., Parrilo, P. A. and Willsky, A. S. (2012). Latent variable graphical model selection via convex optimization. The Annals of Statistics, 40, 1935-1967. https://doi.org/10.1214/11-AOS949
  10. Chaudhuri, S., Alur, R. and Cerny, P. (2007). Model checking on trees with path equivalences. Tools and Algorithms for the Construction and Analysis of Systems Lecture Notes in Computer Science, 4424, 664-678.
  11. Dempster, A. P. (1972). Covariance selection. Biometrics, 28, 157-175. https://doi.org/10.2307/2528966
  12. Edward, D. (2000). Introduction to graphical modelling, 2nd ed., Springer, New York.
  13. Johnstone, I. M. (2001). On the distribution of the largest eigenvalue in principal component analysis. The Annals of Statistics, 29, 295-327.
  14. Johnstone, I. M. (2008). Multivariate analysis and Jacobi ensembles: Largest eigenvalue, Tracy-Widom limits and rates of convergence. The Annals of Statistics, 36, 2638-2716. https://doi.org/10.1214/08-AOS605
  15. Katz, L.(1953). A new status index derived from sociometric analysis. Psychometrika, 18, 393.
  16. Kim, C., Cheon, M., Kang, M. and Chang, I. (2008). A simple and exact Laplacian clustering of complex networking phenomena: Application to gene expression profiles. Proceedings of the National Academy of Science, 105, 4083-4087. https://doi.org/10.1073/pnas.0708598105
  17. Marcenko, V. A. and Pastur, L. A. (1967). Distribution of eigenvalues for some sets of random matrices. Mathmatics of the USSR - Sbornik, 1, 457-483. https://doi.org/10.1070/SM1967v001n04ABEH001994
  18. Meinshausen, N. and Buhlmann, P. (2006). High-dimensional graphs and variable selection with the lasso. The Annals of Statistics, 34, 1436-1462. https://doi.org/10.1214/009053606000000281
  19. Mieghem, P. V. (2010). Graph spectra for complex networks, Cambridge University Press, New York.
  20. Newman, M. (2010), Networks; An introduction, Oxford University Press, New York.
  21. Peng, J., Wang, P., Zhou N. and Zhu, J. (2009). Partial correlation estimation by joint sparse regression models. Journal of the American Statistical Association, 104, 735-746. https://doi.org/10.1198/jasa.2009.0126
  22. Pillai, N. S. and Yin, J. (2012). Edge universality of correlation matrices. The Annals of Statistics, 40, 1737-1763. https://doi.org/10.1214/12-AOS1022
  23. Rothman, A. J., Levina, E. and Zhu, J. (2009). Generalized thresholding of large covariance matrices. Journal of the American Statistical Association, 104, 177-186. https://doi.org/10.1198/jasa.2009.0101
  24. Rothman, A. J., Levina, E. and Zhu, J. (2010). Sparse multivariate regression with covariance estimation. Journal of Computational and Graphical Statistics, 19, 947-962. https://doi.org/10.1198/jcgs.2010.09188
  25. Speed, T. (2003). Statistical analysis of gene expression microarray data, Chapman and Hall, New York.
  26. Tracy, C. A. and Widom, H. (1996). On orthogonal and symplectic matrix ensembles. Communications in Mathematical Physics, 177, 727-754. https://doi.org/10.1007/BF02099545
  27. Tracy, C. A. and Widom, H. (2000). The distribution of the largest eigenvalue in the Gaussian ensembles; ${\beta}$ = 1, 2, 4. CRM Series in Mathematical Physics, 4, 461-472.
  28. Whittaker, J. (1990). Graphical models in applied mathematical multivariate statistics, John Wiley & Sons. New York.
  29. Wigner, E. P. (1955). Characteristic vectors of bordered matrices with infinite dimensions. The Annals of Mathematics, 62, 548-564. https://doi.org/10.2307/1970079
  30. Won, D. and Choi, K. (2014). Network analysis and comparing citation index of statistics journals. Journal of the Korean Data and Information Science Society, 25, 317-325. https://doi.org/10.7465/jkdi.2014.25.2.317
  31. Yuan, M. and Lin, Y. (2007). Model selection and estimation in the Gaussian graphical model, Biometrika, 94, 19-35. https://doi.org/10.1093/biomet/asm018

Cited by

  1. Data analysis of 4M data in small and medium enterprises vol.26, pp.5, 2015, https://doi.org/10.7465/jkdi.2015.26.5.1117
  2. Application of FOM Methodology for 4M Optimization Based on the Data of Manufacturing Process of Mechanical Parts vol.30, pp.6, 2014, https://doi.org/10.7735/ksmte.2021.30.6.456