Browse > Article
http://dx.doi.org/10.5351/CSAM.2017.24.3.317

A cautionary note on the use of Cook's distance  

Kim, Myung Geun (Department of Mathematics Education, Seowon University)
Publication Information
Communications for Statistical Applications and Methods / v.24, no.3, 2017 , pp. 317-324 More about this Journal
Abstract
An influence measure known as Cook's distance has been used for judging the influence of each observation on the least squares estimate of the parameter vector. The distance does not reflect the distributional property of the change in the least squares estimator of the regression coefficients due to case deletions: the distribution has a covariance matrix of rank one and thus it has a support set determined by a line in the multidimensional Euclidean space. As a result, the use of Cook's distance may fail to correctly provide information about influential observations, and we study some reasons for the failure. Three illustrative examples will be provided, in which the use of Cook's distance fails to give the right information about influential observations or it provides the right information about the most influential observation. We will seek some reasons for the wrong or right provision of information.
Keywords
case deletion; Cook's distance; influence; regression;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Miller RG (1974). An unbalanced jackknife, Annals of Statistics, 2, 880-891.   DOI
2 Neter J, Kutner MH, Nachtsheim CJ, and Wasserman W (1996). Applied Linear Regression Models (3rd ed), McGraw-Hill, Irwin.
3 Seber GAF (1977). Linear Regression Analysis, Wiley, New York.
4 Chatterjee S and Hadi AS (1988). Sensitivity Analysis in Linear Regression, Wiley, New York.
5 Cook RD (1977). Detection of influential observation in linear regression, Technometrics, 19, 15-18.
6 Cook RD and Weisberg S (1982). Residuals and Influence in Regression, Chapman and Hall, New York.
7 Draper NR and Smith H (1981). Applied Regression Analysis (2nd ed), Wiley, New York.
8 Kim MG (2015). Influence measure based on probabilistic behavior of regression estimators, Computational Statistics, 30, 97-105.   DOI
9 Kim MG (2016). Deletion diagnostics in fitting a given regression model to a new observation, Communications for Statistical Applications and Methods, 23, 231-239.   DOI