Browse > Article
http://dx.doi.org/10.5351/CKSS.2006.13.1.029

Unmasking Multiple Outliers in Multivariate Data  

Yoo Jong-Young (School of Computer & Information, Yong In University)
Publication Information
Communications for Statistical Applications and Methods / v.13, no.1, 2006 , pp. 29-38 More about this Journal
Abstract
We proposed a procedure for detecting of multiple outliers in multivariate data. Rousseeuw and van Zomeren (1990) have suggested the robust distance $RD_i$ by using the Resampling Algorithm. But $RD_i$ are based on the assumption that X is in the general position.(X is said to be in the general position when every subsample of size p+1 has rank p) From the practical points of view, this is clearly unrealistic. In this paper, we proposed a computing method for approximating MVE, which is not subject to these problems. The procedure is easy to compute, and works well even if subsample is singular or nearly singular matrix.
Keywords
Mahalanobis distances; Minimum volume ellipsoid; Masking effects; Swamping effects; Resampling algorithm;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Rousseeuw, P.J. and Leroy, A.M. (1987). Robust Regression and Outlier Detection, John Wiley & Sons
2 Rousseeuw, P.J. and van Zomeren, B.C. (1990). Unmasking multivariate outliers and leverage points (with comments). Journal of the American Statistical Association, Vol. 75, 633-651
3 유종영, 안기수 (2002). 다중 선형 모형에서 식별된 다중 이상점과 다중 지렛점의 재확인 방법에 대한 연구. 응용통계연구, 제15권 2호. 269-279
4 Rousseeuw, P.J. (1984). Least Median of Squares Regression. Journal of the American Statistical Association, Vol. 79, 871-880   DOI
5 Hadi, A.S. and Simonoff, J.S. (1993). Procedures for the Identification of Multiple Outliers in Linear Models. Journal of the American Statistical Association, Vol 75, 1264-1272
6 Hadi, A.S. (1992). Identifying Multiple Outliers in Multivariate Data. Journal of the Royal Statistical Society, Ser. B, Vol. 54, 761-771
7 Atkinson, A.C. (1986). Masking unmasked. Biometrika, Vol. 73, 533-541   DOI   ScienceOn
8 Fung, W.K. (1993). Unmasking Outliers and Leverage Points: A Confirmation. Journal of the American Statistical Association, Vol. 88, 515-519   DOI
9 Hawkins, D.M., Bradu, D. and Kass, G.V. (1984). Location of several outliers in multiple regression data using elemental sets. Technometrics, Vol 26, 197-208   DOI
10 Rousseeuw, P.J. (1985). Multivariate estimation with high breakdown points. In Mathematical Statistics and applications (eds W. Grossman, G. Pflug, I. Vincze and W. Wertz), Vol. B, pp.283-297. Dordrecht: Reidel