ROBUST REGRESSION ESTIMATION BASED ON DATA PARTITIONING

  • Lee, Dong-Hee (BK21 Education and Research Center for Economics and Statistics, Korea University) ;
  • Park, You-Sung (Department of Statistics, Korea University)
  • Published : 2007.06.30

Abstract

We introduce a high breakdown point estimator referred to as data partitioning robust regression estimator (DPR). Since the DPR is obtained by partitioning observations into a finite number of subsets, it has no computational problem unlike the previous robust regression estimators. Empirical and extensive simulation studies show that the DPR is superior to the previous robust estimators. This is much so in large samples.

Keywords

References

  1. VAN AELST, S., ROUSSEEUW, P. J., HUBERT, M. AND STRUYF, A. (2002). 'The deepest regression method', Journal of Multivariate Analysis, 81, 138-166 https://doi.org/10.1006/jmva.2001.1997
  2. COAKLEY, C. W. AND HETTMANSPERGER, T. P. (1993). 'A bounded influence, high breakdown, efficient regression estimator', Journal of the American Statistical Association, 88, 872-880 https://doi.org/10.2307/2290776
  3. CROUX, C., ROUSSEEUW, P. J. AND HOSSJER, O. (1994). 'Generalized S-estimators', Journal of the American Statistical Association, 89, 1271-1281 https://doi.org/10.2307/2290990
  4. HAMPEL, F. R. (1974). 'The influence curve and its role in robust estimation', Journal of the American Statistical Association, 69, 383-393 https://doi.org/10.2307/2285666
  5. HAWKINS, D. M., BRADU, D. AND KASS, G. V. (1984). 'Location of several outliers in multiple-regression data using elemental sets', Technometrics, 26, 197-208 https://doi.org/10.2307/1267545
  6. HAWKINS, D. M. AND OLIVE, D. J. (2002). 'Inconsistency of resampling algorithms for high-breakdown regression estimators and a new algorithm', Journal of the American Statistical Association, 97, 136-159 https://doi.org/10.1198/016214502753479293
  7. HINLOOPEN, J. AND WAGENVOORT, R. (1997). 'On the computation and efficiency of a HBPGM estimator some simulation results', Computational Statistics & Data Analysis, 25, 1-15 https://doi.org/10.1016/S0167-9473(96)00083-7
  8. HOSSJER, O. (1994). 'Rank-based estimates in the linear model with high breakdown point', Journal of the American Statistical Association, 89, 149-158 https://doi.org/10.2307/2291211
  9. MARTIN, R. D., YOHAI, V. J. AND ZAMAR, R. H. (1989). 'Min-max bias robust regression', The Annals of Statistics, 17, 1608-1630 https://doi.org/10.1214/aos/1176347384
  10. OLIVE, D. J. (2003). 'Applied Robust Statistics', Southern Illinois University
  11. ROUSSEEUW, P. J. (1984). 'Least median of squares regression', Journal of the American Statistical Association, 79, 871-880 https://doi.org/10.2307/2288718
  12. ROUSSEEUW, P. J. AND HUBERT, M. (1999). 'Regression depth', Journal of the American Statistical Association, 94, 388-402 https://doi.org/10.2307/2670155
  13. ROUSSEEUW, P. J. AND LEROY, A. M. (1987). Robust Regression and Outlier Detection, John Wiley & Sons, New York
  14. ROUSSEEUW, P. J. AND YOHAI, V. J. (1984). 'Robust regression by means of S-estimators', In Robust and Nonlinear Time Series Analysis (Franke, J., Hardle, W. and Martin, R. D., eds), 256-272, Springer-Verlag, New York
  15. SALIBIAN-BARRERA, M. AND ZAMAR, R. H. (2004). 'Uniform asymptotics for robust location estimates when the scale is unknown', The Annals of Statistics, 32, 1434-1447 https://doi.org/10.1214/009053604000000544
  16. SIMPSON, J. R. AND MONTGOMERY, D. C. (1998). 'A robust regression technique using compound estimation', Naval Research Logistics, 45, 125-139 https://doi.org/10.1002/(SICI)1520-6750(199803)45:2<125::AID-NAV1>3.0.CO;2-A
  17. SIMPSON, D. G., RUPPERT, D. AND CARROLL, R. J. (1992). 'On one-step GM estimates and stability of inferences in linear regression', Journal of the American Statistical Association, 87, 439-450 https://doi.org/10.2307/2290275
  18. SIMPSON, D. G. AND YOHAI, V. J. (1998). 'Functional stability of one-step GM-estimators in approximately linear regression', The Annals of Statistics, 26, 1147-1169 https://doi.org/10.1214/aos/1024691092
  19. STROMBERG, A. J., HOSSJER, O. AND HAWKINS, D. M. (2000). 'The least trimmed differences regression estimator and alternatives', Journal of the American Statistical Association, 95, 853-864 https://doi.org/10.2307/2669469
  20. WISNOWSKI, J. W., MONTGOMERY, D. C. AND SIMPSON, J. R. (2001). 'A comparative analysis of multiple outlier detection procedures in the linear regression model', Computational Statistics & Data Analysis, 36, 351-382 https://doi.org/10.1016/S0167-9473(00)00042-6
  21. YOHAI, V. J. (1987). 'High breakdown-point and high efficiency robust estimates for regression', The Annals of Statistics, 15, 642-656 https://doi.org/10.1214/aos/1176350366
  22. YOHAI, V. J. AND ZAMAR, R. H. (1988). 'High breakdown-point estimates of regression by means of the minimization of an efficient scale', Journal of the American Statistical Association, 83, 406-413 https://doi.org/10.2307/2288856