DOI QR코드

DOI QR Code

Least absolute deviation estimator based consistent model selection in regression

  • Shende, K.S. (Department of Statistics, Shivaji University) ;
  • Kashid, D.N. (Department of Statistics, Shivaji University)
  • Received : 2018.11.28
  • Accepted : 2019.03.20
  • Published : 2019.05.31

Abstract

We consider the problem of model selection in multiple linear regression with outliers and non-normal error distributions. In this article, the robust model selection criterion is proposed based on the robust estimation method with the least absolute deviation (LAD). The proposed criterion is shown to be consistent. We suggest proposed criterion based algorithms that are suitable for a large number of predictors in the model. These algorithms select only relevant predictor variables with probability one for large sample sizes. An exhaustive simulation study shows that the criterion performs well. However, the proposed criterion is applied to a real data set to examine its applicability. The simulation results show the proficiency of algorithms in the presence of outliers, non-normal distribution, and multicollinearity.

Keywords

References

  1. Akaike H (1973). Information theory and an extension of maximum likelihood principle. In Proceedings of the Second International Symposium on Information Theory, Akademiai Kiado, Budapest, 267-281.
  2. Birkes D and Dodge Y (1993). Alternative Methods of Regression, Wiley, New York.
  3. Dielman TE (2005). Least absolute value regression: recent contributions, Journal of Statistical Computation and Simulation, 75, 263-286. https://doi.org/10.1080/0094965042000223680
  4. Dielman TE (2006). Variance estimates and hypothesis tests in least absolute value regression, Journal of Statistical Computation and Simulation, 76, 103-114. https://doi.org/10.1080/00949650412331321052
  5. Gilmour SG (1995). The interpretation of Mallows's $C_p$-statistic, Journal of the Royal Statistical Society, Series D (The Statistician), 45, 49-56.
  6. Kashid DN and Kulkarni SR (2002). A more general criterion for subset selection in multiple linear regression, Communications in Statistics - Theory and Methods, 31, 795-811. https://doi.org/10.1081/STA-120003653
  7. Kim C and Hwang S (2000). Influence subsets on the variable selection, Communication in Statistics-Theory and Methods, 29, 335-347. https://doi.org/10.1080/03610920008832487
  8. Machado JAF (1993). Robust model selection and M-estimation, Econometric Theory, 9, 478-493. https://doi.org/10.1017/S0266466600007775
  9. Mallows C (1973). Some comment on $C_p$, Technometrics, 15, 661-675. https://doi.org/10.2307/1267380
  10. Rao CR and Wu Y (1989). A strong consistent procedure for model selection in a regression model, Biometrika, 76, 369-374. https://doi.org/10.1093/biomet/76.2.369
  11. Rao C, Wu Y, Konishi S, et al. (2001). On model selection, Lecture Notes-Monograph Series, 38, 1-64.
  12. Ronchetti E (1985). Robust model selection in regression, Statistics and Probability Letters, 3, 21-23. https://doi.org/10.1016/0167-7152(85)90006-9
  13. Ronchetti E and Staudte RG (1994). A robust version of Mallows's $C_p$, Journal of the American Statistical Association, 89, 550-559. https://doi.org/10.2307/2290858
  14. Schwarz G (1978). Estimating the dimension of a model, The Annals of Statistics, 6, 461-464. https://doi.org/10.1214/aos/1176344136
  15. Siniksaran E (2008). A geometric interpretation of Mallows' $C_p$ statistic and an alternative plot in variable selection, Computational Statistics and Data Analysis, 52, 3459-3467. https://doi.org/10.1016/j.csda.2007.10.023
  16. Tharmaratnam K and Claeskens G (2013). A comparison of robust versions of the AIC based on M, S and MM-estimators, Statistics: A Journal of Theoretical and Applied Statistics, 47, 216-235. https://doi.org/10.1080/02331888.2011.568120
  17. Yamashita T, Yamashita K, and Kamimura, R (2007). A stepwise AIC method for variable selection in linear regression, Communication in Statistics-Theory and Methods, 36, 2395-2403. https://doi.org/10.1080/03610920701215639