Browse > Article
http://dx.doi.org/10.5351/KJAS.2016.29.5.907

Outlier detection in time series data  

Choi, Jeong In (Department of Statistics, Korea University)
Um, In Ok (Department of Statistics, Korea University)
Choa, Hyung Jun (Department of Statistics, Korea University)
Publication Information
The Korean Journal of Applied Statistics / v.29, no.5, 2016 , pp. 907-920 More about this Journal
Abstract
This study suggests an outlier detection algorithm that uses quantile autoregressive model in time series data, eventually applying it to actual stock manipulation cases by comparing its performance to existing methods. Studies on outlier detection have traditionally been conducted mostly in general data and those in time series data are insufficient. They have also been limited to a parametric model, which is not convenient as it is complicated with an analysis that takes a long time. Thus, we suggest a new algorithm of outlier detection in time series data and through various simulations, compare it to existing algorithms. Especially, the outlier detection algorithm in time series data can be useful in finding stock manipulation. If stock price which had a certain pattern goes out of flow and generates an outlier, it can be due to intentional intervention and manipulation. We examined how fast the model can detect stock manipulations by applying it to actual stock manipulation cases.
Keywords
outlier detection; quantile autoregressive model; time series data;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Chen, C. and Liu, L. M. (1993a). Forecasting time series with outliers, Journal of Forecasting, 12, 13-35.   DOI
2 Chen, C. and Liu, L. M. (1993b). Joint estimation of model parameters and outlier effects in time series, Journal of the American Statistical Association, 88, 284-297.
3 Eo, S. H., Hong, S. M., and Cho, H. (2014). Identification of outlying observations with quantile regression for censored data. arXiv preprint arXiv:1404.7710.
4 Fox, A. J. (1972). Outliers in time series, Journal of the Royal Statistical Society, Series B, 34, 350-363.
5 Fried, R. (2004). Robust filtering of time series with trends, Journal of Nonparametric Statistics, 16, 313-328.   DOI
6 Hoaglin, D. C., Iglewicz, B., and Tukey, J. W. (1986). Performance of some resistant rules for outlier labeling, Journal of the American Statistical Association, 81, 991-999.   DOI
7 Koenker, R. and Xiao, Z. (2006). Quantile autoregression, Journal of the American Statistical Association, 101, 980-990.   DOI
8 Nardi, A. and Schemper, M. (1999). New residuals for cox regression and their application to outlier screening, Biometrics, 55, 523-529.   DOI
9 Rousseeuw, P. J. (1984). Least median of squares regression, Journal of the American Statistical Association, 79, 871-880.   DOI
10 Rousseeuw, P. J. and Croux, C. (1993). Alternatives to the median absolute deviation, Journal of the American Statistical Association, 88, 1273-1283.   DOI
11 Rousseeuw, P. J. and Leroy, A. M. (1988). A robust scale estimator based on the shortest half, Statistica Neerlandica, 42, 103-116.   DOI
12 SAS Institute Inc (2008). SAS/STAT 9.2 User's Guide: the QUANTREG Procedure, Cary, SAS Institute Inc, NC.
13 Siegel, A. F. (1982). Robust regression using repeated medians, Biometrika, 69, 242-244.   DOI
14 Tukey, J. W. (1977). Exploratory Data Analysis, Addison-Wesley, MA.