Browse > Article
http://dx.doi.org/10.5351/KJAS.2018.31.6.707

Time series representation for clustering using unbalanced Haar wavelet transformation  

Lee, Sehun (Department of Statistics, Sungkyunkwan University)
Baek, Changryong (Department of Statistics, Sungkyunkwan University)
Publication Information
The Korean Journal of Applied Statistics / v.31, no.6, 2018 , pp. 707-719 More about this Journal
Abstract
Various time series representation methods have been proposed for efficient time series clustering and classification. Lin et al. (DMKD, 15, 107-144, 2007) proposed a symbolic aggregate approximation (SAX) method based on symbolic representations after approximating the original time series using piecewise local mean. The performance of SAX therefore depends heavily on how well the piecewise local averages approximate original time series features. SAX equally divides the entire series into an arbitrary number of segments; however, it is not sufficient to capture key features from complex, large-scale time series data. Therefore, this paper considers data-adaptive local constant approximation of the time series using the unbalanced Haar wavelet transformation. The proposed method is shown to outperforms SAX in many real-world data applications.
Keywords
time series representation; SAX; unbalanced Haar wavelet transformation; classification; clustering;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Aghabozorgi, S., Shirkhorshidi, A. S., and Wah, T. H. (2015). Time-series clustering - a decade review, Information Systems, 53, 16-38.   DOI
2 Baek, C. and Pipiras, V. (2009). Long range dependence, unbalanced Haar wavelet transformation and changes in local mean level, International Journal of Wavelets, Multiresolution and Information Processing, 7, 23-58.   DOI
3 Chan, K. and Fu, W. (1999). Efficient time series matching by wavelets, ICDE, 15, 126-133.
4 Donoho, D. L. and Johnstone, I. M. (1994). Ideal spatial adaptation by wavelet shrinkage, Biometrika, 81, 425-455.   DOI
5 Lin, J., Keogh, E., Wei, L., and Lonardi, S. (2007). Experiencing SAX: a novel symbolic representation of time series, DMKD, 15, 107-144.
6 Faloutsos, C., Ranganathan, M., and Manolopoulos, Y. (1994). Fast subsequence matching in time-series databases, SIGMOD Record, 23, 419-429.   DOI
7 Fryzlewicz, P. (2007). Unbalanced Haar technique for nonparametric function estimation, The Journal of American Statistical Association, 102, 1310-1327.
8 Keogh, E., Chakrabarti, K., Pazzani, M., and Mehrotra, S. (2001). Dimensionality reduction for fast similarity search in large time series databases, Journal of Knowledge and Information Systems, 3, 263-286.   DOI