Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2003.10D.4.577

Shape-Based Subsequence Retrieval Supporting Multiple Models in Time-Series Databases  

Won, Jung-Im (한림대학교 대학원 컴퓨터공학과)
Yoon, Jee-Hee (한림대학교 정보통신공학부)
Kim, Sang-Wook (한양대학교 정보통신공학부)
Park, Sang-Hyun (포항공과대학교 컴퓨터공학과)
Abstract
The shape-based retrieval is defined as the operation that searches for the (sub) sequences whose shapes are similar to that of a query sequence regardless of their actual element values. In this paper, we propose a similarity model suitable for shape-based retrieval and present an indexing method for supporting the similarity model. The proposed similarity model enables to retrieve similar shapes accurately by providing the combination of various shape-preserving transformations such as normalization, moving average, and time warping. Our indexing method stores every distinct subsequence concisely into the disk-based suffix tree for efficient and adaptive query processing. We allow the user to dynamically choose a similarity model suitable for a given application. More specifically, we allow the user to determine the parameter p of the distance function $L_p$ when submitting a query. The result of extensive experiments revealed that our approach not only successfully finds the subsequences whose shapes are similar to a query shape but also significantly outperforms the sequence search.
Keywords
Sequence Databases; Similarity Search; Shape-Based Retrieval; Indexing; Time Warping;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D. Rafiei and A. Mendelzon, 'Similarity-Based Queries for Time-Series Data', In Proc. Int'l. Conf. on Management of Data, ACM SIGMOD, pp.13-24, 1997   DOI
2 D. Rafiei, 'On Similarity-Based Queries for Time Series Data', Proc. IEEE Intl. Conf. on Data Engineering, pp. 410-417, 1999
3 C. Faloutsos, M. Ranganathan and Y. Manolopoulos, 'Fast Subsequence Matching in Time-series Databases', In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, pp. 419-429, May, 1994   DOI
4 D. Q. Goldin and P. C. Kanellakis, 'On Similarity Queries for Time-Series Data , Constraint Specification and Implementation,' In Proc. Int'l Conf. on Principles and Practice of Constraint Programming, CP, pp.137-153, Sept., 1995   DOI   ScienceOn
5 Y. S. Moon, K. Y. Whang, and W. K. Loh, 'Duality-Based Subsequence Matching in Time-Series Databases', In Proc. Int'l Conf. on Data Engineeing, IEEE ICDE, pp.263-272, 2001   DOI
6 R. Agrawal, C. Faloutsos and A. Swami, 'Efficient Similarity Search in Sequence Databases', In Proc. Int'l. Conference on Foundations of Data Organization and Algorithms, FODO, pp.69-84, Oct., 1993   DOI   ScienceOn
7 R. Agrawal et al., 'Fast Similarity Search in the Presence of Noise, Scaling and Translation in Time-Series Databases', Proc. Int'l Conference on Very Large Data Bases, VLDB, pp.490-501, Sept., 1995
8 W. K. Loh, S. W. Kim and K. Y. Whang, 'Index Interpolation : A Subsequence Matching Algorithm Supporting Moving Average Transform of Arbitrary Order in Time Series Databases', IEICE Trans. on Information and Systems, Vol.E84-D, No.1, pp.76-86, 2001
9 K. K. W. Chu and M. H. Wong, 'Fast Time-Series Sea-rching with Scaling and Shifting', In Proc. Int'l Symp. on Principles of Database Systems, ACM PODS, pp. 237-248, May, 1999   DOI
10 D. J. Berndt and J. Clifford, 'Finding Patterns in Time Series : A Dynamic Programming Approach,' Advances in Knowledge Discovery and Data Mining, pp.229-248, 1996
11 G. Das, D. Gunopulos and H. Mannila, 'Finding Similar Time Series', In Proc. European Symp. on Principles of Data Mining and Knowledge Discovery, PKDD, pp.88-100, 1997   DOI   ScienceOn
12 S. H. Park, S. W. Kim, J. S. Cho andS. Padmanabhan, 'Prefix-Querying : An Approach for Effiective Subsequence Matching Under Time Warping in Sequence Databases', In Proc ACM Intl. Conf. on Information and Knowledge Management(ACM CIKM), pp.255-262, 2001   DOI
13 W. K. Loh, S. W. Kim and K.Y.Whang, 'Index Interpolation : An Approach for Subsequence Matching Supporting Normalization Transform in Time-Series Databases', In Proc. ACM Intl. Conf. on Information and Knowledge Management(ACM CIKM), pp.480-487, 2000
14 B. K. Yi, H. V. Jagadish, and C. Faloutsos, 'Efficient Retrieval of Similar Time Sequences Under Time Warping', In Proc. Int'l Conf. on Data Engineering, IEEE, pp.201-208, 1998   DOI
15 S. H. Park et al., 'Efficient Searches for Similar Subsequences of Difference Lengths in Sequence Databases', In Proc. Int'l Conf. on Data Engineering, IEEE, pp.23-32, 2000   DOI
16 C. S. Perng et al., 'Landmarks : A New Model for Similarity-Based Pattern Querying in Time Series Databases', In Proc. Int'l. Conf. on Data Engineering, IEEE, pp.33-42, 2000
17 S. W. Kim, S. H. Park and W. W. Chu, 'An Index-Based Approach for Similarity Search Supporting Time Warping in Large Sequence Databases', Proc. Intl. Conf. on Data Engineering, IEEE, pp.607-614, 2001   DOI
18 N. Beckmann et al., 'The $R^*-tree$: An Efficient and Robust Access Method for Points and Rectangles', In Proc. Int'l. Conf. on Management of Data, ACM SIGMOD, pp.322-331, 1990-05-00   DOI
19 F. P. Preparata and M. Shamos, Computational Geometry : An Introduction, Springer-Verlag, 1985
20 R. Agrawal et al., 'Querying Shapes of Histories,' In Proc. Int'l Conference on Very Large Data Bases, VLDB, pp.502-514, Sept., 1995
21 L. Rabiner and H. H. Juang, Fundamentals of Speech Recognition, Prentice Hall, 1993
22 M. Kendall, Time-Series, 2nd Edition, Charles Griffin and Company, 1979
23 C. Chatfield, The Analysis of Time-Series : An Introduction, 3rd Edition, Chapman and Hall, 1984
24 K. S. Shim, R. Srikant and R.Agrawal, 'High-dimensional Similarity Joins', In Proc. Int'l. Conf. on Data Engineering, IEEE, pp.301-311, 1997-04-00   DOI
25 N. D. Sidiropoulos and R. Bros, 'Mathematical Programming Algorithms for Regression-Based Non-Linear Filtering in $R^N$', IEEE Trans. on Signal Processing, Mar., 1999
26 S. W. Kim, J. H. Yoon, S. H. Park, T. H. Kim, 'Shape-Based Retrieval of Similar Subsequences in Time-Series Databases', In Proc. ACM Intl. Symp. on Applied Computing(ACM SAC), pp.438-445, 2002
27 B. K.Yi, and C. Faloutsos, 'Fast Time Sequence Indexing for Arbitrary $L_p$ Norms', In Proc. Int'l. Conf. on Very Large Data Bases, VLDB, pp.385-394, 2000
28 G. A. Stephen, String Searching Algorithms, World Scientific Publishing, 1994