Browse > Article

Spatial Locality Preservation Metric for Constructing Histogram Sequences  

Lee, Jeonggon (Department of Computer Science, Kangwon National University)
Kim, Bum-Soo (Department of Computer Science, Kangwon National University)
Moon, Yang-Sae (Department of Computer Science, Kangwon National University)
Choi, Mi-Jung (Department of Computer Science, Kangwon National University)
Abstract
This paper proposes a systematic methodology that could be used to decide which one shows the best performance among space filling curves (SFCs) in applying lower-dimensional transformations to histogram sequences. A histogram sequence represents a time-series converted from an image by the given SFC. Due to the high-dimensionality nature, histogram sequences are very difficult to be stored and searched in their original form. To solve this problem, we generally use lower-dimensional transformations, which produce lower bounds among high dimensional sequences, but the tightness of those lower-bounds is highly affected by the types of SFC. In this paper, we attack a challenging problem of evaluating which SFC shows the better performance when we apply the lower-dimensional transformation to histogram sequences. For this, we first present a concept of spatial locality, which comes from an intuition of "if the entries are adjacent in a histogram sequence, their corresponding cells should also be adjacent in its original image." We also propose spatial locality preservation metric (slpm in short) that quantitatively evaluates spatial locality and present its formal computation method. We then evaluate five SFCs from the perspective of slpm and verify that this evaluation result concurs with the performance evaluation of lower-dimensional transformations in real image matching. Finally, we perform k-NN (k-nearest neighbors) search based on lower-dimensional transformations and validate accuracy of the proposed slpm by providing that the Hilbert-order with the highest slpm also shows the best performance in k-NN search.
Keywords
Data Mining; Time-Series Data; Space Filling Curve; Lower-Dimensional Transformation;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Jiang, W., Murugesan, M., Clifton, C., and Si, L., "Similar Document Detection with Limited Information Disclosure," In Proc. of the 24th Int'l Conf. on Data Engineering, IEEE ICDE, Cancun, Mexico, pp. 735-743, Apr. 2008.
2 Keogh, E., and Ratanamahatana, C. A., "Indexing and Mining Large Time Series Databases," In Proc. of Int'l Conf. on Database Systems for Advanced Applications, Tutorial, Bangkok, Thailland, Apr. 2007.
3 Zhu, Y., and Shasha, D., "Warping Indexes with Envelope Transforms for Query by Humming," In Proc. of Int'l Conf. on Management of Data, ACM SIGMOD, San Diego, California, pp. 181-192, June 2003.
4 Chan, K. P., Fu, A. W. C., and Yu, C. T., "Haar Wavelets for Efficient Similarity Search of Time-Series: With and Without Time Warping," IEEE Trans. on Knowledge and Data Engineering, Vol. 15, No. 3, pp. 686-705, May/June 2003.   DOI
5 Faloutsos, C., Ranganathan, M., and Manolopoulos, Y., "Fast Subsequence Matching in Time-Series Databases," In Proc. of Int'l Conf. on Management of Data, ACM SIGMOD, Minneapolis, Minnesota, pp. 419-429, May 1994.
6 Han, W.-S., Lee, J., Moon, Y.-S., and Jiang, H., "Ranked Subsequence Matching in Time-Series Databases," In Proc. of Int'l Conf. on Very Large Data Bases, Vienna, Austria, pp. 423-434, Sept. 2007.
7 Aggarwal, C. C., and Yu, P. S., "Privacy-Preserving Data Mining: A Survey," In Handbook of Database Security: Applications and Trends, pp. 431-460, 2008.
8 Han, W.-S., Lee, J.-S., Moon, Y.-S., Hwang, S.-W., and Yu, H., "A New Approach for Processing Ranked Subsequence Matching Based on Ranked Union," In Proc. of Int'l Conf. on Management of Data, ACM SIGMOD, Athens, Greece, pp. 457-468, June 2011.
9 Hilbert, D., "Ueber Die Stetige Abbildung Einer Line Auf Ein Flachenstuck," Mathematische Annalen, Vol. 38. pp. 459-460, 1891.   DOI
10 Griffin, G., Holub, A. D., and Perona, P., The Caltech-256, Caltech Technical Report.
11 이근우, 양근우, "분산 의사결정지원시스템 구축을 위한 웹서비스 기반 모델-솔버의 통합 설계," 정보기술아키텍처연구, 제9권 제1호, pp. 43-55, 2012년 3월.
12 김현수, 김인규, "온톨로지와 시각화가 적용된 기술참조 모델의 활용방안에 관한 연구," 정보기술아키텍처연구, 제8권 제4호, pp. 347-360, 2011년 12월.
13 Sagan, H., Space-Filling Curves, Springer-Verlag, 1994.
14 Kontos, D., Megalooikonomou, V., Ghubade, N., and Faloutsos, C., "Detecting Discriminative Functional MRI Activation Patterns Using Space Filling Curves," In Proc. of the 25th IEEE Annual Int'l Conf. on Medicine and Biology Society, Cancun, Mexico, pp. 963-967, Sept. 2003.
15 Wang, Q., Kontos, D., Li, G., and Megalooikonomou, V., "Application of Time Series Techniques to Data Mining and Analysis of Spatial Patterns in 3D Images," In Proc. of IEEE Int'l Conf. on Acoustics, speech, and signal Processing, Montreal, Canada, pp. 525-528, May 2004.
16 Klein, A. W., Grant, T., Finkelstein, A., and Cohen, M. F., "Video Mosaic," In Proc. of Int'l Symp. on Non-Photorealistic Animation and Rendering, Annecy, France, pp. 21-28, June 2002.
17 Moon, Y.-S., Whang, K.-Y., and Han, W.-S., "General Match: A Subsequence Matching Method in Time-Series Databases Based on Generalized Windows," In Proc. of Int'l Conf. on Management of Data, ACM SIGMOD, Wisconsin, Madison, pp. 382-393, June 2002.
18 Lee, S.-H., Kim, B.-S., Moon, Y.-S., and Kim, J., "Photo Mosaic Smartphone Application in Client-Server Based Large-Scale Image Databases," In Proc. of Int'l Conf. on World Academy of Science, Engineering and Technology, Singapore, pp. 562-565, Sept. 2011.
19 Agrawal, R., Faloutsos, C., and Swami, A., "Efficient Similarity Search in Sequence Databases," In Proc. of the 4th Int'l Conf. on Foundations of Data Organization and Algorithms, Chicago, Illinois, pp. 69-84, Oct. 1993.
20 Zhang, Y., and Glass, J., "A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-based Dynamic Time Warping," In Proc. of Int'l Conf. on Interspeech, Florence, Italy, pp. 1909-1912, Aug. 2011.
21 Lee, A. J. T., Wu, H. W., Lee, T. Y., Liu, Y. H., and Chen, K. T., "Mining Closed Patterns in Multisequence Time-series Databases," Data & Knowledge Engineering, Vol. 68, No. 10, pp. 1071-1090, Oct. 2009.   DOI
22 Lian, X., Chen, L., Yu, J. X., Han, J., and Ma, J., "Multiscale Representations for Fast Pattern Matching in Stream Time Series," IEEE Trans. on Knowledge and Data Engineering, Vol. 21, No. 4, pp. 568-581, Apr. 2009.   DOI
23 Loh, W.-K., Park, Y.-H., and Yoon, Y.-I., "Fast Recognition of Asian Characters based on Database Methodologies," In Proc. of British National Conf. on Databases, Glasgow, UK, pp. 37-48, July 2007.
24 Moon, Y.-S., Whang, K.-Y., and Loh, W.-K., "Dualitybased Subsequence Matching in Time-Series Databases," In Proc. of the 17th Int'l Conf. on Data Engineering, IEEE ICDE, Heidelberg, Germany, pp. 263-272, Apr. 2001.
25 Moon, Y.-S., Kim, B.-S., Kim, M.-S., and Whang, K.-Y., "Scaling-invariant Boundary Image Matching Using Time-series Matching Techniques," Data & Knowledge Engineering, Vol. 69, No. 10, pp. 1022-1042, Oct. 2010.   DOI
26 Kim, B.-S., Moon, Y.-S., and Kim, J., "Noise Control Boundary Image Matching Using Time-series Moving Average Transform," In Proc. of Int'l Conf. on Database and Expert Systems Applications, Turin, Italy, pp. 362-375, Sept. 2008.