Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2004.11D.4.983

Selectivity Estimation using the Generalized Cumulative Density Histogram  

Chi, Jeong-Hee (충북대학교 대학원 전자계산학과)
Kim, Sang-Ho (충북대학교 대학원 전자계산학)
Ryu, Keun-Ho (충북대학교 전기전자 컴퓨터공학부)
Abstract
Multiple-count problem is occurred when rectangle objects span across several buckets. The CD histogram is a technique which selves this problem by keeping four sub-histograms corresponding to the four points of rectangle. Although It provides exact results with constant response time, there is still a considerable issue. Since it is based on a query window which aligns with a given grid, a number of errors nay be occurred when it is applied to real applications. In this paper, we propose selectivity estimation techniques using the generalized cumulative density histogram based on two probabilistic models : \circled1 probabilistic model which considers the query window area ratio, \circled2 probabilistic model which considers intersection area between a given grid and objects. Our method has the capability of eliminating an impact of the restriction on query window which the existing cumulative density histogram has. We experimented with real datasets to evaluate the proposed methods. Experimental results show that the proposed technique is superior to the existing selectivity estimation techniques. Furthermore, selectivity estimation technique based on probabilistic model considering the intersection area is very accurate(less than 5% errors) at 20% query window. The proposed techniques can be used to accurately quantify the selectivity of the spatial range query on rectangle objects.
Keywords
Spatial Query Optimization; Spatial Histogram; Selectivity Estimation; Generalization of Histogram;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Alberto Belussi, Christos Faloutsos, 'Self-Spatial Join Selectivity Estimation Using Fractal Concepts,' In Proc. ACM Symp. on Transactions on Information Systems, Vol. 16, No.2, pp.161-201, April, 1998   DOI   ScienceOn
2 Yossi Matias, Jeffrey Scott Vitter, Min Wang, 'Wavelet-Based Histograms for Selectivity Estimation,' In Proc. ACM SIGMOD Int. Conf. on Management of Data, pp.448-459, 1998   DOI
3 Swarup Acharya, Viswanath Poosals, Sridhar Ramaswamy, 'Selectivity estimation in spatial databases,' In Proc. ACM SIGMOD Int. Conf. on Management of Data, pp.13-24, 1999   DOI
4 A. Aboulnaga, J. Naughton, 'Accurate estimation of the cost of spatial selections,' In Proceedings of the IEEE International Conference on Data Engineering(ICDE), pp.123-134, 2000   DOI
5 Vitter, Wang, 'Approximate Computation of Multidimensional Aggregates of Sparse Data using Wavelets,' In Proc. ACM SIGMOD Int. Conf. on Management of Data, pp.193-204, 1999   DOI
6 Yossi Matias, Jeffrey Scott Vitter, Min Wang, 'Dynamic Maintenance of Wavelet-Based Histograms,' The VLDB, Journal, pp.101-110, Journal , 2000
7 L. Getoor, B. Taskar, D. Koller, 'Selectivity estimationusing probabilistic models,' In Proc. ACM SIGMOD Int. Conf. on Management of Data, pp.461-473, 2001   DOI
8 Jin, N. An, A. Sivasubramaniam, 'Analyzing Range Queries on Spatial Data,' In Proceedings of the IEEE International Conference on Data Engineering(ICDE), pp.525-534, 2000   DOI
9 C. Faloutsos, B. Seeger, A. Traina, and Caetano Traina, 'Spatial Join Selectivity Using Power Laws,' In Proc. ACM SIGMOD Int. Conf. on Management of Data, pp.177-188, 2000
10 Min Wang, Jeffrey Scott Vitter, Lipyeow Lim, Sriram Padmanabhan, 'Wavelet-based cost Estimation for Spatial Queries,' In Proc. Int. Symp. on Spatial and Temporal Databases, pp.175-196, 2001
11 C. Sun, D. Agrawal and A. El Abbadi, 'Exploring Spatial Datasets with Histograms,' In Proceedings of the IEEE International Conference on Data Engineering(ICDE), pp.93-102, 2002   DOI
12 Alberto Belussi, Christos Faloutsos, 'Estimating the Selectivity of Spatial Queries using the 'Correlation' Fractal Dimension,' InProc. 21st Int. Conf. Very Large Data Bases(VLDB), pp.299-310, Nov., 1995
13 Ning An, Zhen-Yu Yang, Sivasubramaniam, A., 'Selectivity estimation for spatial joins,' In Proceedings of the IEEE International Conference on Data Engineering(ICDE), pp.368-375, 2001   DOI