http://dx.doi.org/10.3745/KIPSTD.2004.11D.2.281

Selectivity Estimation Using Compressed Spatial Histogram  

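Chi, Jeong-Hee (Computer Science, Graduate School, Chungbuk National University)
Lee, Jin-Yul (Point-I)
Kim, Sang-Ho (Computer Science, Graduate School, Chungbuk National University)
Ryu, Keun-Ho (School of Electrical and Computer Engineering, Chungbuk National University)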
Abstract
Selectivity estimation for spatial queries is an important step in finding the most efficient execution plan. Many techniques have been proposed to estimate selectivity accurately. Although they address problems such as false-count and multi-count, they cannot achieve these results in a small memory space. We therefore propose a new technique, the MW Histogram, which compresses summary data while still producing reasonable estimates, and which has a flexible structure that can react to dynamic updates. Our method is based on two techniques: (a) the MinSkew partitioning algorithm, which handles skewed spatial datasets efficiently, and (b) the wavelet transformation, whose compression effect is proven. The experimental results showed that the MW Histogram in which the ratio between buckets and wavelet coefficients is 0.3 has a lower relative error than the MinSkew Histogram for queries of about 5%-20%, which demonstrates that the MW Histogram achieves good selectivity estimates in little memory.
Keywords
Query Processing; Selectivity Estimation; Compression; Spatial Histogram; Wavelet;
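
The compression idea named in the abstract can be illustrated with a minimal sketch. It is not the authors' MW Histogram algorithm: it assumes a one-dimensional row of bucket counts (the MinSkew partitioning step is skipped entirely), an unnormalized Haar wavelet, and a simple "keep the k largest coefficients" rule. The function names and the sample counts are hypothetical and chosen only for illustration.

```python
def haar_decompose(values):
    """Unnormalized Haar transform of a list whose length is a power of two.

    Returns [overall average, detail coefficients from coarse to fine].
    """
    n = len(values)
    assert n > 0 and n & (n - 1) == 0, "length must be a power of two"
    current = [float(v) for v in values]
    details = []
    while len(current) > 1:
        pairs = range(0, len(current), 2)
        averages = [(current[i] + current[i + 1]) / 2 for i in pairs]
        diffs = [(current[i] - current[i + 1]) / 2 for i in pairs]
        details = diffs + details  # coarser levels end up in front
        current = averages
    return current + details


def haar_reconstruct(coeffs):
    """Inverse of haar_decompose."""
    current = [coeffs[0]]
    pos = 1
    while pos < len(coeffs):
        diffs = coeffs[pos:pos + len(current)]
        expanded = []
        for avg, d in zip(current, diffs):
            expanded.extend([avg + d, avg - d])
        pos += len(current)
        current = expanded
    return current


def keep_largest(coeffs, k):
    """Assumed thresholding rule: zero all but the k largest-magnitude coefficients."""
    order = sorted(range(len(coeffs)), key=lambda i: abs(coeffs[i]), reverse=True)
    kept = set(order[:k])
    return [c if i in kept else 0.0 for i, c in enumerate(coeffs)]


def estimate_selectivity(counts, lo, hi):
    """Fraction of objects falling in buckets lo..hi-1."""
    return sum(counts[lo:hi]) / sum(counts)


# Hypothetical bucket counts (objects per histogram bucket).
bucket_counts = [2, 2, 0, 2, 3, 5, 4, 4]
coeffs = haar_decompose(bucket_counts)
compressed = keep_largest(coeffs, 4)      # store 4 of 8 coefficients
approx_counts = haar_reconstruct(compressed)

true_sel = estimate_selectivity(bucket_counts, 2, 4)
est_sel = estimate_selectivity(approx_counts, 2, 4)
relative_error = abs(est_sel - true_sel) / true_sel
print(approx_counts, true_sel, est_sel, relative_error)
```

Storing fewer coefficients saves memory at the cost of a larger relative error on some queries, which is the trade-off the abstract evaluates. A production scheme would typically weight coefficients by resolution level before thresholding; the plain magnitude rule here is a simplification.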