Efficient Execution of Range Mosaic Queries

범위 모자이크 질의의 효율적인 수행

  • 홍석진 (서울대학교 컴퓨터공학과) ;
  • 배진욱 (서울대학교 컴퓨터공학과) ;
  • 이석호 (서울대학교 컴퓨터공학부)
  • Published : 2005.10.01

Abstract

A range mosaic query returns distribution of data within the query region as a pattern of mosaic, whereas a range aggregate query returns a single aggregate value of data within the query region. The range mosaic query divides a query region by a multi-dimensional grid, and calculates aggregate values of grid cells. In this paper, we propose a new type of query, range mosaic query and a new operator, mosaic-by, with which the range mosaic queries can be represented. In addition, we suggest efficient algorithms for processing range mosaic queries using an aggregate R-tree. The algorithm that we present computes aggregate results of every mosaic grid cell by one time traversal of the aggregate R-tree, and efficiently executes the queries with only a small number of node accesses by using the aggregate values of the aggregate R-tree. Our experimental study shows that the range mosaic query algorithm is reliable in terms of performance for several synthetic datasets and a real-world dataset.

질의 영역에 대한 단일 값의 통계 정보를 반환하는 범위 집계 질의와는 달리, 범위 모자이크 질의는 질의 영역 내의 데이타 분포를 모자이크 형태로 반환한다. 즉, 범위 모자이크 질의는 질의 영역을 다차원 격자로 나눈 후, 나뉜 각 영역에 대해 집계값을 구해서 결과로 반환하는 질의이다 이 논문에서는 범위 모자이크 질의와, 범위 모자이크 질의를 SQL문으로 표현하기 위한 mosaic-by 연산자를 제안한다. 그리고 이 논문에서는 집계 R-트리를 이용한 범위 모자이크 질의의 효율적인 수행 알고리즘을 소개한다. 알고리즘은 모든 모자이크 셀의 집계값을 한 번의 트리 순회만으로 계산하며, 집계 R-트리의 집계값을 이용하여 질의 영역 내의 모든 노드를 접근하지 않고도 작은 수의 노드 접근만으로 질의를 수행할 수 있다. 실험 결과를 통해 제안된 알고리즘이 생성된 데이타와 실제 데이타 모두에 대해 좋은 성능을 보이는 것을 알 수 있다.

Keywords

References

  1. Ralf Hartmut Guting, 'An Introduction to Spatial Database Systems,' VLDB Journal Vol.3, No.4, pp.357-399, 1994 https://doi.org/10.1007/BF01231602
  2. Surajit Chaudhuri and Umeshwar Dayal, 'An Overview of Data Warehousing and OLAP technology,' ACM SIGMOD Record, 26(1):65-74, 1997 https://doi.org/10.1145/248603.248616
  3. Seokjin Hong, Byoungho Song, and Sukho Lee, 'Efficient execution of range aggregate queries in data warehouse environments,' Conceptual Modeling-ER 2001: 20th International Conference on Conceptual Modeling, Yokohama, Japan, pp.299-310. Springer, 2001
  4. I. Lazaridis and S. Mehrotra: Progressive Approximate Aggregate Queries with a Multi-Resolution Tree Structure In Proceedings of the ACM SIGMOD Conference on Management of Data, pages 401-412, 2001 https://doi.org/10.1145/375663.375718
  5. Dimitris Papadias, Panos Kalnis, Jun Zhang, and Yufei Tao, 'Efficient OLAP operations in spatial data warehouses,' In Advances in Spatial and Temporal Databases : 7th International Symposium, SSTD 2001, Redondo Beach, CA, USA, July 12-15, 2001, Proceedings, pp.443-459. Springer, 2001
  6. Cbing-Tien Ho, Rakesh Agrawal, Nimrod Megiddo, and Ramakrishnan Srikant, 'Range queries in OLAP data cubes,' Proceedings ACM SIGMOD International Conference on Management of Data, May 13-15, 1997, Tucson, Arizona, USA, pp.73-88. ACM Press, 1997
  7. Zheng Xuan Loh, Tok Wang Ling, Chuan-Heng Ang, and Sin Yeung Lee, 'Adaptive method for range top-k queries in OLAP data cubes,' DEXA'02, pp.648-657. Springer, 2002
  8. Seokjin Hong, Bongki Moon, and Sukho Lee, 'Processing range top-k queries in a sparse data cube,' Proceedings of the International Conference on Information and Knowledge Engineering. CSREA Press, 2004
  9. J.Gray, A.Bosworth, A.Layman, H.Pitamish, Data Cube : A Relational Aggregatioon Operator Generalizing Group-By, Cross-Tab, and Sub-Totals, In Proc. of ICDE, pp. 152-159, 1996 https://doi.org/10.1109/ICDE.1996.492099
  10. A. Guttman, 'R-Trees: A Dynamic Index Structure for Spatial Searching,' In Proceedings of ACM SIGMOD International Conference on Management of Data, pp.47-57, Jun., 1984 https://doi.org/10.1145/602259.602266
  11. S. Berchtold, C. Bohm, and H.P. Kriegel, 'The pyramid-technique: Towards breaking the curse of dimensionality,' Proceedings ACM SIGMOD International Conference on Management of Data, June 2-4, 1998, Seattle, Washington, USA, ed. L.M. Haas and A. Tiwary, pp.142.153, ACM Press, 1998 https://doi.org/10.1145/276304.276318
  12. TPC-H. http://www.tpc.org/tpch/, 2004
  13. TIGER/Line. http://www.census.gov/geo/www/tiger/, 2002