Efficient Execution of Range $Top-\kappa$ Queries using a Hierarchical Max R-Tree

계층 최대 R-트리를 이용한 범위 상위-$\kappa$ 질의의 효율적인 수행

  • 홍석진 (서울대학교 전기컴퓨터공학부) ;
  • 이상준 (서울대학교 전기컴퓨터공학) ;
  • 이석호 (서울대학교 전기컴퓨터공학부)
  • Published : 2004.04.01

Abstract

A range $Top-\kappa$ query returns top k records in order of a measure attribute within a specified region on multi-dimensional data, and it is a powerful tool for analysis in spatial databases and data warehouse environments. In this paper, we propose an algorithm for answering the query via selective traverse of a Hierarchical Max R-Tree(HMR-tree). It is possible to execute the query by accessing only a small part of the leaf nodes in the query region, and the query performance is nearly constant regardless of the size of the query region. The algorithm manages the priority queue efficiently to reduce cost of handling the queue and the proposed HMR-tree can guarantee the same fan-out as the original R-tree.

범위 상위-$\kappa$ 질의는 질의 범위 내의 다차원 데이타 중 값 애트리뷰트를 기준으로 상위 k개의 레코드를 반환하는 질의로 공간 데이타베이스와 데이타 웨어하우스에서 분석을 위해 많이 사용되는 유용한 질의 형태이다. 이 논문에서는 계층 최대 R-트리의 선택적인 탐색을 통해 범위 상위-k 질의를 효과적으로 수행하는 기법을 제시한다. 이 기법은 단말 노드의 일부만을 접근하여 질의를 수행할 수 있으며, 질의 범위의 크기에 관계없이 거의 일정한 성능을 보인다. 또한 이 기법은 우선순위 큐를 효율적으로 관리함으로써 큐의 유지비용을 최소화하며, 기존 R-트리와 같은 팬아웃을 보장할 수 있다.

Keywords

References

  1. Ralf Hartmut Guting, 'An Introduction to Spatial Database Systems,' VLDB Journal, vol.3, No.4, pp.357-399, 1994 https://doi.org/10.1007/BF01231602
  2. Surajit Chaudhuri, Umeshwar Dayal, 'An Overview of Data Warehousing and OLAP Technology,' SIGMOD Record vol.26, No.1, pp.65-74, 1997 https://doi.org/10.1145/248603.248616
  3. Antonin Guttman: R-Trees, 'A Dynamic Index Structure for Spatial Searching,' SIGMOD Conference, pp.47-57, 1984 https://doi.org/10.1145/602259.602266
  4. Norbert Beckmann, Hans-Peter Kriegel, Ralf Schneider, Bernhard Seeger, 'The R*-Tree: An Efficient and Robust Access Method for Points and Rectangles,' SIGMOD Conference, pp.322-331, 1990
  5. J. Gray, A. Bosworth, A. Layman, and H. Piramish, 'Data Cube: A Relational Aggregation Operator Generalizing, Group-By, Crosstab, and Sub-Totals,' Int. Conference on Data Engineering, pp.152-159, 1996
  6. Venky Harinarayan, Anand Rajaraman, Jeffrey D.Ullman, 'Implementing Data Cubes Efficiently,' SIGMOD Conference, pp.205-216, 1996 https://doi.org/10.1145/235968.233333
  7. Ching-Tien Ho, Rakesh Agrawal, Nimrod Megiddo, Ramakrishnan Srikant, 'Range Queries in OLAP Data Cubes,' SIGMOD Conference, pp.73-88, 1997 https://doi.org/10.1145/253262.253274
  8. Steven Geffner, Divyakant Agrawal, Amr El Abbadi, Terence R.Smith, 'Relative Prefix Sums: An Efficient Approach for Querying Dynamic OLAP Data Cubes,' Int. Conference on Data Engineering, pp. 328-335, 1999
  9. Chee Yong Chen, Yannis E. Ioannidis, 'Hierarchical Prefix Cubes for Range-Sum Queries,' VLDB, pp.675-686, 1999
  10. Sin Yeung Lee, Tok Wang Ling, Hua-Gang Li, 'Hierarchical Compact Cube for Range-Max Queries,' VLDB, pp.232-241, 2000
  11. Seok-Ju Chun, Chin-Wan Chung, Ju-Hong Lee, Seok-Lyong Lee, 'Dynamic Update Cube for Range-sum Queries,' VLDB, pp.521-530, 2001
  12. Iosif Lazaridis, Sharad Mehrotra, 'Progressive Approximate Aggregate Queries with a Multi-Resolution Tree-Structure,' SIGMOD Conference, 2001 https://doi.org/10.1145/376284.375718
  13. Seokjin Hong;Byoungho Song;Sekho Lee, 'Efficient Execution of Range-Aggregate Queries in Data Warehouse Environments,' ER, pp.299-310, 2001
  14. TPC-H, http://www.tpc.org/tpch, 2003
  15. U.S.Census Bureau - TIGER/LINE, http://www.census.gov/geo/www/tiger, 2003