An Efficient Search Space Generation Technique for Optimal Materialized Views Selection in Data Warehouse Environment

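  • Tae-hee Lee (School of Computer Engineering, Seoul National University) ;
  • Jae-young Chang (School of Computer Engineering, Hansung University) ;
  • Sang-goo Lee (School of Computer Engineering, Seoul National University)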
  • Published : 2004.12.01

Abstract

Query processing is a critical issue in data warehouse environments, since queries on data warehouses often involve hundreds of complex operations over large volumes of data. Data warehouses therefore build a large number of materialized views to improve system performance. The choice of which views to materialize strongly affects both query performance and view maintenance cost. The goal of the materialized view selection problem is to select an optimal set of views that minimizes the total query response time together with the view maintenance cost. In this paper, we present an efficient solution to the materialized view selection problem. Although the optimal selection of materialized views is an NP-hard problem, we develop a feasible solution that drastically reduces the search space by exploiting the characteristics of relational operators such as join, selection, grouping, and aggregation.
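The selection objective described in the abstract can be illustrated with a minimal sketch. This is not the technique proposed in the paper: it is a naive exhaustive search over all subsets of candidate views, shown only to make the objective concrete and to indicate why the search space (2^n subsets for n candidates) needs to be reduced. The function name, the cost tables, and all numbers are hypothetical.

```python
from itertools import combinations


def select_views(query_freq, base_cost, view_cost, maintenance_cost):
    """Return the view subset minimizing query cost plus maintenance cost.

    Hypothetical inputs (not taken from the paper):
      query_freq[q]        how often query q is issued
      base_cost[q]         cost of answering q from base tables
      view_cost[q][v]      cost of answering q from materialized view v
      maintenance_cost[v]  cost of keeping view v up to date
    """
    views = sorted(maintenance_cost)
    best_set, best_total = frozenset(), float("inf")
    # Enumerate every subset of candidate views: 2^n candidates in total.
    for size in range(len(views) + 1):
        for subset in combinations(views, size):
            chosen = frozenset(subset)
            # Each query uses its cheapest source among base tables and chosen views.
            query_total = sum(
                freq * min(
                    [base_cost[q]]
                    + [c for v, c in view_cost.get(q, {}).items() if v in chosen]
                )
                for q, freq in query_freq.items()
            )
            maintenance_total = sum(maintenance_cost[v] for v in chosen)
            total = query_total + maintenance_total
            if total < best_total:
                best_set, best_total = chosen, total
    return best_set, best_total


# Illustrative numbers only.
query_freq = {"q1": 10, "q2": 5}
base_cost = {"q1": 100, "q2": 80}
view_cost = {"q1": {"v1": 10, "v2": 40}, "q2": {"v2": 20, "v3": 5}}
maintenance_cost = {"v1": 300, "v2": 500, "v3": 450}

best_views, best_cost = select_views(query_freq, base_cost, view_cost, maintenance_cost)
print(sorted(best_views), best_cost)
```

With these made-up costs, materializing only `v1` gives the lowest combined cost. The enumeration doubles with every additional candidate view, which is the motivation for pruning the search space before selection.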
