Estimation of Data Distribution Using Multidimensional Dynamic File Organization

다차원 동적 화일 구조를 이용한 데이타 분포의 추정

  • 김상욱 (강원대학교 정보통신공학과)
  • Published : 1995.10.31

Abstract

This paper presents a technique for estimating distribution of data stored in a database. This technique is very useful for accurate selectivity estimation, which is essential in query optimization and physical database design. To maintain data distribution, we employ the directory of the multilevel grid file, a multidimensional dynamic file organization. The major advantage of our technique is that data distribution information is maintained dynamically in the multilevel grid file. In contrast, other static methods such as the histogram method use static date structures, which requires periodic restructuring. Furthermore, we propose a method for keeping the abstract information of data distribution in main memory. This is advantageous in the situation where the size of main memory is not sufficient. Finally, We also suggest formulas for calculating selectivies of various queries based on our data distribution information.

Keywords