• Title/Summary/Keyword: selectivity estimation

Spatial Selectivity Estimation for Intersection region Information Using Cumulative Density Histogram

  • Kim Byung Cheol;Moon Kyung Do;Ryu Keun Ho
    • Proceedings of the KSRS Conference / 2004.10a / pp.721-725 / 2004
  • The multiple-count problem occurs when rectangular objects span several buckets. The Cumulative Density (CD) histogram solves this problem by keeping four sub-histograms, one for each of the four corner points of a rectangle. Although it provides exact results with constant response time, a significant limitation remains: because it assumes a query window aligned with the given grid, errors can occur when it is applied to real applications. In this paper, we propose selectivity estimation techniques using a generalized cumulative density histogram based on two probabilistic models: (1) a model that considers the query window area ratio, and (2) a model that considers the intersection area between the given grid and the objects. To evaluate the proposed methods, we experimented with a real dataset; the results show that the proposed techniques outperform the existing selectivity estimation techniques. The proposed techniques can be used to accurately quantify the selectivity of spatial range queries on rectangle objects.
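
The four-corner idea can be illustrated with a minimal Python sketch. It follows one commonly described formulation of a cumulative density histogram (four prefix-count tables, one per rectangle corner, combined by inclusion-exclusion); it is not taken from the paper itself. The function names, the dictionary keys, and the convention that rectangles and queries are given as inclusive grid-cell index ranges are assumptions made for illustration.

```python
from itertools import product


def cumulative(points, nx, ny):
    """Prefix-count table: table[i][j] = number of points with x <= i and y <= j."""
    table = [[0] * ny for _ in range(nx)]
    for x, y in points:
        table[x][y] += 1
    # Turn per-cell counts into 2-D cumulative counts in place.
    for i, j in product(range(nx), range(ny)):
        if i > 0:
            table[i][j] += table[i - 1][j]
        if j > 0:
            table[i][j] += table[i][j - 1]
        if i > 0 and j > 0:
            table[i][j] -= table[i - 1][j - 1]
    return table


def build_cd_histogram(rects, nx, ny):
    """rects: (x_lo, y_lo, x_hi, y_hi) as inclusive cell indices (assumed convention)."""
    return {
        "ll": cumulative([(xl, yl) for xl, yl, xh, yh in rects], nx, ny),  # lower-left corners
        "lr": cumulative([(xh, yl) for xl, yl, xh, yh in rects], nx, ny),  # lower-right corners
        "ul": cumulative([(xl, yh) for xl, yl, xh, yh in rects], nx, ny),  # upper-left corners
        "ur": cumulative([(xh, yh) for xl, yl, xh, yh in rects], nx, ny),  # upper-right corners
    }


def lookup(table, i, j):
    """Cumulative count up to cell (i, j); zero when the index falls off the grid edge."""
    return table[i][j] if i >= 0 and j >= 0 else 0


def count_intersecting(cd, xa, ya, xb, yb):
    """Rectangles intersecting the grid-aligned window [xa..xb] x [ya..yb].

    Start from rectangles whose lower-left corner is not beyond the window,
    then remove those lying entirely left of it or entirely below it.
    Each rectangle is counted once, however many cells it spans.
    """
    return (lookup(cd["ll"], xb, yb)
            - lookup(cd["lr"], xa - 1, yb)
            - lookup(cd["ul"], xb, ya - 1)
            + lookup(cd["ur"], xa - 1, ya - 1))
```

Each query touches four table entries, which is where the constant response time comes from; the count is exact only because the query window is assumed to coincide with cell boundaries.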

Selectivity Estimation for Multidimensional Sequence Data in Spatio-Temporal Databases (시공간 데이타베이스에서 다차원 시퀀스 데이타의 선택도추정)

  • Shin, Byoung-Cheol;Lee, Jong-Yun
    • Journal of KIISE:Databases / v.34 no.1 / pp.84-97 / 2007
  • Selectivity estimation techniques for query optimization are used in commercial databases, and histograms are widely used for selectivity estimation. Until recently, techniques for spatio-temporal databases have been restricted to those for existing temporal and spatial databases. In addition, selectivity estimation techniques have focused on time-series data such as moving objects, and they cannot estimate selectivity for range queries with a time interval. Therefore, we construct two histograms, CMH (current multidimensional histogram) and PMH (past multidimensional histogram), to estimate the selectivity of multidimensional sequence data in spatio-temporal databases, and we propose effective selectivity estimation methods using these histograms. Furthermore, we solve the range query problem using the proposed histograms. We evaluated the effectiveness of the histograms for range queries with a time interval through various experiments.

A Selectivity Estimation Technique for Current Query of Moving Objects (이동객체를 위한 현재 질의 선택율 추정 기법)

  • Chi, Jeong-Hee;Ryu, Keun-Ho;Jeong, Doo-Young
    • Journal of the Korea Society of Computer and Information / v.11 no.1 s.39 / pp.87-96 / 2006
  • Selectivity estimation is one of the query optimization techniques. Previous selectivity estimation techniques for moving objects have difficulty reflecting the position changes of moving objects in the synopsis. They therefore produce large errors when estimating selectivity for queries, because they are based on an extended spatial synopsis that does not consider the properties of moving objects. To reduce the estimation error, the existing techniques must rebuild the synopsis often, and consequently the whole database must be read frequently. In this paper, we propose a moving object histogram method based on a quad tree to develop a selectivity estimation technique for moving object queries. We then analyze the performance of the proposed method through implementation and evaluation. Our method can be used in various location management systems in which the location information of moving objects changes over time, such as vehicle location tracking systems, location-based services, telematics services, and emergency rescue services.

Histogram-based Selectivity Estimation Method in Spatio-Temporal Databases (시공간 데이터베이스를 위한 히스토그램 기반 선택도 추정 기법)

  • Lee Jong-Yun;Shin Byoung-Cheol
    • The KIPS Transactions:PartD / v.12D no.1 s.97 / pp.43-50 / 2005
  • The processing domains of spatio-temporal databases are divided into time-series databases for moving objects and sequence databases for discrete historical objects. Selectivity estimation techniques for query optimization in spatio-temporal databases have recently been studied, but they have focused on query optimization in time-series databases. There was no previous work on selectivity estimation techniques for sequence databases. Therefore, we construct the T-Minskew histogram for query optimization in sequence databases and propose a selectivity estimation method using the T-Minskew histogram. Furthermore, we propose an effective histogram maintenance technique for good performance of the histogram.

Spatial Selectivity Estimation Using Wavelet

  • Lee, Jin-Yul;Chi, Jeong-Hee;Ryu, Keun-Ho
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2003.09a / pp.459-462 / 2003
  • Selectivity estimation of queries not only provides useful information for query processing optimization but can also give users a preview of processing results. In this paper, we investigate the problem of selectivity estimation in the context of a spatial dataset. Although several techniques have been proposed in the literature to estimate spatial query result sizes, most of them share a drawback: a large amount of memory is required to retain accurate selectivity. To eliminate this drawback, we propose a new method called the MW Histogram. Our method is based on two techniques: (a) the MinSkew partitioning algorithm, which processes skewed spatial datasets efficiently, and (b) the wavelet transformation, whose compression effect is proven. We evaluate our method on real datasets. The experimental results show that the MW Histogram provides estimates with low relative error and retains similar estimates even when memory space is small.
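
The compression effect of a wavelet transform can be shown with a small Python sketch. It applies a plain one-dimensional Haar transform to a row of bucket counts and keeps only the largest coefficients. This is a generic illustration, not the MW Histogram construction, which the abstract does not spell out; the function names are hypothetical, the input length is assumed to be a power of two, and ranking unnormalized coefficients by magnitude is a simplification.

```python
def haar_forward(values):
    """Unnormalized Haar transform: repeated pairwise averages and half-differences.

    Assumes len(values) is a power of two. Output layout is
    [overall average, coarsest detail, ..., finest details].
    """
    output = list(values)
    n = len(output)
    while n > 1:
        half = n // 2
        averages = [(output[2 * i] + output[2 * i + 1]) / 2 for i in range(half)]
        details = [(output[2 * i] - output[2 * i + 1]) / 2 for i in range(half)]
        output[:n] = averages + details
        n = half
    return output


def haar_inverse(coeffs):
    """Invert haar_forward by rebuilding each pair as (average + detail, average - detail)."""
    output = list(coeffs)
    n = 1
    while n < len(output):
        rebuilt = []
        for average, detail in zip(output[:n], output[n:2 * n]):
            rebuilt += [average + detail, average - detail]
        output[:2 * n] = rebuilt
        n *= 2
    return output


def keep_largest(coeffs, k):
    """Zero all but the k largest-magnitude coefficients (the stored summary)."""
    ranked = sorted(range(len(coeffs)), key=lambda i: abs(coeffs[i]), reverse=True)
    kept = set(ranked[:k])
    return [c if i in kept else 0.0 for i, c in enumerate(coeffs)]
```

For a piecewise-constant row such as `[4, 4, 8, 8, 1, 1, 1, 1]`, only three of the eight coefficients are nonzero, so `haar_inverse(keep_largest(haar_forward(row), 3))` reproduces the row from less than half the storage. Smoother or noisier data would lose some accuracy as `k` shrinks.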

Selectivity Estimation for Timestamp Queries (시점 질의를 위한 선택율 추정)

  • Shin, Byoung-Cheol;Lee, Jong-Yun
    • Journal of KIISE:Databases / v.33 no.2 / pp.214-223 / 2006
  • Recently, there has been a need to store and process enormous amounts of spatial data in spatio-temporal databases. For effective query processing in spatio-temporal databases, selectivity estimation, a query optimization technique that approximates query results when the precise answer is not necessary or early feedback is helpful, has been studied. Existing selectivity estimation techniques include sampling-based, histogram-based, and wavelet-based techniques. However, existing techniques for spatio-temporal databases have focused on selectivity estimation for the future extent of moving objects. In this paper, we construct a new histogram, named T-Minskew, for query optimization of past spatio-temporal data. We also propose an effective selectivity estimation method using the T-Minskew histogram and an effective threshold-based histogram maintenance technique that prevents frequent histogram reconstruction.

Selectivity Estimation for Spatial Databases

  • Chi, Jeong-Hee;Lee, Jin-Yul;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference / 2003.11a / pp.766-768 / 2003
  • Selectivity estimation for spatial queries is crucial in Spatial Database Management Systems (SDBMS). Much work has been done to estimate selectivity accurately. Although these works deal with problems arising from the properties of spatial datasets, such as false-count and multi-count, they cannot achieve the same results in a small memory space. Therefore, the spatial dataset needs to be compressed into a small amount of memory. In this paper, we propose a new technique called the MW Histogram, which is able to compress summary data and still get reasonable results. Our method is based on two techniques: (a) the MinSkew partitioning algorithm, which deals with skewed spatial datasets efficiently, and (b) the wavelet transformation, whose compression effect is proven. We evaluate our method on real datasets. The experimental results show that the MW Histogram provides estimates with low relative error and retains similar estimates even when memory space is small.
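
How a bucket histogram such as the output of MinSkew partitioning is typically queried can be sketched as follows. The sketch uses the standard uniformity assumption (objects are spread evenly inside each bucket) and treats objects as points, which sidesteps the multi-count issue mentioned above. The bucket layout, the tuple format `(x_lo, y_lo, x_hi, y_hi)`, and the function names are assumptions for illustration, not details from the paper.

```python
def overlap_area(a, b):
    """Area of the intersection of two boxes given as (x_lo, y_lo, x_hi, y_hi)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return width * height if width > 0 and height > 0 else 0.0


def estimate_count(buckets, query):
    """Estimated number of objects in `query`.

    buckets: list of (box, count) pairs. Each bucket contributes its count
    scaled by the fraction of its area that the query covers.
    """
    total = 0.0
    for box, count in buckets:
        area = (box[2] - box[0]) * (box[3] - box[1])
        if area > 0:
            total += count * overlap_area(box, query) / area
    return total
```

With a dense bucket `((0, 0, 4, 4), 80)` next to a sparse one `((4, 0, 8, 4), 8)`, the query `(2, 0, 6, 4)` covers half of each bucket and is estimated at 44 objects. The estimate is only as good as the uniformity assumption, which is why partitioning schemes try to keep the density inside each bucket as even as possible.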

A Compressed Histogram Technique for Spatial Selectivity Estimation (공간 선택률 추정을 위한 압축 히스토그램 기법)

  • Chung, Jae-Du;Chi, Jeong-Hee;Ryu, Keun-Ho
    • Proceedings of the Korea Spatial Information System Society Conference (한국공간정보시스템학회:학술대회논문집) / 2004.12a / pp.69-74 / 2004
  • Selectivity estimation for spatial queries is a very important step in finding the most efficient execution plan. Much work has been done to estimate selectivity accurately. Although these works deal with problems such as false-count and multi-count, they require a large amount of memory to retain accurate selectivity, so they cannot get good results in small-memory environments such as small mobile databases. To solve this problem, we propose a new technique called the MW histogram, which is able to compress summary data and still get reasonable results. It also has a flexible structure that can respond to dynamic updates. The experimental results show that the MW histogram has lower relative error than the MinSkew histogram and achieves good selectivity estimates in little memory.

Spatial Selectivity Estimation using Cumulative Wavelet Histograms (누적밀도 웨이블릿 히스토그램을 이용한 공간 선택율 추정)

  • Chi, Jeong-Hee;Jeong, Jae-Hyuk;Ryu, Keun-Ho
    • Journal of KIISE:Databases / v.32 no.5 / pp.547-557 / 2005
  • The purpose of selectivity estimation is to maintain summary data in a very small memory space while minimizing the error between the estimated value and the actual query result. When estimating selectivity for large spatial data, existing works need summary information that reflects the spatial data distribution well in order to get an accurate result for a query, and obtaining such summary information requires a large memory space. Therefore, in this paper, we propose a new technique, the cumulative density wavelet histogram (CDW Histogram), which obtains highly accurate selectivity in a small memory space. The proposed method utilizes the sub-histograms created by the CD histogram: the wavelet transform is applied to each sub-histogram to generate wavelet summary information. This gives good selectivity even when the memory size is very small. The experimental results show that the proposed method combines the strengths of both techniques: it obtains good selectivity, using the previous histogram, in 25% to 50% of the memory space and is superior to the existing selectivity estimation techniques. The proposed technique can be used to accurately quantify the selectivity of spatial range queries in databases with very restricted memory.

Selectivity Estimation using the Generalized Cumulative Density Histogram (일반화된 누적밀도 히스토그램을 이용한 공간 선택율 추정)

  • Chi, Jeong-Hee;Kim, Sang-Ho;Ryu, Keun-Ho
    • The KIPS Transactions:PartD / v.11D no.4 / pp.983-990 / 2004
  • The multiple-count problem occurs when rectangular objects span several buckets. The CD histogram solves this problem by keeping four sub-histograms, one for each of the four corner points of a rectangle. Although it provides exact results with constant response time, a significant limitation remains: because it assumes a query window aligned with the given grid, errors may occur when it is applied to real applications. In this paper, we propose selectivity estimation techniques using the generalized cumulative density histogram based on two probabilistic models: (1) a probabilistic model that considers the query window area ratio, and (2) a probabilistic model that considers the intersection area between the given grid and the objects. Our method removes the impact of the query window restriction of the existing cumulative density histogram. We experimented with real datasets to evaluate the proposed methods. Experimental results show that the proposed technique is superior to the existing selectivity estimation techniques. Furthermore, the selectivity estimation technique based on the probabilistic model considering the intersection area is very accurate (less than 5% error) at a 20% query window. The proposed techniques can be used to accurately quantify the selectivity of spatial range queries on rectangle objects.
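
The abstract does not give the formula behind the query window area ratio model, so the following Python sketch shows only one plausible reading of it: snap a non-aligned query window outward to the grid, obtain the exact count for that aligned window, and scale the count by the ratio of the two areas. The scaling rule, the function names, and the `cell_size` parameter are all assumptions; the aligned count is passed in as a plain number so the sketch stays self-contained.

```python
import math


def snap_outward(query, cell_size):
    """Smallest grid-aligned window enclosing `query` = (x_lo, y_lo, x_hi, y_hi)."""
    x_lo, y_lo, x_hi, y_hi = query
    return (math.floor(x_lo / cell_size) * cell_size,
            math.floor(y_lo / cell_size) * cell_size,
            math.ceil(x_hi / cell_size) * cell_size,
            math.ceil(y_hi / cell_size) * cell_size)


def area(box):
    """Area of a box given as (x_lo, y_lo, x_hi, y_hi)."""
    return (box[2] - box[0]) * (box[3] - box[1])


def area_ratio_estimate(aligned_count, query, cell_size):
    """Scale the exact count of the enclosing aligned window by the area ratio.

    aligned_count: count for snap_outward(query, cell_size), for example
    from a cumulative density histogram lookup (hypothetical input here).
    """
    aligned_area = area(snap_outward(query, cell_size))
    if aligned_area == 0:
        return 0.0
    return aligned_count * area(query) / aligned_area
```

For a unit grid, the window `(0.5, 0.5, 1.5, 1.5)` snaps outward to `(0, 0, 2, 2)`; if that aligned window holds 8 objects, the scaled estimate is 2. A window that is already aligned keeps its exact count, since the ratio is 1.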