통합 검색 | Korea Science

Transformation of Continuous Aggregation Join Queries over Data Streams

Tran, Tri Minh;Lee, Byung-Suk
- Journal of Computing Science and Engineering
- /
- 제3권1호
- /
- pp.27-58
- /
- 2009
Aggregation join queries are an important class of queries over data streams. These queries involve both join and aggregation operations, with window-based joins followed by an aggregation on the join output. All existing research address join query optimization and aggregation query optimization as separate problems. We observe that, by putting them within the same scope of query optimization, more efficient query execution plans are possible through more versatile query transformations. The enabling idea is to perform aggregation before join so that the join execution time may be reduced. There has been some research done on such query transformations in relational databases, but none has been done in data streams. Doing it in data streams brings new challenges due to the incremental and continuous arrival of tuples. These challenges are addressed in this paper. Specifically, we first present a query processing model geared to facilitate query transformations and propose a query transformation rule specialized to work with streams. The rule is simple and yet covers all possible cases of transformation. Then we present a generic query processing algorithm that works with all alternative query execution plans possible with the transformation, and develop the cost formulas of the query execution plans. Based on the processing algorithm, we validate the rule theoretically by proving the equivalence of query execution plans. Finally, through extensive experiments, we validate the cost formulas and study the performances of alternative query execution plans.
https://doi.org/10.5626/JCSE.2009.3.1.027 인용 PDF

분산 데이타베이스에서의 질의실행시간 최소화를 위한 유전자알고리즘: 총 시간 대 반응시간 (A Genetic Algorithm for Minimizing Query Processing Time in Distributed Database Design: Total Time Versus Response Time)

송석규
- 정보처리학회논문지D
- /
- 제16D권3호
- /
- pp.295-306
- /
- 2009
질의실행시간최소화는 분산 데이타베이스 설계에 있어 가장 중요한 목적중의 하나이다. 총시간최소화는 온라인거래처리시스템의 목적인 반면, 반응시간최소화는 의사결정지원 질의시스템의 목적이다. 본 논문에서는 질의실행시간최소화를 달성하기 위해 질의를 세분화하여 최적의 데이터베이스 사이트에 할당하는 분석모델을 개발하였으며, 문제해결방법으로 유전자알고리즘을 채택하였다. 총시간최소화 관점에서의 질의실행 계획은 반응시간최소화 관점의 질의실행계획에는 적합하지 않다는 것을 증명하였으며, 그 반대의 경우도 증명하였다. 최대 20개의 조인이 포함되는 질의를 설계하여 시뮬레이션 실험을 통해 테스트를 수행하였고, 유전자알고리즘과 완전한 전수조사와의 결과를 비교함으로써 모든 경우에 유전자알고리즘을 채택한 해결책이 최적의 결과를 도출하였음을 증명하였다.
https://doi.org/10.3745/KIPSTD.2009.16-D.3.295 인용 PDF KSCI

Distributed Database Design using Evolutionary Algorithms

Tosun, Umut
- Journal of Communications and Networks
- /
- 제16권4호
- /
- pp.430-435
- /
- 2014
The performance of a distributed database system depends particularly on the site-allocation of the fragments. Queries access different fragments among the sites, and an originating site exists for each query. A data allocation algorithm should distribute the fragments to minimize the transfer and settlement costs of executing the query plans. The primary cost for a data allocation algorithm is the cost of the data transmission across the network. The data allocation problem in a distributed database is NP-complete, and scalable evolutionary algorithms were developed to minimize the execution costs of the query plans. In this paper, quadratic assignment problem heuristics were designed and implemented for the data allocation problem. The proposed algorithms find near-optimal solutions for the data allocation problem. In addition to the fast ant colony, robust tabu search, and genetic algorithm solutions to this problem, we propose a fast and scalable hybrid genetic multi-start tabu search algorithm that outperforms the other well-known heuristics in terms of execution time and solution quality.
https://doi.org/10.1109/JCN.2014.000073 인용 PDF KSCI

태그 기법을 이용한 공간 질의 수행기의 설계 (Desgin of a Spatial QueryExecutor using Tag Technique)

이찬근;박호현;이용주;정진완
- 한국정보과학회논문지:컴퓨팅의 실제 및 레터
- /
- 제5권5호
- /
- pp.543-552
- /
- 1999
질의 수행기의 물리적 연산자를 구현하기 위해 사용되는 반복자 기법은 효율성과 확장성이 우수하다고 알려져 있다. 그리고 공간 객체에 대한 연산 처리 방법들 중 가장 널리 사용되는 것은 여과 및 정제 단계로 나누어 처리하는 방법이다. 최근에 객체 대수의 수준에서 공간 연산자를 여과 및 정제 단계로 나누어 정의하여 기존의 방법보다 더욱 효율적인 질의 수행 계획을 만들어 낼 수 있는 질의 최적화기에 대한 연구가 있었다. 그러나 기존의 질의 수행기는 이러한 질의 수행 계획을 고려하지 않고 설계되었기 때문에 연산자 간에 여과 연산의 결과를 전달하는 기능이 없다. 우리는 이러한 여과/정제 단계가 분리되어 두 단계 사이에 다른 연산이 들어올 수 있는 질의 수행 계획하에서 반복자 기법을 사용할 때 여과 단계의 연산 결과를 해당 정제 단계 연산자로 전달하는 방법으로 조사 기법과 태그 기법 두 가지를 제안한다. 조사 기법은 연산자 내부의 상태 레코드의 용도를 확장하여 사용하는 방법인 반면 태그 기법은 중간 결과에 태그의 형태로 여과 단계의 연산 결과를 저장한다. 이들 방법을 비교하여 확장성에서 우월한 태그 기법을 사용한 질의 수행기를 설계 및 구현한다. 구현된 질의 수행기는 공간 질의를 위해 확장된 OQL을 처리하기 위한 공간 객체 대수에 정의된 연산자들을 수행할 수 있다.

검색결과 4건 처리시간 0.019초

Transformation of Continuous Aggregation Join Queries over Data Streams

분산 데이타베이스에서의 질의실행시간 최소화를 위한 유전자알고리즘: 총 시간 대 반응시간 (A Genetic Algorithm for Minimizing Query Processing Time in Distributed Database Design: Total Time Versus Response Time)

Distributed Database Design using Evolutionary Algorithms

태그 기법을 이용한 공간 질의 수행기의 설계 (Desgin of a Spatial QueryExecutor using Tag Technique)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)