Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2009.16-D.3.295

A Genetic Algorithm for Minimizing Query Processing Time in Distributed Database Design: Total Time Versus Response Time  

Song, Suk-Kyu (영산대학교 호텔경영학과)
Abstract
Query execution time minimization is an important objective in distributed database design. While total time minimization is an objective for On Line Transaction Processing (OLTP), response time minimization is for Decision Support queries. We formulate the sub-query allocation problem using analytical models and solve with genetic algorithm (GA). We show that query execution plans with total time minimization objective are inefficient from response time perspective and vice versa. The procedure is tested with simulation experiments for queries of up to 20 joins. Comparison with exhaustive enumeration indicates that GA produced optimal solutions in all cases in much less time.
Keywords
Distributed Databases; Query Optimization; Sub-Query Allocation; Query Execution Plans; Genetic Algorithms;
Citations & Related Records
연도 인용수 순위
  • Reference
1 W. Du, M. Shan and U. Dayal, 'Reducing Multidatabase Query Response Time by Tree Balancing,' Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, San Jose, California, pp.293-303, May, 1995   DOI
2 R. Florin and D. Alin, 'Sketches for Size of Join Estimation,' ACM Transactions on Database Systems, Vol.33, No.3, pp.1- 46, 2008   DOI   ScienceOn
3 O. Frieder and C. Baru, 'Site and Query Scheduling Policies in Multicomputer Database Systems,' IEEE Transactions on Knowledge and Data Engineering, Vol.6, No.4, pp.609-619, Aug., 1994   DOI   ScienceOn
4 G. Graefe, 'Query Evaluation Techniques for Large Databases,' ACM Computing Surveys, Vol.25, pp.73-90, June, 1993   DOI   ScienceOn
5 J. M. Johansson, S. T. March and J. D. Naumann, 'Modeling Network Latency and Parallel Processing in Distributed Database Design,' Decision Sciences, Vol.34, No.4, pp.677-706, 2003   DOI   ScienceOn
6 D. Kossmann, 'The State of the Art in Distributed Query Processing,' ACM Computing Surveys, Vol.32, No.4, pp.422-469, Dec., 2000   DOI   ScienceOn
7 U R. Kulkarni and H. K. Jain, 'Interaction Between Concurrent Transactions in the Design of Distributed Databases,' Decision Sciences, Vol.24, No.2, pp.253-277, 1993   DOI   ScienceOn
8 B. Li and W. Jiang, 'A novel stochastic optimization algorithm,' IEEE Trans. on Systems, Man, and Cybernetics, Part B, Vol.30, No.1, 2000   DOI   ScienceOn
9 D. E. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, Addison-Wesley Publishing, 1989
10 S-J. Lim and Y-K Ng, 'Vertical Fragmentation and Allocation in Distributed Deductive Database Systems,' Information Systems, Vol.22, No.1, pp.1-24, 1997   DOI   ScienceOn
11 S.T. March and S. Rho, 'Allocating Data and Operations to Nodes in Distributed Database Design,' IEEE Trans. on Knowledge and Data Engineering, Vol.7, No.2, April, 1995   DOI   ScienceOn
12 T. Martin, K. Lam and J. Russel, 'An Evaluation of Site Selection Algorithms for Distributed Query Processing,' The Computer Journal, Vol.33, No.1, pp.61-70, 1990   DOI
13 Z. Michalewicz and D. Fogel, How to Solve It: Modern Heuristics, 2nd edition, Springer, Berlin, 2004
14 M. Ozsu and P. Valduriez, Principles of Distributed Database Systems, Englewood Cliffs, Prentice-Hall Inc., 1991
15 S. Seshadri and B. Cooper, 'Routing Queries through a Peer-to-Peer InfoBeacons Network Using Information Retrieval Techniques,' IEEE Transactions on Parallel & Distributed Systems, Vol.18, No.12, pp.1754-1765, Dec., 2007   DOI   ScienceOn
16 A.M. Tamhankar and S. Ram, 'Database Fragmentation and Allocation: An Integrated Methodology and Case Study,' IEEE Trans. on Systems, Man, and Cybernetics, Vol.28, No.3, pp.288-305, May, 1998   DOI   ScienceOn
17 L. The, 'Distributing Data Without Choking the Net,' Datamation, Vol.40, pp.35-36, Jan. 7, 1994
18 C. T. Yu, C. Chang, M. Templeton, D. Brin and E. Lund, 'Query Processing in a Fragmented Relational Distributed System: Mermaid,' IEEE Transactions on Software Engineering, Vol.11, pp.795-809. Aug., 1985   DOI   ScienceOn
19 J. Srivastava and G. Elsesser, 'Optimizing Multi-Join Queries in Parallel Relational Databases,' Proceedings of the 2nd International Conference on Parallel and Distributed Information Systems, pp.84-92, 1993
20 J. D. Schaffer, R. A. Caruana, L. J. Eshlman and R. Das, 'A Study of Control Parameters Affecting Online Performance of Genetic Algorithms for Function Optimization, In J. D. Schaffer, (ed.), Proceedings of the Third International Conference on Genetic Algorithms, pp.51-60, 1989
21 J. Du, R. Alhajj and K. Barker, 'Genetic Algorithms Based Approach to Database Vertical Partitioning,' Journal of Intelligent Information Systems, Vol.26, No.2, pp.167-183, 2006   DOI
22 J. Arcangeli, A. Hameurlain, E. Migeon and F. Morvan, 'Mobile Agent Based Self-Adaptive Join for Wide-Area Distributed Query Processing,' Journal of Database Management, Vol.15, No.4, pp.25-44, 2004   DOI
23 N. Gorla, 'An Object-Oriented Database Design for Improved Performance,' Data and Knowledge Engineering, Vol.37, pp. 117-138, 2001   DOI   ScienceOn
24 A. Kumar, and R. Pathak, 'Genetic Algorithm Based Approach for File Allocation on Distributed Systems,' Computers & Operations Research, Vol.22, No.1, pp.41-55, 1995   DOI   ScienceOn
25 B. Bergsten, M. Couprie and P. Valduriez, 'Overview of Parallel Architectures for Database,' The Computer Journal, Vol.36, pp.734-740, Aug., 1993   DOI   ScienceOn
26 M. Syam, 'Allocating Fragments in Distributed Databases,' IEEE Transactions on Parallel and Distributed Systems, Vol. 16, No.7, pp.577-585, Jul., 2005   DOI   ScienceOn
27 X. Gu, W. Lin and V. Bharadwaj, 'Practically Realizable Efficient Data Allocation and Replication Strategies for Distributed Databases with Buffer Constraints,' IEEE Transactions on Parallel & Distributed Systems, Vol.17, No.9, pp.1001-1013, Sep., 2006   DOI   ScienceOn
28 S.K. Song and N. Gorla, 'A Genetic Algorithm for Vertical Fragmentation and Access Path Selection,' The Computer Journal, Vol.43, No.1, pp.81-93, 2000   DOI   ScienceOn
29 M. Ziane, M. Zait and P. Borla-Salamet, 'Parallel Query Processing in DBS 3,' Proceedings of the 2nd International Conference on Parallel and Distributed Information Systems, pp.93-102. 1993
30 P.M.G Apers, 'Data Allocation in Distributed Database Systems,' ACM Trans. on Database Systems, Vol.13, No.3, pp.263-304, Sep., 1988   DOI   ScienceOn
31 D.W. Cornell and P.S. Yu, 'On Optimal Site Assignment for Relations in the Distributed Database Environment,' IEEE Transactions on Software Engineering, Vol.15, No.8, pp.1004-1009, Aug., 1989   DOI   ScienceOn
32 J. Atkin and M. Norris, Total Area Networking: ATM, Frame Relay and SMDS Explained, John Wiley & Son, New York, N.Y., 1995
33 F. Baiao, M. Mattoso and G. Zaverucha, 'A Distribution Design Methodology for Object DBMS,' Journal of Distributed and Parallel Databases, Vol.16, No.1, pp.45-90, 2004   DOI
34 C-H Cheng, W-K Lee and K-F Wong, 'A Genetic Algorithm- Based Clustering Approach for Database Partitioning,' IEEE Transactions on Systems, Man, and Cybernetics, Vol.32, No.3, pp.215-230, 2002   DOI   ScienceOn
35 J. Cuadrado, Optimize Database Queries, Byte, pp.57-63, July, 1995
36 L. Davis, Handbook of Genetic Algorithms, Van Nostrand Reinhold, New York, N.Y., 1991