Browse > Article

Estimating Join Selectivity of Global XQuery Queries in Distributed Environments  

Park, Jong-Hyun (충남대학교 컴퓨터)
Kang, Ji-Hoon (충남대학교 컴퓨터)
Abstract
One of the methods for integrating XML data in distributed environments is using XML view. User can query toward distributed local XML views by using global XQuery queries in XQuery which is a standard query language for searching XML data. The global XQuery queries naturally contain join operations because of integrating and searching distributed heterogeneous data. Since join operations are generally expensive for processing a query, its processing technique is very important for efficient processing of global XQuery queries. Therefore there are some studies on the efficient processing of join operations and one of these studies is that selects minimum join cost by estimating a join selectivity. In case of SQL, there are already some researches for estimating a join selectivity and join cost of global SQL queries. However we can not apply their methods for estimating the selectivity of join operations in SQL queries into XQuery queries because of the structural difference between relational data and XML data. Therefore this paper proposes a method for estimating a selectivity of join operations in XQuery queries using the information of XML views. Our contribution is three threefold. First, we define the difference point for estimating join selectivity between SQL and XQuery. Second, we estimate join selectivity in XQuery queries by referring XML views. Third, we evaluate our estimating method.
Keywords
Global XQuery Processing; Estimating Join Selectivity;
Citations & Related Records
연도 인용수 순위
  • Reference
1 W3C, XQuery 1.0 and XPath 2.0 Functions and Operators, W3C Recommendation 23 January 2007, (http://www.w3.org/TR/2007/REC-xpath-functions-20070123/)
2 W3C, XQuery 1.0: An XML Query Language, W3C Recommendation 23 January 2007, (http://www.w3.org/TR/2007/REC-xquery-20070123)
3 J. Freire, J. Haritsa, M. Ramanath, P. Roy, & J. Simeon, 'Statix: Making XML count,' Proc. of ACM SIGMOD Intl. Conf. on Management of Data, pp. 181-191, 2002
4 N. May, S. Helmer, C. C. Kanne, G. Moerkotte, 'XQuery Processing in Natix with an Emphasis on Join Ordering,' Proc. 1st Int. Workshop on XIME-P 2004, Paris, France, June 17-18, 2004
5 S. Groppe, & S. Bottcher, 'Schema-based Query Optimization for XQuery Queries,' Proc. ADBIS 2005, Tallinn, Estonia, September, 2005
6 A. Aboulnaga, A. R. Alameldeen, & J. F. Naughton, 'Estimating the selectivity of XML path expressions for internet scale applications,' Proc. 27th VLDB, Roma, Italy, 2001
7 C. Shahabi, L. Khan, & D. McLeod, 'A Probe-Based Technique to Optimize Join Queries in Distributed Internet Databases,' Knowledge and Information Systems, pp. 373-385, 2001
8 S. B. Navathe & R. Elmasri, 'Fundamentals of Database Systems,' fourth edition, Addi-son Wesley, 2003
9 L. Liu, C. Pu & K. Richine, 'Distributed Query Scheduling Service: An Architecture and Its Implementation,' IJCIS, Vol.7, No.2-3, pp. 123-166, 1998
10 L. Wang, M. Mulchandani & E. A. Rundensteiner, 'Updating XQuery Views Published over Relational Data : A Roundtrip Case Study,' Xsym 2003, pp. 223-237, Berlin, Germany, September 8, 2003
11 X. Lin & M. E. Orlowska, 'An Efficient Processing of a Chain Join with the Minimum Communication Cost in Distributed Database Systems,' Distributed and Parallel Databases 3, pp. 69-83, 1995   DOI
12 M. J. Yu & P. C.-Y. Sheu, 'Adaptive Join Algorithms in Dynamic Distributed Databases,' Distrib. Parallel Databases, Vol.5, No.1, pp. 5-30, January, 1997   DOI   ScienceOn
13 A. Halverson, J. Burger, L. Galanis, A. Kini, R. Krishnamurthy, A. N. Rao, F. Tian, S. D. Viglas, Y. Wang, J. F. Naughton, & D. J. DeWitt, 'Mixed Mode XML Query Processing,' VLDB 2003
14 I. Eldosouky, H. Arafat, & A. A. Eldin, 'New Heuristic Approaches for Improving Dis-tributed Query Processing based on The Enhancement of Semi-Join Strategies,' Proc. the International Conference on Statistics, Computer Science, and Operational Research, Egypt Dec, 2001
15 I. Manolescu, D. Florescu & D Kossmann, 'Answering XML Queries over Heterogeneous Data Sources,' Proc. 27th International Conference on Very Large Data Bases, Roma, Italy, pp. 241-250, September 11-14, 2001
16 M. Steinbrunn, G. Moerkotte, & A. Kemper, 'Optimizing Join Orders,' Technical Report MIP-9307, Faculty of Mathematic, Univ. of Passau, Passau, Germany, 1993
17 M. Steinbrunn, G. Moerkotte & A. Kemper, 'Heuristic and Randomized Optimization for the Join Ordering Problem,' VLDB Journal, 6(3), pp. 191-208, 1997   DOI   ScienceOn
18 A. N. Swami & K. B. Schiefer, 'On the Estimation of Join Result Sizes,' Proc. EDBT 1994, pp. 287-300, Cambridge, March 28-31, 1994
19 W3C, XML Schema Part 0: Primer Second Edition W3C Recommendation 28 October 2004, (http://www.w3.org/TR/2004/REC-xmlschema -0-20041028)
20 W3C, Extensible Markup Language (XML) 1.1 (Second Edition) W3C Recommendation 16 August 2006, (http://www.w3.org/TR/2006/REC-x ml11-20060816)