Normalization of XQuery Queries for Efficient XML Query Processing

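  • Seo-Young Kim (Samsung Electronics, Corporate Technology Operations, Software Center)
  • Ki-Hoon Lee (KAIST, Department of Computer Science)
  • Kyu-Young Whang (KAIST, Department of Computer Science)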
  • Published : 2004.10.01

Abstract

As XML becomes a standard for data representation, integration, and exchange on the Web, several XML query languages have been proposed. The World Wide Web Consortium (W3C) has proposed XQuery as the standard XML query language. Like SQL, XQuery allows nested queries, so normalization rules have been proposed to transform nested XQuery queries into semantically equivalent ones that can be executed more efficiently. However, previous normalization rules apply only to restricted forms of nested XQuery queries. Specifically, they cannot handle FLWR expressions that have nested expressions in the where clause. In this paper, we propose normalization rules for XQuery queries by extending those for SQL queries. Our proposed rules can handle FLWR expressions having nested expressions in every clause. The major contributions of this paper are as follows. First, we classify the nesting types of XQuery queries according to the existence of correlation and aggregation, and we propose normalization rules for each nesting type. Second, we propose detailed algorithms that apply the normalization rules to nested XQuery queries.
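The general idea of unnesting a correlated, aggregated query can be sketched in Python over in-memory data. This is only an illustration under stated assumptions, not the paper's normalization rules: the `publishers` and `books` lists, their field names, and both function names are hypothetical stand-ins for XML documents and FLWR expressions. The nested form re-evaluates the inner query for every outer item, while the unnested form groups the inner data once and then joins, keeping outer items that have no match.

```python
# Hypothetical data standing in for two XML documents.
books = [
    {"title": "A", "publisher": "P1", "price": 30},
    {"title": "B", "publisher": "P1", "price": 50},
    {"title": "C", "publisher": "P2", "price": 20},
]
publishers = [{"name": "P1"}, {"name": "P2"}, {"name": "P3"}]


def nested(publishers, books):
    """Nested form: the correlated, aggregated inner query is
    re-evaluated once per outer item (a scan of books per publisher)."""
    result = []
    for p in publishers:
        prices = [b["price"] for b in books if b["publisher"] == p["name"]]
        result.append((p["name"], max(prices) if prices else None))
    return result


def unnested(publishers, books):
    """Unnested form: aggregate books once per publisher, then look up
    each outer item, so publishers without books are still returned."""
    max_price = {}
    for b in books:
        key = b["publisher"]
        max_price[key] = max(b["price"], max_price.get(key, b["price"]))
    # dict.get returns None for publishers with no books (outer-join effect).
    return [(p["name"], max_price.get(p["name"])) for p in publishers]


if __name__ == "__main__":
    print(nested(publishers, books))
    print(unnested(publishers, books))
```

Both functions return the same result on this data. The unnested version reads `books` once instead of once per publisher, which is the kind of saving that motivates rewriting nested queries into equivalent flat ones.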
