The Query Optimization Techniques for XML Data using DTDs

DTD를 이용한 XML 데이타에 대한 질의 최적화 기법

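  • 정태선 (School of Computer Science and Engineering, Seoul National University)
  • 김형주 (School of Computer Science and Engineering, Seoul National University)
  • Published: 2001.12.01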

Abstract

As XML has become an emerging standard for information exchange on the World Wide Web, extracting information from XML data viewed as a database has gained attention in the database community. Because XML closely resembles the graph-based semistructured data model, XML data can be mapped to a semistructured data model based on an edge-labeled graph, and queries can then be processed against it. In this paper we propose new query optimization techniques that use DTDs (Document Type Definitions), which carry schema information about XML data. The proposed techniques achieve the effect of indexing while requiring less additional memory than traditional index techniques. Also, because they preserve the structure of the source database, they can process many kinds of complex queries efficiently. We implemented our techniques and report preliminary performance results on a simple example database.
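
The general idea of combining an edge-labeled graph with schema information can be illustrated with a small Python sketch. This is not the algorithm from the paper, whose details the abstract does not give; it only shows one plausible way schema knowledge can prune a path query. The `SCHEMA` dictionary is a hypothetical, hand-written stand-in for a parsed DTD, and the names `to_edge_labeled_graph`, `can_reach`, and `find_descendants` are assumptions made for this illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical schema summary standing in for a DTD:
# element name -> names of elements allowed as children.
SCHEMA = {
    "bib": {"book", "article"},
    "book": {"title", "author"},
    "article": {"title"},
    "author": {"name"},
}

DOC = (
    "<bib><book><title>T1</title><author><name>A</name></author></book>"
    "<article><title>T2</title></article></bib>"
)


def to_edge_labeled_graph(root):
    """Map an XML tree to labeled edges; node 0 is a virtual root."""
    edges, text, counter = {}, {}, [0]

    def visit(elem, parent_id):
        counter[0] += 1
        node_id = counter[0]
        # The element name labels the edge from parent to child.
        edges.setdefault(parent_id, []).append((elem.tag, node_id))
        text[node_id] = (elem.text or "").strip()
        for child in elem:
            visit(child, node_id)

    visit(root, 0)
    return edges, text


def can_reach(schema, start, target):
    """True if `target` may occur at or below `start` according to the schema."""
    seen, stack = set(), [start]
    while stack:
        label = stack.pop()
        if label == target:
            return True
        if label in seen:
            continue
        seen.add(label)
        stack.extend(schema.get(label, ()))
    return False


def find_descendants(edges, schema, target):
    """Evaluate the query //target, skipping subtrees the schema rules out."""
    results, visited = [], 0
    stack = list(edges.get(0, []))
    while stack:
        label, node = stack.pop()
        if not can_reach(schema, label, target):
            continue  # the schema says no `target` can occur in this subtree
        visited += 1
        if label == target:
            results.append(node)
        stack.extend(edges.get(node, []))
    return sorted(results), visited


root = ET.fromstring(DOC)
edges, text = to_edge_labeled_graph(root)
matches, visited = find_descendants(edges, SCHEMA, "name")
print([text[n] for n in matches], visited)
```

In this sketch the graph keeps the document's original structure, and the schema is consulted only to decide which subtrees to skip, so no separate index over the data is built. For the query `//name`, the `article` subtree and the `title` leaves are never expanded.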
