Multi-Path Index Scheme for the Efficient Retrieval of XML Data

XML 데이타의 효과적인 검색을 위한 다중 경로 인덱스

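  • 송하주 (School of Computer Science and Engineering, Seoul National University)
  • 김형주 (School of Computer Science and Engineering, Seoul National University)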
  • Published : 2001.02.01

Abstract

Extended path expressions use the '$\ast$' character to denote multiple paths concisely. They are convenient for expressing OQL queries that retrieve XML data stored in OODBs. In this paper, we propose a multi-path index scheme for efficiently processing queries that contain extended path expressions. The scheme allocates a unique path identifier to every possible single path in an extended path expression. By composing the index key with the path identifier, it provides both single-path and multiple-path indexing while using only a single index structure. The proposed scheme performs better than single-path index schemes, and it is practical because it can be implemented with little modification to the leaf records of a B+-tree index.
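As a rough illustration of how path identifiers might be allocated, the sketch below expands an extended path expression into its single paths over a small schema and numbers them. The schema, the class and attribute names, the helper names `expand` and `allocate_path_ids`, and the reading of '$\ast$' as "exactly one step of any name" are all assumptions made for this example; the abstract does not specify them.

```python
from typing import Dict, List, Tuple

# Hypothetical schema graph: class name -> {attribute name: target class}.
SCHEMA: Dict[str, Dict[str, str]] = {
    "Book": {"chapter": "Chapter", "appendix": "Appendix", "title": "String"},
    "Chapter": {"title": "String", "section": "Section"},
    "Appendix": {"title": "String"},
    "Section": {"title": "String"},
}


def expand(root: str, steps: List[str]) -> List[Tuple[str, ...]]:
    """Enumerate every single path matched by an extended path expression.

    Assumption: a '*' step matches exactly one attribute of any name.
    """
    results: List[Tuple[str, ...]] = []

    def walk(cls: str, i: int, prefix: List[str]) -> None:
        if i == len(steps):
            results.append(tuple(prefix))
            return
        attrs = SCHEMA.get(cls, {})
        for name in sorted(attrs):  # sorted for a deterministic numbering
            if steps[i] == "*" or steps[i] == name:
                walk(attrs[name], i + 1, prefix + [name])

    walk(root, 0, [])
    return results


def allocate_path_ids(paths: List[Tuple[str, ...]]) -> Dict[Tuple[str, ...], int]:
    """Give each single path a unique integer identifier, starting at 1."""
    return {path: pid for pid, path in enumerate(paths, start=1)}


# Book.*.title covers Book.appendix.title and Book.chapter.title.
paths = expand("Book", ["*", "title"])
path_ids = allocate_path_ids(paths)
```

Numbering the paths in a fixed order keeps the identifiers stable across runs, which matters if they are stored inside index records.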

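An extended path expression uses the '$\ast$' character to denote multiple paths concisely, which makes it well suited to expressing queries that retrieve XML data stored in an object-oriented database (OODB). This paper proposes a multi-path index scheme as an index structure for efficiently processing OQL queries that contain such extended path expressions. The scheme assigns a unique path identifier to each single path covered by an extended path expression. By storing each index key value together with its path identifier, a single index supports both searches over multiple paths and indexing of an individual path. Compared with using several conventional indexes for an extended path expression, the scheme improves search performance, and it is practical because it can be used without substantially modifying the B+-tree index structure.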
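The idea of combining the index key with a path identifier in one index can be sketched as follows. This is a minimal model, not the paper's implementation: a sorted Python list stands in for the leaf level of a B+-tree, and the record layout `(key, path_id, oid)`, the class name `MultiPathIndex`, and the sample values are assumptions made for illustration.

```python
import bisect
from typing import List, Tuple


class MultiPathIndex:
    """Sorted (key, path_id, oid) records standing in for B+-tree leaf records."""

    def __init__(self) -> None:
        self._records: List[Tuple[str, int, int]] = []

    def insert(self, key: str, path_id: int, oid: int) -> None:
        # Records stay ordered by key first, then by path identifier.
        bisect.insort(self._records, (key, path_id, oid))

    def search_all_paths(self, key: str) -> List[Tuple[int, int]]:
        """Multiple-path lookup: every (path_id, oid) stored under the key."""
        start = bisect.bisect_left(self._records, (key,))
        found = []
        for k, pid, oid in self._records[start:]:
            if k != key:
                break
            found.append((pid, oid))
        return found

    def search_single_path(self, key: str, path_id: int) -> List[int]:
        """Single-path lookup: only the oids reached through one path."""
        start = bisect.bisect_left(self._records, (key, path_id))
        found = []
        for k, pid, oid in self._records[start:]:
            if k != key or pid != path_id:
                break
            found.append(oid)
        return found


idx = MultiPathIndex()
idx.insert("XML", 1, 101)  # reached through the path with identifier 1
idx.insert("XML", 2, 202)  # reached through the path with identifier 2
idx.insert("OQL", 2, 203)
idx.insert("XML", 2, 204)

all_hits = idx.search_all_paths("XML")
one_path_hits = idx.search_single_path("XML", 2)
```

Because records with the same key are adjacent and ordered by path identifier, a multiple-path query is a single range scan over the key, and a single-path query is a narrower scan over the same structure.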
