Browse > Article

An RDBMS-based Inverted Index Technique for Path Queries Processing on XML Documents with Different Structures  

민경섭 (서울대학교 컴퓨터공학과)
김형주 (서울대학교 컴퓨터공학과)
Abstract
XML is a data-oriented language to represent all types of documents including web documents. By means of the advent of XML-based document generation tools and grow of proprietary XML documents using those tools and translation from legacy data to XML documents at an accelerating pace, we have been gotten a large amount of differently-structured XML documents. Therefore, it is more and more important to retrieve the right documents from the document set. But, previous works on XML have mainly focused on the storage and retrieval methods for a large XML document or XML documents had a same DTD. And, researches that supported the structural difference did not efficiently process path queries on the document set. To resolve the problem, we suggested a new inverted index mechanism using RDBMS and proved it outperformed the previous works. And especially, as it showed the higher efficiency in indirect containment relationship, we argues that the index structure is fit for the differently-structured XML document set.
Keywords
XML; XML; Database; Inverted Index Table;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Brian F. Cooper, Neal Samfple, Michael J. Franklin, Gisli R. Hjaltason and Moshe Shadmon, A fast index for semistructured data, VLDB, pp. 341-350, January 2001
2 Chin Wan Chung, Jun-Ki Min, Kyuseok Shim, APEX : An Adaptive Path Index for XML Data, SIGMOD, pp. 121-132, June 2002   DOI
3 Neil Bradley, The XML companion second edition, Addison Wesley, 2000
4 Daniela Florescu and Donald Kossmann, Storing and Querying XML Data using an RDBMS, Bulletin of Data Engineering, pp. 27-34, September 1999
5 Shurug Al-Khalifa, H. V. Jagadish, Nick Koudas, Jignesh M. Patel, Divesh Srivastava, and Yuqing Wu, Structural Joins : A Primitive for Efficient XML Query Pattern Matching, ICDE, pp. 141-153   DOI
6 Shu-Yao Chien, Zografoula Vagena, Donghui Zhang, Vassilis J. Tsotras, and Carlo Zaniolo, Efficient Structural Joins on Indexed XML Documents, VLDB, pp. 263-274, August 2002
7 Chun Zhang, Jeffery Nahgton, David DeWitt, Qiong Luo, and Guy Lohman, On Supporting Containment Queries in Relational Database Management Systems, SIGMOD, pp. 425-436, May 2001   DOI
8 Chiyoung Seo, Sang-won Lee, and Hyoung-Joo Kim, An Efficient Inverted Index Technique for XML Documents using RDBMS, Information and Software Technology (Elsevier Science), Volume 45, Issue 1, pp. 11-22, January 2003   DOI   ScienceOn
9 XML Schema, http://www.w3.org/XML/Schema#dev
10 XSL Transformations (XSLT) Version 1.0 W3C Recommendation, http://www.w3.org/TR/xslt, 1999
11 XMLSpy, http://www.xmlspy.com/
12 Xeena, http://www.alphaworks.ibm.com/tech/xeena
13 Roy Goldman and Jennifer Widom, DataGuides : Enabling Query Formulation and Optimization in Semistructured Databases, VLDB , pp. 436-445, August 1997
14 Tova Milo and Dan Suciu, Index Structures for Path Expressions, ICDT, pp. 277-295, January 1999
15 XML Path Language(XPath) Version 1.0 W3C Recommendation, http://www.w3.org/TR/xpath, 1999
16 Masatoshi Yoshikawa and Toshiyuki Amagasa, XRel : A Path-Based Approach to Storage and Retrieval of XML Documents Using Relational Databases, ACM TOIT, Volume 1, Number 1, pp. 110-141, August 2001   DOI
17 Wisconsin XML Data Set, http://www.cs.wisc.edu/niagara/data.html
18 XQuery 1.0: An XML Query Language W3C Working Draft, http://www.w3.org/TR/xquery/, 2002