Browse > Article

Processing of Multiple Regular Path Expressions using PID  

Kim, Jong-Ik (서울대학교 컴퓨터공학부)
Jeong, Tae-Seon (서울대학교 컴퓨터공학부)
Kim, Hyeong-Ju (서울대학교 컴퓨터공학부)
Abstract
Queries on XML are based on paths in the data graph, which is represented as an edge labeled graph model. All proposed query languages for XML express queries using regular expressions to traverse arbitrary paths in the data graph. A meaningful query usually has several regular path expressions in it, but much of recent research is more concerned with optimizing a single path expression. In this paper, we present an efficient technique to process multiple path expressions in a query. We developed a data structure named as the path identifier(PID) to identify whether two given nodes lie on the fame path in the data graph or not, and utilized the PID for efficient processing of multiple path expressions. We implement our technique and present preliminary performance results.
Keywords
XML; regular path; query processing; path identifier;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Serge Abiteboul, Dallan Quass, Jason McHugh, Jennifer Widom, and Janet Wiener, 'The lore! query language for semistructured data,' International Journal on Digital Libraries, 1996   DOI
2 Serge Abiteboul, 'Querying semi-structured data,' In Proceedings of the International Conference on Database Theory, 1997
3 Sara Baase, 'Computer Algorithms. Introduction to Design and Analysis,' Addison-Wesley Publishing Company, 1988
4 T.Bray, J.Paoli, and C.Sperberg-McQueen, 'Extensible markup language(XML) 1.0,' Technical report, W3C Recommandation, 1998
5 Peter Buneman, Susan Davidson, Gerd Hillebrand, and Dan Suciu, 'A query language and optimization techniques for unstructured data,' In Proceedings of the ACM SlGMOD International Conference on the Management of Data, 1996   DOI   ScienceOn
6 Hobin Cover, 'The XML Cover Pages,' http://xml.coverpages.org
7 D.Chamberlin, D.Florescu, I.Robie, j.Simeon, and M.Stefanescu, 'XQuery: A Query Language for XML,' Technical report, W3C Working Draft, February 2001
8 'The Internet Movie Database:' http://www.imdb.com
9 A.Deutsch, M.Fernandez, D.Florescu, ALevy, and D.Suciu, 'Query language for XML:' In Proceedings of Eighth International World Wide Web Conference, 1999   DOI
10 Mary Fernandez and Dan Suciu, 'Optimizing regular path expressions using graph schemas,' In IEEE International Conference on Data Engineering, 1998   DOI
11 Roy Goldman and Jennifer Widom, 'Datafluides: enabling query formulation and optimization in semistructured databases,' In Proceedings of the Conference on Very Large Data Bases, 1997
12 'Astronomical Data Center:' http://tarantella.gsfc.nasa.gov/xml/
13 Quanzhong Li and Bongki Moon, 'Indexing and Querying XML Data for Regular Path Expressions:' In Proceedings of the Conference on Very Large Data Bases, 2001
14 J. McHugh and J. Widom, 'Compile-Time Path Expansion in Lore:' In Proceedings the Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats, 1999
15 Tova Milo and Dan Suciu, 'Index structure for path expressions:' In Proceedings of the International Conference on Database Theory, 1999
16 Peter Buneman, 'Semi-structured data,' In Proceedings of ACM Symposium on Principles of Database Systems, 1997
17 Svetlozar Nestorov, Jeffery D. Ullman, Janet Wiener, and Sudarshan Chawathe, 'Representative objects: concise representations of semistructured, hierarchical data:' In IEEE International Conference on Data Engineering, 1997   DOI
18 Dan Suciu, Mary Fernandez, Susan Davidson, and Peter Bunernan, 'Adding structure to unstructured data:' In Proceedings of the International Conference on Database Theory, 1997
19 W3C, 'XML Path Language(XPath) 1.0:' In W3C Recommendation, 1999
20 John E. Hopcroft and Jeffery D. Ullman, 'Introduction to automata theory, languages, and computation:' Addison-Wesley Publishing Company, 1979
21 Dan Suciu, 'Semistructured data and XML:' In Proceedings of International Confernece on Foundataions of Data Organization, 1998