Processing of Multiple Regular Path Expressions using PID

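  • 김종익 (School of Computer Science and Engineering, Seoul National University)
  • 정태선 (School of Computer Science and Engineering, Seoul National University)
  • 김형주 (School of Computer Science and Engineering, Seoul National University)
  • Published: 2002.08.01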

Abstract

Queries on XML are based on paths in the data graph, which is modeled as an edge-labeled graph, and XML query languages use regular expressions to denote arbitrary paths in that graph. A meaningful query usually contains several regular path expressions, yet most existing research addresses the optimization of queries consisting of a single regular path expression. In this paper, we present a technique for processing queries with multiple regular path expressions without traversing the data graph. To this end, we propose a data structure called the path identifier (PID), which directly determines whether a path exists between two given nodes in the data graph (that is, whether the two nodes lie on the same path), and we describe how PIDs are constructed. We also implement the proposed technique and present preliminary performance results.
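
The abstract does not say how a PID is constructed, so the sketch below is not the paper's method. As a stand-in, it uses ordinary depth-first interval labeling on a tree-shaped data graph to illustrate the kind of test a PID is meant to answer: deciding from node labels alone, without traversing the graph at query time, whether two nodes lie on the same root-to-leaf path. The function names, the dictionary-based tree, and the sample element names are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Interval = Tuple[int, int]


def label_intervals(children: Dict[str, List[str]], root: str) -> Dict[str, Interval]:
    """Assign each node a (start, end) interval with an iterative depth-first walk.

    A node's interval encloses the intervals of all of its descendants.
    This is plain interval labeling, NOT the PID construction from the paper.
    """
    labels: Dict[str, Interval] = {}
    start: Dict[str, int] = {}
    counter = 0
    stack = [(root, False)]
    while stack:
        node, finished = stack.pop()
        if finished:
            # All descendants are labeled; close this node's interval.
            labels[node] = (start[node], counter)
        else:
            start[node] = counter
            stack.append((node, True))
            # Push children in reverse so they are visited in document order.
            for child in reversed(children.get(node, [])):
                stack.append((child, False))
        counter += 1
    return labels


def on_same_path(labels: Dict[str, Interval], u: str, v: str) -> bool:
    """Return True if one node's interval encloses the other's.

    Enclosure means one node is an ancestor of (or equal to) the other,
    so both lie on a single root-to-leaf path.
    """
    su, eu = labels[u]
    sv, ev = labels[v]
    return (su <= sv and ev <= eu) or (sv <= su and eu <= ev)


# Hypothetical document tree: movie -> title, cast; cast -> actor.
tree = {"movie": ["title", "cast"], "cast": ["actor"]}
labels = label_intervals(tree, "movie")
print(on_same_path(labels, "movie", "actor"))
print(on_same_path(labels, "title", "actor"))
```

With labels of this kind, the results of two path expressions can be combined by comparing node labels pairwise rather than by walking the graph between candidate nodes. Handling general graphs with shared or cyclic references would need a different scheme.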
