A Shared Inlining Method for Resolving the Overlapping Problem of Elements

엘리먼트의 중첩 문제를 해결한 Shared Inlining 저장 기법

  • 홍은지 (성공회대학교 소프트웨어공학과) ;
  • 이영호 ((사)iCOOP생협연대 전산팀)
  • Published : 2008.10.15

Abstract

The number of XML documents, which are widely used as a standard method for information expression and exchange in the web-based environment, increases rapidly along with the growing production of large XML documents. Many studies have been made to store and retrieve these XML documents on RDBMS, among which Shared Inlining storage method has a higher level of retrieval efficiency. The Shared Inlining method is the technique that analyzes the DTD information and stores the XML document in RDBMS by dividing for each node component. This study proposes the technique to resolve the overlapping problem that occurs in the element with several child nodes in the existing Shared Inlining method. The suggested method stores the XML document in the Shared Inlining structures appropriate to the DTD definition and enhances the accuracy of retrieval.

웹 환경에서 정보 표현과 교환을 위한 표준 방식으로 널리 사용되고 있는 XML 문서의 양은 급속히 증가하고 있으며, 대용량의 XML 문서도 많이 생산된다. 이러한 XML 문서들을 RDBMS에 효율적으로 저장하고 검색하는 연구가 활발히 진행되어 왔는데 이들 연구들 중에서 Shared Inlining 저장 방법이 높은 검색 효율을 보인다. Shared Inlining 방법은 DTD의 정보를 분석하여 XML 문서를 노드의 성분별로 분할하여 관계형 데이타베이스로 저장하는 기법이다. 본 논문은 기존의 Shared Inlining 방법에서 여러 하위 노드를 가지는 엘리먼트에서 발생되는 중첩 문제를 해결하기 위한 기법을 제안한다. 이 방법에서는 DTD 정의에 맞게 XML 문서를 Shared Inlining 구조로 저장하고 검색의 정확성을 높인다.

Keywords

References

  1. 이혜자, 정병수, 김대호, 이영구, "경로정보의 중복을 제거한 XML 문서의 저장 및 질의처리 기법", 한국정보처리학회 논문지, VOL.12-D, NO.05, pp. 0663- 0672, 2005
  2. Schmidt, A.R., Waas, F., Kersten, M.L., Florescu, D., Manolescu, I., Carey, M.J., Busse, R., The XML benchmark project, CWI Technical Report, 2001
  3. HONGJUN LU, JEFFREY XU YU et al., "What Makes the Differences: Benchmarking XML Database Implementations," ACM Transactions on Internet Technology, Vol.5, No.1, pp. 154-194, February 2005 https://doi.org/10.1145/1052934.1052940
  4. MASATOSHI YOSHIKAWA and TOSHIYUKI AMAGASA et al., "XRel: A Path-Based Approach to Storage and Retrieval of XML Documents Using Relational Databases," ACM Transactions on Internet Technology, Vol. 1, No. 1, pp. 110-141, August 2001 https://doi.org/10.1145/383034.383038
  5. Roy Goldman and Jennifer Widom, "DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases," Proceedings of 23rd International Conference on Very Large Data Bases, 1997
  6. Kentarou Kido, Toshiyuki Amagasa, Hiroyuki Kitagawa, "Processing XPath Queries in PC- Clusters Using XML Data Partitioning," ICDE Workshops, pp. 114, 2006
  7. Dao Dinh Kha, Masatoshi Yoshikawa, Shunsuke Uemura, "An XML Indexing Structure with Relative Region Coordinate," 17th International Conference on Data Engineering (ICDE'01), pp. 313-320, 2001
  8. J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. De-Witt, J. Naughton, "Relational Databases for Querying XML Documents: Limitations and Opportunities," Proceedings of the 25th VLDB Conference, 1999
  9. "XML Path Language (XPath) 2.0 W3C Recommendation 23 January 2007," http://www.w3.org/ TR/xpath20/
  10. "XML Query (XQuery)," http://www.w3.org/XML/ Query/
  11. "XMark," http://monetdb.cwi.nl/xml/
  12. Shiyong Lu, Yezhou Sun, Mustafa Atay, Farshad Fotouhi, "A New Inlining Algorithm for Mapping XML DTDs to Relational Schemas," In Proceedings of the 1st International Workshop on XML Schema and Data Management, in Conjunction with the 22nd ACM International Conference on Conceptual Modeling, Vol. 2814 of Lecture Notes in Computer Science, pp. 366-377, 2003
  13. Mustafa Atay, Artem Chebotko, Dapeng Liu, Shiyong Lu, Farshad Fotouhi, "Efficient Schema- Based XML-to-Relational Data Mapping," Information Systems (IS), Vol. 32, No. 3, pp. 458-476, 2007 https://doi.org/10.1016/j.is.2005.12.008