Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2002.9D.6.999

A Transformation Technique of XML DTD to Relational Database Schema Based On Extracting Common Structure in XML Documents  

Ahn, Sung-Eun (강원대학교 대학원 컴퓨터정보통신공학과)
Choi, Hwang-Kyu (강원대학교 전기전자정보통신학과)
Abstract
XML is emerging as a standard data format to exchange and to present data on the Web. There are increasing needs to efficiently store and to query XML data. In this paper. we propose a new schema transformation algorithm based on a common structure extracting technique from XML documents. The common structure is shared by all XML documents referenced by DTD and the uncommon structure is ununiformly appeared on all XML documents referenced by DTD. Based on the extracted common and uncommon structures, we transform XML DTD into relational database schema. We conduct a performance evaluation based on the number of the generated tables, the size of the record, query processing time and the number of joins on the query. The performance of our algorithm is compared with the existing algorithms, then in most cates, our algorithm is better than the existing ones with respect to the number of the generated tables and appearance of NULL values in the tables.
Keywords
XML(Extensible Markup Language); DTD(Document Type Definition); Relational Database; Schema; Cardinality Operator;
Citations & Related Records
연도 인용수 순위
  • Reference
1 T. Shimura, M. Yoshikawa, and S. Uemura, 'Storage and Retrieval of XML Documents Using Object-Relational Databases,' DEXA, 1999
2 J. Shanmugasundaram, H. Gang, K. Tufte, C. Zhang, D. J. DeWitt, and J. F. Naughton, 'Relational Databases for Querying XML Documents : Limitation and Opportunities,' Proc. of VLDB, Edinburgh, Scotland, pp.302-304, 1999
3 D. Florescu and D. Kossmann, 'Storing and Querying XML Data Using an RDBMS,' Proc. of Int. Conf. on Data Eng., 1999
4 C. Kanne and G. Moerkotte, 'Efficient Storage of XML Data,' Proc. of Int. Conf. on Data Eng., 1998   DOI
5 J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom, 'Lore : A Database Management System for Semi Structured Data,' Technical Report, Stanford University Database Group, February, 1997
6 A. Deutsch, M. F. Fernandez, and D. Suciu, 'Storing Semi-structured Data with STORED,' Proc. of ACM SIGMOD Conference, 1999
7 A. Deutsch, M. Fernandez, D. Florescu, A. Levy, D. Suciu, 'A Query Language for XML,' Proc. of 8th International World Wide Web Conference, 1999
8 S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J. Wisdom, 'The TSIMMIS Project : Integration of Heterogeneous Information Sources,' Proc. of IPSJ Conference, pp.7-18, 1994
9 P. Buneman, S. Davidson, G. Hillebrand, and D. Suciu, 'A Query Language and Optimization Technique for Unstructured Data,' Proc. of ACM SIGMOD International Conference on Management of Data, 1996   DOI   ScienceOn
10 A. Bonifati and S. Ceri, 'Comparative Analysis of Five XML Query Languages,' ACM SIGMOD Record, 29(1), 2000   DOI   ScienceOn
11 http://www.rpbourret.com/xmldbms/index.htm/
12 J. McHugh and J. Widom, 'Query Optimization for XML,' Proc. of Very Large Data Bases, Edinburgh, U.K., 1999
13 M. N. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim, 'XTRACT : A System for Extracting Document Type Descriptors from XML Documents,' Proc. of ACM SIGMOD Conference on Management of Data, Dallas, Texas, May, 2000   DOI   ScienceOn
14 D. W. Lee and W. W. Chu, 'Constraints-preserving Transformation from XML Documents Type Definition to Relation Schema,' UCLA-CS-TR, 2000
15 T. Bray, J. Paoli, C. M. Sperberg-McQueen, 'Extensible Markup Language (XML) 1.0 (Second Edition),' http://www.w3.org/TR/REC-xml, W3C Recommendation 6, October, 2000
16 http://www.cobase.cs.ucla.edu/projects/xpress/