Browse > Article

XML Schema Matching based on Ontology Update for the Transformation of XML Documents  

Lee, Kyong-Ho (연세대학교 컴퓨터과학과)
Lee, Jun-Seung (연세대학교 컴퓨터과학과)
Abstract
Schema matching is important as a prerequisite to the transformation of XML documents. This paper presents a schema matching method for the transformation of XML documents. The proposed method consists of two steps: preliminary matching relationships between leaf nodes in the two XML schemas are computed based on proposed ontology and leaf node similarity, and final matchings are extracted based on a proposed path similarity. Particularly, for a sophisticated schema matching, the proposed ontology is incrementally updated by users' feedback. furthermore, since the ontology can describe various relationships between concepts, the proposed method can compute complex matchings as well as simple matchings. Experimental results with schemas used in various domains show that the proposed method is superior to previous works, resulting in a precision of 97% and a recall of 83 % on the average. Furthermore, the dynamic ontology increased by 9 percent overall.
Keywords
XML; Schema matching; Ontology; Complex matching;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Sergey Melnik, Hector Garcia-Molina, and Erhard Rahm, 'Similarity Flooding - A Versatile Graph Matching Algorithm,' Proc. Int'l Conf. Data Engineering, pp. 117-128, 2002   DOI
2 Li Xu and David W. Embley, 'Discovering direct and indirect matches for schema elements,' Proc. Int'l Conf. Database Systems for Advanced Applications, pp. 39-46, 2003   DOI
3 Robin Dhamankar, Yoonkyong Lee, AnHai Doan, and Alon Halevy, 'iMAP: Discovering Complex Semantic Mappings between Database Schemas,' Proc. Int'l Conf. SIGMOD, pp. 383-394, 2004   DOI
4 George A. Miller, 'WordNet: A Lexical Database for English,' Communications of the ACM, Vol. 38, No. 11, pp, 39-41, 1995   DOI   ScienceOn
5 Hong Hai Do, Sergey Melnik, and Erhard Rahm, 'Comparison of Schema Matching Evaluations,' Lecture Notes in Computer Science, Vol. 2593, pp. 221-237, 2002
6 Renee J. Miller, Laura M. Haas, Mauricio A. Hernandez, Lingling Yan, C. T. Howard Ho, Ronald Fagin, and Lucian Popa, 'The Clio Project: Managing Heterogeneity,' SIGMOD Record, Vol. 30, No. 1, pp. 78-83, 2001   DOI   ScienceOn
7 Hong-Hai Do and Erhard Rahm, 'COMA - A System for Flexible Combination of Schema Matching Approaches,' Proc. Int'l Conf. Very Large Data Bases, pp. 610-621, 2002
8 Jayant Madhavan, Philip A. Bernstein, and Erhard Rahm, 'Generic Schema Matching with Cupid,' Proc. Int'l Conf. Very Large Data Bases, pp, 49-58, 2001
9 Hong Su, Harumi Kuno, and Elke A. Rundensteiner, 'Automating the Transformation of XML Documents,' Proc. Int'l Workshop Web Information and Data Management, pp. 68-75, 2001   DOI
10 Mong Li Lee, Wynne Hsu, LiangHuai Yang, and Xia Yang, 'XClust: Clustering XML Schemas for Effective Integration,' Proc. Int'l Conf. Information and Knowledge Management, pp, 292-299, 2002   DOI
11 Wen-Syan Li and Chris Clifton, 'Semantic Integration in Heterogeneous Databases Using Neural Networks,' Proc. Int'l Conf. Very Large DataBase, pp. 1-12, 1994
12 AnHai Doan, Pedro Domingos, and Alon Halevy, 'Learning to Match Schemas of Data Sources: A Multistrategy Approach,' Machine Learning, Vol. 50, No. 3, pp. 279-301, 2003   DOI
13 Sonia Bergamaschi, Silvana Castano, Sabrina De Capitani di Vimercati, S. Montanari, and Maurizio Vincini, 'An Intelligent Approach to Information Integration,' Proc. Int'l Conf. Formal Ontology in Information Systems, pp. 253-267, 1998
14 Tova Milo and Sagit Zohar, 'Using Schema Matching to Simplify Heterogeneous Data Translation,' Proc. Int'l Conf. Very Large Data Bases, pp. 122-133, 1998
15 Barbara Staudt Lerner, 'A Model for Compound Type Changes Encountered in Schema Evolution,' ACM Transactions Database Systems, Vol. 25, No. 1, pp. 83-127, 2000   DOI   ScienceOn
16 Eila Kuikka, Paula Leinonen, and Martti Penttonen, 'Toward Automating of Document Structure Transformations,' Proc. ACM Symposium Document Engineering, pp. 103-110, 2002   DOI
17 World Wide Web Consortium, Extensible Markup Language(XML) 1.0(Second Edition), W3C Recommendation, http://www.w3c.org/TR/REC-xml, 2000
18 World Wide Web Consortium, XML Schema 1.0, W3C Recommendation, http://www.w3.org/TR/xmlschema-0/, 2001
19 World Wide Web Consortium, XSL Transformations (XSLT) 1.0, W3C Recommendation, http://www.w3.org/TR/1999/REC-xslt-19991116, 1999
20 MicroSoft biztalk mapper, http://www.microsoft.com/biztalk/
21 XSL Wiz, http://www.induslogic.com/
22 Erhard Rahm and Philip A. Bernstein, 'A Survey of Approaches to Automatic Schema Matching,' Very Large Data Bases Journal, Vol. 10, No. 4, pp. 334-350, 2001   DOI