
An Algorithm Generating Edit Scripts for XML Documents  

Lee, Suk-Kyoon (Division of Computer Science and Engineering, Dankook University)
Abstract
While detecting changes in XML documents has emerged as a major research area, research on edit scripts, which are the output of change detection, remains far from satisfactory. In this paper, we present X-treeESgen, the algorithm that generates edit scripts in X-treeDiff+, a change detection algorithm for XML documents. X-treeESgen has a time complexity of O(n) and supports move and copy operations in addition to the basic update, insert, and delete operations. Note that no other change detection algorithm provides all of these operations. Also, because edit script generation in X-treeESgen is designed to be independent of the matching process, various tuning techniques may be applied.
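The abstract does not spell out X-treeESgen itself, so the following is only a minimal Python sketch of the general idea it describes: deriving an edit script from a node matching that was computed separately. It is not the paper's algorithm. The `Node` class, the `link`, `walk`, and `edit_script` helpers, the tuple layout of each operation, and the sample documents are all hypothetical, and the copy operation is left out for brevity. Each tree is traversed once using dictionary lookups, so the work grows linearly with the number of nodes.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List, Optional, Tuple


@dataclass
class Node:
    """Hypothetical tree node standing in for an XML element."""
    node_id: str
    label: str
    value: str = ""
    children: List["Node"] = field(default_factory=list)
    parent: Optional["Node"] = field(default=None, repr=False, compare=False)


def link(node: Node) -> Node:
    """Fill in parent pointers below `node` and return it."""
    for child in node.children:
        child.parent = node
        link(child)
    return node


def walk(node: Node) -> Iterator[Node]:
    """Yield nodes in document (pre-order) order."""
    yield node
    for child in node.children:
        yield from walk(child)


def edit_script(old_root: Node, new_root: Node,
                matching: Dict[str, str]) -> List[Tuple]:
    """Turn a precomputed matching (old id -> new id) into operations.

    The matching is an input here; how it is computed is a separate step.
    """
    old_by_id = {n.node_id: n for n in walk(old_root)}
    new_to_old = {new_id: old_id for old_id, new_id in matching.items()}
    script: List[Tuple] = []

    # One pass over the new tree: inserts, updates, and moves.
    for new in walk(new_root):
        new_parent = new.parent.node_id if new.parent else None
        old_id = new_to_old.get(new.node_id)
        if old_id is None:
            script.append(("insert", new.node_id, new_parent))
            continue
        old = old_by_id[old_id]
        if old.value != new.value:
            script.append(("update", old_id, new.value))
        old_parent = old.parent.node_id if old.parent else None
        mapped_parent = matching.get(old_parent) if old_parent else None
        if mapped_parent != new_parent:
            # The matched node now sits under a different parent.
            script.append(("move", old_id, new_parent))

    # One pass over the old tree: unmatched nodes are deleted.
    for old in walk(old_root):
        if old.node_id not in matching:
            script.append(("delete", old.node_id))
    return script


old_doc = link(Node("o1", "doc", children=[
    Node("o2", "title", "Draft"),
    Node("o3", "sec", children=[Node("o4", "para", "hello")]),
    Node("o5", "note", "old"),
]))
new_doc = link(Node("n1", "doc", children=[
    Node("n2", "title", "Final"),
    Node("n3", "sec"),
    Node("n4", "para", "hello"),
    Node("n6", "author", "Lee"),
]))
matching = {"o1": "n1", "o2": "n2", "o3": "n3", "o4": "n4"}
script = edit_script(old_doc, new_doc, matching)
print(script)
```

For the sample documents, the script contains one update (the title text), one move (the paragraph leaves its section), one insert (the author element), and one delete (the note). Because `edit_script` only reads the matching, the matching step can be tuned or replaced without touching the script generation, which mirrors the separation the abstract describes.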
Keywords
XML; diff; edit script; change detection