The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.08 - August (2004 vol.16)
pp: 965-979
Kyong-Ho Lee , IEEE Computer Society
Yoon-Chul Choy , IEEE Computer Society
Sung-Bae Cho , IEEE Computer Society
ABSTRACT
<p><b>Abstract</b>—SGML/XML are having a profound impact on data modeling and processing. This paper presents an efficient algorithm to compute differences between old and new versions of an SGML/XML document. The difference between the two versions can be considered to be an edit script that transforms one document tree into another. The proposed algorithm is based on a hybridization of bottom-up and top-down methods: The matching relationships between nodes in the two versions are produced in a bottom-up manner and then the top-down breadth-first search computes an edit script. Faster matching is achieved because the algorithm does not need to investigate the possible existence of matchings for all nodes. Furthermore, it can detect structurally meaningful changes such as the movement and copy of a subtree as well as simple changes to the node itself like insertion, deletion, and update.</p>
INDEX TERMS
Change detection, difference computation, edit script, edit operation, structured document, SGML, XML.
CITATION
Kyong-Ho Lee, Yoon-Chul Choy, Sung-Bae Cho, "An Efficient Algorithm to Compute Differences between Structured Documents", IEEE Transactions on Knowledge & Data Engineering, vol.16, no. 8, pp. 965-979, August 2004, doi:10.1109/TKDE.2004.19
61 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool