|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
17th International Conference on Database and Expert Systems Applications (DEXA'06)
Finding Syntactic Similarities Between XML Documents
Krakow, Poland
September 04-September 08
ISBN: 0-7695-2641-1
| ASCII Text | x | ||
| Davood Rafiei, Daniel L. Moise, Dabo Sun, "Finding Syntactic Similarities Between XML Documents," 2012 23rd International Workshop on Database and Expert Systems Applications, pp. 512-516, 17th International Conference on Database and Expert Systems Applications (DEXA'06), 2006. | |||
| BibTex | x | ||
| @article{ 10.1109/DEXA.2006.62, author = {Davood Rafiei and Daniel L. Moise and Dabo Sun}, title = {Finding Syntactic Similarities Between XML Documents}, journal ={2012 23rd International Workshop on Database and Expert Systems Applications}, volume = {0}, year = {2006}, issn = {1529-4188}, pages = {512-516}, doi = {http://doi.ieeecomputersociety.org/10.1109/DEXA.2006.62}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - 2012 23rd International Workshop on Database and Expert Systems Applications TI - Finding Syntactic Similarities Between XML Documents SN - 1529-4188 SP512 EP516 A1 - Davood Rafiei, A1 - Daniel L. Moise, A1 - Dabo Sun, PY - 2006 KW - null VL - 0 JA - 2012 23rd International Workshop on Database and Expert Systems Applications ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DEXA.2006.62
Detecting structural similarities between XML documents has been the subject of several recent work, and the proposed algorithms mostly use tree edit distance between the corresponding trees of XML documents. However, evaluating a tree edit distance is computationally expensive and does not easily scale up to large collections. We show in this paper that a tree edit distance computation often is not necessary and can be avoided. In particular, we propose a concise structural summary of XML documents and show that a comparison based on this summary is both fast and effective. Our experimental evaluation shows that this method does an excellent job of grouping documents generated by the same DTD, outperforming some of the previously proposed solutions based on a tree comparison. Furthermore, the time complexity of the algorithm is linear on the size of the structural description.
Citation:
Davood Rafiei, Daniel L. Moise, Dabo Sun, "Finding Syntactic Similarities Between XML Documents," dexa, pp.512-516, 17th International Conference on Database and Expert Systems Applications (DEXA'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.
