19th IEEE International Conference on Tools with Artificial Intelligence - Vol.1 (ICTAI 2007)
Multisets and Clustering XML Documents
Paris, France
October 29-October 31
ISBN: 0-7695-3015-X
We propose a novel and efficient solution to the prob- lem of clustering XML documents based on their structure. We use operations on multisets of paths of document trees to define certain metrics on multi- sets. These metrics are used for clustering real and synthesized XML documents to produce high-quality clusterings.
Citation:
Swami Iyer, Dan A. Simovici, "Multisets and Clustering XML Documents," ictai, vol. 1, pp.267-274, 19th IEEE International Conference on Tools with Artificial Intelligence - Vol.1 (ICTAI 2007), 2007