loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
22nd International Conference on Data Engineering Workshops (ICDEW'06)
A Path-sequence Based Discrimination for Subtree Matching in Approximate XML Joins
Atlanta, Georgia
April 03-April 07
ISBN: 0-7695-2571-7
Wenxin Liang, Tokyo Institute of Technology, Japan
Haruo Yokota, Tokyo Institute of Technology, Japan
In this paper, we discuss the one-to-multiple matching problem in leaf-clustering based approximate XML join algorithms and propose a path-sequence based discrimination method to solve this problem. In our method, each path sequence from the top node to the matched leaf in the base and target subtree is extracted, and the most similar target subtree for the base one is determined by the pathsequence based subtree similarity degree. We conduct experiments to evaluate our method by using both real bibliography and bioinformatics XML documents. The experimental results show that our method can effectively decrease the occunence rate of one-to-multiple matching for both bibliography and bioinformatics XML data, and hence improve the precision of the leaf-clustering based approximate XML join algorithms.
Citation:
Wenxin Liang, Haruo Yokota, "A Path-sequence Based Discrimination for Subtree Matching in Approximate XML Joins," icdew, pp.x116, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.