17th International Conference on Data Engineering (ICDE'01)
Counting Twig Matches in a Tree
Heidelberg, Germany
April 2-6, 2001
ISBN: 0-7695-1001-9
BibTeX:
@article{10.1109/ICDE.2001.914874,
  author = {Zhiyuan Chen and H.V. Jagadish and Flip Korn and Nick Koudas and S. Muthukrishnan and Divesh Srivastava and Raymond Ng},
  title = {Counting Twig Matches in a Tree},
  journal = {Data Engineering, International Conference on},
  volume = {0},
  year = {2001},
  isbn = {0-7695-1001-9},
  pages = {0595},
  doi = {http://doi.ieeecomputersociety.org/10.1109/ICDE.2001.914874},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
Abstract: We describe efficient algorithms for accurately estimating the number of matches of a small node-labeled tree, i.e., a twig, in a large node-labeled tree, using a summary data structure. This problem is of interest for queries on XML and other hierarchical data, to provide query feedback and for cost-based query optimization. Our summary data structure scalably represents approximate frequency information about twiglets (i.e., small twigs) in the data tree. Given a twig query, the number of matches is estimated by creating a set of query twiglets, and combining two complementary approaches: Set Hashing, used to estimate the number of matches of each query twiglet, and Maximal Overlap, used to combine the query twiglet estimates into an estimate for the twig query. We propose several estimation algorithms that apply these approaches on query twiglets formed using variations on different twiglet decomposition techniques. We present an extensive experimental evaluation using several real XML data sets, with a variety of twig queries. Our results demonstrate that accurate and robust estimates can be achieved, even with limited space.
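The abstract names Set Hashing as the technique for estimating twiglet matches but does not give its details. As a rough illustration only, the sketch below shows generic min-wise set hashing: each set is reduced to a short signature, and the size of the intersection of two sets is estimated from the signatures alone. Everything here is an assumption made for illustration and is not taken from the paper: the idea that each summarized path carries a set of node identifiers, the function names (`signature`, `resemblance`, `intersection_size`), the choice of BLAKE2b as the hash family, and the signature length.

```python
import hashlib


def _hash(seed: int, item: str) -> int:
    """Deterministic 64-bit hash of `item`, salted by `seed` (hypothetical hash family)."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def signature(items, num_hashes: int = 256) -> list:
    """Min-hash signature: for each seeded hash function, keep the minimum hash value."""
    items = list(items)
    if not items:
        raise ValueError("cannot build a signature for an empty set")
    return [min(_hash(seed, x) for x in items) for seed in range(num_hashes)]


def resemblance(sig_a, sig_b) -> float:
    """Estimate |A n B| / |A u B| as the fraction of signature positions that agree."""
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def intersection_size(sig_a, size_a: int, sig_b, size_b: int) -> float:
    """Estimate |A n B| from the resemblance r and the two set sizes.

    From r = I / (|A| + |B| - I) it follows that I = r * (|A| + |B|) / (1 + r).
    """
    r = resemblance(sig_a, sig_b)
    return r * (size_a + size_b) / (1 + r)


# Usage: two hypothetical sets of node identifiers that truly share 500 members.
nodes_a = [f"n{i}" for i in range(0, 1000)]
nodes_b = [f"n{i}" for i in range(500, 1500)]
sig_a = signature(nodes_a)
sig_b = signature(nodes_b)
estimate = intersection_size(sig_a, len(nodes_a), sig_b, len(nodes_b))
```

Only the fixed-length signatures and the set sizes need to be stored, which is what makes this kind of estimate attractive when space is limited. The estimate becomes tighter as `num_hashes` grows, at the cost of a larger summary.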
Citation:
Zhiyuan Chen, H.V. Jagadish, Flip Korn, Nick Koudas, S. Muthukrishnan, Divesh Srivastava, Raymond Ng, "Counting Twig Matches in a Tree," 17th International Conference on Data Engineering (ICDE'01), p. 595, 2001.
