The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.11 - Nov. (2012 vol.24)
pp: 2008-2024
Dunren Che , Southern Illinois University, Carbondale
Tok Wang Ling , National University of Singapore, Singapore
Wen-Chi Hou , Southern Illinois University, Carbondale
ABSTRACT
Twig pattern matching is a critical operation for XML query processing, and the holistic computing approach has shown superior performance over other methods. Since Bruno et al. introduced the first holistic twig join algorithm, TwigStack, numerous so-called holistic twig join algorithms have been proposed. Yet practical XML queries often require support for more general twig patterns, such as the ones that allow arbitrary occurrences of an arbitrary number of logical connectives (AND, OR, and NOT); such types of twigs are referred to as B-twigs (i.e., Boolean-Twigs) or AND/OR/NOT-twigs. We have seen interesting work on generalizing the holistic twig join approach to AND/OR-twigs and AND/NOT-twigs, but have not seen any further effort addressing the problem of AND/OR/NOT-Twigs at the full scale, which therefore forms the main theme of this paper. In this paper, we investigate novel mechanisms for efficient B-twig pattern matching. In particular, we introduce “B-twig normalization” as an important first-step in our approach toward eventually conquering the complexity of B-twigs, and then present BTwigMerge—the first holistic twig join algorithm designed for B-twigs. Both analytical and experimental results show that BTwigMerge is optimal for B-twig patterns with AD (Ancestor-Descendant) edges and/or PC (Parent-Child) edges.
INDEX TERMS
XML, Pattern matching, Anodes, Algorithm design and analysis, Complexity theory, Query processing, logical predicate., Query processing, database management, XML data querying, twig join, boolean twig
CITATION
Dunren Che, Tok Wang Ling, Wen-Chi Hou, "Holistic Boolean-Twig Pattern Matching for Efficient XML Query Processing", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 11, pp. 2008-2024, Nov. 2012, doi:10.1109/TKDE.2011.128
REFERENCES
[1] Stylus Studio XML Generator, http://www.stylusstudio.comxml_generator.html , 2012.
[2] Univ. of Washington XML Repository, http://www.cs. washington.edu/researchxmldata sets /, 2012.
[3] XMark ? An XML Benchmark Project, http:/www.xml- benchmark.org/, 2012.
[4] S. Al-Khalifa et al., "Structural Joins: A Primitive for Efficient XML Query Pattern Matching," Proc. 18th Int'l Conf. Data Eng. Conf. (ICDE' 02), pp. 141-152, 2002.
[5] N. Bruno, N. Koudas, and D. Srivastava, "Holistic Twig Joins: Optimal XML Pattern Matching," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD' 02), pp. 310-321, June 2002.
[6] T. Chen, J. Lu, and T.W. Ling, "On Boosting Holism in XML Twig Pattern Matching Using Structural Indexing Techniques," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD' 05), pp. 455-466, June 2005.
[7] H. Jiang, H. Lu, and W. Wang, "Efficient Processing of Twig Queries with OR-Predicates," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD' 04), pp. 59-70, 2004.
[8] H. Jiang, W. Wang, H. Lu, and J.X. Yu, "Holistic Twig Joins on Indexed XML Documents," Proc. 29th Int'l Conf. Very Large Data Bases (VLDB' 03), pp. 273-284, Sept. 2003.
[9] J. Lu, T. Chen, and T.W. Ling, "Efficient Processing of XML Twig Patterns with Parent Child Edges: A Look-ahead Approach," Proc. 13th ACM Int'l Conf. Information and Knowledge Management (CIKM' 04), pp. 533-542, Nov. 2004.
[10] J. Lu, T.W. Ling, C.-Y. Chan, and T. Chen, "From Region Encoding to Extended Dewey: On Efficient Processing of XML Twig Pattern Matching," Proc. 31st Int'l Conf. Very Large Data Bases (VLDB' 05), pp. 193-204, Aug. 2005.
[11] J. Lu et al., "Efficient Processing of Ordered XML Twig Pattern," Proc. 16th Int'l Conf. Database and Expert Systems Applications (DEXA' 05), pp. 300-309, 2005.
[12] X. Xu, Y. Feng, and F. Wang, "Efficient Processing of XML Twig Queries with All Predicates," Proc. IEEE/ACIS Int'l Conf. Computer and Information Science (ICIS '09), pp. 457-462, June 2009.
[13] T. Yu, T.W. Ling, and J. Lu, "$twigstacklist\lnot$ : A Holistic Twig Join Algorithm for Twig Query with Not-Predicates on XML Data," Proc. 11th Int'l Conf. Database Systems for Advanced Applications (DASFAA' 06), pp. 249-263, 2006.
[14] C. Zhang et al., "On Supporting Containment Queries in Relational Database Management Systems," Proc. ACM SIGMOD Int'l Conf. Management of Data (SIGMOD' 01), pp. 425-436, May 2001.
22 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool