The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.01 - January (2012 vol.24)
pp: 100-113
Liang Xu , National University of Singapore, Singapore
Tok Wang Ling , National University of Singapore, Singapore
Huayu Wu , Institute for Infocomm Research, Singapore
ABSTRACT
Dynamic XML labeling schemes have important applications in XML Database Management Systems. In this paper, we explore dynamic XML labeling schemes from a novel order-centric perspective. We compare the various labeling schemes proposed in the literature with a special focus on their orders of labels. We show that the order of labels fundamentally impacts the update performance of a labeling scheme and develop an order-based framework to classify and characterize XML labeling schemes. Although there are dynamic XML labeling schemes that can completely avoid relabeling, the gain in update performance all come with considerable costs such as larger label size and lower query performance, even if the XML documents are hardly updated. We introduce vector order which is the foundation of the dynamic labeling schemes we propose. Compared with previous solutions that are based on natural order or lexicographical order, vector order is a simple, yet most effective solution to process updates in XML DBMS. We show that vector order can be gracefully applied to both range-based and prefix-based labeling schemes with little overhead introduced. Moreover, vector order-based labeling schemes are not only efficient to process, but also resilient to skewed insertions. Qualitative and experimental evaluations confirm the benefits of our approach compared to previous solutions.
INDEX TERMS
Tree node labeling, XML data management, update processing, query processing.
CITATION
Liang Xu, Tok Wang Ling, Huayu Wu, "Labeling Dynamic XML Documents: An Order-Centric Approach", IEEE Transactions on Knowledge & Data Engineering, vol.24, no. 1, pp. 100-113, January 2012, doi:10.1109/TKDE.2010.221
REFERENCES
[1] T. Amagasa, M. Yoshikawa, and S. Uemura, "QRS: A Robust Numbering Scheme for XML Documents," Proc. 19th Int'l Conf. Data Eng. (ICDE), 2003.
[2] E. Cohen, H. Kaplan, and T. Milo, "Labeling Dynamic XML Trees," Proc. 21st ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (SPDS), 2002.
[3] K. Kobayashi et al., "VLEI Code: An Efficient Labeling Method for Handling XML Documents in an RDB," Proc. 21st Int'l Conf. Data Eng. (ICDE), 2005.
[4] C. Li and T.W. Ling, "QED: A Novel Quaternary Encoding to Completely Avoid Re-Labeling in XML Updates," Proc. Int'l Conf. Information and Knowledge Management (CIKM), 2005.
[5] C. Li, T.W. Ling, and M. Hu, "Efficient Processing of Updates in Dynamic XML Data," Proc. 22nd Int'l Conf. Data Eng. (ICDE), 2006.
[6] C. Li, T.W. Ling, and M. Hu, "Efficient Updates in Dynamic XML Data: From Binary String to Quaternary String," Int'l J. Very Large Data Bases, vol. 17, pp. 573-601, 2008.
[7] I. Tatarinov, S. Viglas, K.S. Beyer, J. Shanmugasundaram, E.J. Shekita, and C. Zhang, "Storing and Querying Ordered XML Using a Relational Database System," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2002.
[8] X. Wu, M.L. Lee, and W. Hsu, "A Prime Number Labeling Scheme for Dynamic Ordered XML Trees," Proc. 20th Int'l Conf. Data Eng. (ICDE), 2004.
[9] L. Xu, Z. Bao, and T.W. Ling, "A Dynamic Labeling Scheme Using Vectors," Proc. 18th Int'l Conf. Database and Expert Systems Applications (DEXA), 2007.
[10] L. Xu, T.W. Ling, H. Wu, and Z. Bao, "DDE: From Dewey to a Fully Dynamic XML Labeling Scheme," Proc. 35th ACM SIGMOD Int'l Conf. Management of Data, 2009.
[11] C. Sun, C.-Y. Chan, and A.K. Goenka, "Multiway SLCA-Based Keyword Search in XML Data," Proc. 16th Int'l Conf. World Wide Web (WWW), 2007.
[12] P. O'Neil, E. O'Neil, S. Pal, I. Cseri, G. Schaller, and N. Westbury, "ORDPATHs: Insert-Friendly XML Node Labels," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2004.
[13] P.F. Dietz, "Maintaining Order in a Linked List," Proc. 14th Ann. ACM Symp. Theory of Computing, 1982.
[14] Q. Li and B. Moon, "Indexing and Querying XML Data for Regular Path Expressions," Proc. 27th Int'l Conf. Very Large Data Bases (VLDB), 2001.
[15] S. Abiteboul, S. Alstrup, H. Kaplan, T. Milo, and T. Rauhe, "Compact Labeling Scheme for Ancestor Queries," SIAM J. Computing Systems, vol. 40, pp. 55-99, 2006.
[16] Univ. of Washington XML Repository, http://www.cs. wash ington.edu/ researchxmldatasets/, 2010.
[17] XMark - An XML Benchmark Project, http://monetdb.cwi.nl/xmldownloads.html, 2011.
[18] C. Zhang, J.F. Naughton, D.J. DeWitt, Q. Luo, and G.M. Lohman, "On Supporting Containment Queries in Relational Database Management Systems," Proc. ACM SIGMOD Int'l Conf. Management of Data, 2001.
16 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool