Proceedings 18th International Conference on Data Engineering (2002)
San Jose, California
Feb. 26, 2002 to Mar. 1, 2002
Philip Bohannon , Bell Laboratories
Juliana Freire , Bell Laboratories
Prasan Roy , Bell Laboratories
Jérôme Siméon , Bell Laboratories
As Web applications manipulate an increasing amount of XML, there is a growing interest in storing XML data in relational databases. Due to the mismatch between the complexity of XML's tree structure and the simplicity of flat relational tables, there are many ways to store the same document in and RDBMS, and a number of heuristic techniques have been proposed. These techniques typically define fixed mappings and do not take application characteristics into account. However a fixed mapping is unlikely to work well for all possible applications. In contrast, LegoDB is a cost-based XML storage mapping engine that explores and space of possible XML-to-relational mappings and selects the best mapping for a given application. LegoDB leverages current XML and relational technologies: 1) is models the target application with an XML Schema, XML data statistics, and an Xquery workload; 2) the space of configurations is generated through XML-Schema rewritings; and 3) the best among the derived configurations is selected using cost estimates obtained through a standard relational optimizer. In this paper, we describe the LegoDB storage engine and provide experimental results that demonstrate the effectiveness of this approach.
J. Freire, P. Bohannon, J. Siméon and P. Roy, "From XML Schema to Relations: A Cost-Based Approach to XML Storage," Proceedings 18th International Conference on Data Engineering(ICDE), San Jose, California, 2002, pp. 0064.