|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
21st International Conference on Data Engineering (ICDE'05)
Vectorizing and Querying Large XML Repositories
Tokyo, Japan
April 05-April 08
ISBN: 0-7695-2285-8
| ASCII Text | x | ||
| Peter Buneman, Byron Choi, Wenfei Fan, Robert Hutchison, Robert Mann, Stratis D. Viglas, "Vectorizing and Querying Large XML Repositories," Data Engineering, International Conference on, pp. 261-272, 21st International Conference on Data Engineering (ICDE'05), 2005. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDE.2005.150, author = {Peter Buneman and Byron Choi and Wenfei Fan and Robert Hutchison and Robert Mann and Stratis D. Viglas}, title = {Vectorizing and Querying Large XML Repositories}, journal ={Data Engineering, International Conference on}, volume = {0}, year = {2005}, issn = {1084-4627}, pages = {261-272}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDE.2005.150}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Engineering, International Conference on TI - Vectorizing and Querying Large XML Repositories SN - 1084-4627 SP261 EP272 A1 - Peter Buneman, A1 - Byron Choi, A1 - Wenfei Fan, A1 - Robert Hutchison, A1 - Robert Mann, A1 - Stratis D. Viglas, PY - 2005 KW - null VL - 0 JA - Data Engineering, International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDE.2005.150
Vertical partitioning is a well-known technique for optimizing query performance in relational databases. An extreme form of this technique, which we call vectorization, is to store each column separately. We use a generalization of vectorization as the basis for a native XML store. The idea is to decompose an XML document into a set of vectors that contain the data values and a compressed skeleton that describes the structure. In order to query this representation and produce results in the same vectorized format, we consider a practical fragment of XQuery and introduce the notion of query graphs and a novel graph reduction algorithm that allows us to leverage relational optimization techniques as well as to reduce the unnecessary loading of data vectors and decompression of skeletons. A preliminary experimental study based on some scientific and synthetic XML data repositories in the order of gigabytes supports the claim that these techniques are scalable and have the potential to provide performance comparable with established relational database technology.
Citation:
Peter Buneman, Byron Choi, Wenfei Fan, Robert Hutchison, Robert Mann, Stratis D. Viglas, "Vectorizing and Querying Large XML Repositories," icde, pp.261-272, 21st International Conference on Data Engineering (ICDE'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.
