The Community for Technology Leaders
Green Image
Issue No. 08 - August (2011 vol. 23)
ISSN: 1041-4347
pp: 1261-1273
Jennifer Widom , Stanford University, Stanford
Robert Ikeda , Stanford University, Stanford
Raghotham Murthy , Stanford University, Stanford
ABSTRACT
We describe how aggregation is handled in the Trio system for uncertain and probabilistic data. Because “exact” aggregation in uncertain databases can produce exponentially sized results, we provide three alternatives: a low bound on the aggregate value, a high bound on the value, and the expected value. These variants return a single result instead of a set of possible results, and they are generally efficient to compute for both full-table and grouped aggregation queries. We provide formal definitions and semantics and a description of our open source implementation for single-table aggregation queries. We study the performance and scalability of our algorithms through experiments over a large synthetic data set. We also provide some preliminary results on aggregations over joins.
INDEX TERMS
Database management, query processing.
CITATION
Jennifer Widom, Robert Ikeda, Raghotham Murthy, "Making Aggregation Work in Uncertain and Probabilistic Databases", IEEE Transactions on Knowledge & Data Engineering, vol. 23, no. , pp. 1261-1273, August 2011, doi:10.1109/TKDE.2010.166
93 ms
(Ver )