15th International Conference on Scientific and Statistical Database Management
A Quad-Tree Based Multiresolution Approach for Two-dimensional Summary Data
Cambridge, Massachusetts, USA
July 9-11, 2003
ISBN: 0-7695-1964-4
In many application contexts, such as statistical databases, scientific databases, query optimizers, and OLAP, data are often summarized into synopses of aggregate values. Summarization has the great advantage of saving space, but querying aggregate data rather than the original data introduces estimation errors that cannot, in general, be avoided, because summarization is a lossy compression. A central problem in designing summarization techniques is therefore to retain a certain degree of accuracy in reconstructing query answers. In this paper we restrict our attention to two-dimensional data, which are relevant for a number of applications, and propose a hierarchical summarization technique combined with the use of indices, i.e., compact structures providing an approximate description of portions of the original data. Experimental results show that the technique gives approximation errors much smaller than those of other "general purpose" techniques, such as wavelets and various types of multi-dimensional histograms.
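To make the idea of hierarchical summarization of two-dimensional data concrete, the following is a minimal illustrative sketch, not the authors' algorithm. It builds a quad-tree whose nodes store only the sum of a rectangular block and estimates a range-sum query from those sums. The names Node, build, and estimate are hypothetical. Where the paper uses indices to describe the data inside a block, this sketch substitutes the much cruder assumption that values are spread uniformly within each leaf block.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    # Half-open block [r0, r1) x [c0, c1) and the sum of the values inside it.
    r0: int
    c0: int
    r1: int
    c1: int
    total: float
    children: List["Node"] = field(default_factory=list)


def build(data, r0, c0, r1, c1, depth):
    """Summarize data[r0:r1][c0:c1] as a quad-tree of block sums (hypothetical helper)."""
    total = sum(data[r][c] for r in range(r0, r1) for c in range(c0, c1))
    node = Node(r0, c0, r1, c1, total)
    # Stop splitting at the depth limit, at single cells, or at all-zero blocks.
    if depth > 0 and total != 0 and (r1 - r0 > 1 or c1 - c0 > 1):
        rm, cm = (r0 + r1 + 1) // 2, (c0 + c1 + 1) // 2
        quadrants = ((r0, c0, rm, cm), (r0, cm, rm, c1),
                     (rm, c0, r1, cm), (rm, cm, r1, c1))
        for a, b, c, d in quadrants:
            if a < c and b < d:  # skip empty quadrants of thin blocks
                node.children.append(build(data, a, b, c, d, depth - 1))
    return node


def estimate(node, qr0, qc0, qr1, qc1):
    """Estimate the sum over the half-open query [qr0, qr1) x [qc0, qc1)."""
    rows = min(node.r1, qr1) - max(node.r0, qr0)
    cols = min(node.c1, qc1) - max(node.c0, qc0)
    if rows <= 0 or cols <= 0:
        return 0.0  # query does not touch this block
    area = (node.r1 - node.r0) * (node.c1 - node.c0)
    if rows * cols == area:
        return float(node.total)  # block fully covered: stored sum is exact
    if not node.children:
        # Assumption of this sketch: values are uniform inside a leaf block.
        return node.total * rows * cols / area
    return sum(estimate(ch, qr0, qc0, qr1, qc1) for ch in node.children)


data = [
    [1, 2, 0, 0],
    [3, 4, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 8, 0],
]
coarse = build(data, 0, 0, 4, 4, depth=1)  # lossy: four 2x2 block sums
fine = build(data, 0, 0, 4, 4, depth=2)    # splits down to single cells
print(estimate(coarse, 0, 0, 1, 1), estimate(fine, 0, 0, 1, 1))
```

On this toy grid the coarse tree estimates the single cell at row 0, column 0 as 2.5, while the true value, which the fine tree recovers, is 1. This is the kind of estimation error the abstract attributes to lossy summarization, and it shrinks as more resolution or a better description of each block is stored.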
Citation:
Francesco Buccafurri, Filippo Furfaro, Domenico Saccà, Cristina Sirangelo, "A Quad-Tree Based Multiresolution Approach for Two-dimensional Summary Data," ssdbm, p. 127, 15th International Conference on Scientific and Statistical Database Management, 2003