Scientific and Statistical Database Management, International Conference on (2003)
Cambridge, Massachusetts, USA
July 9, 2003 to July 11, 2003
Andreas Bauer, T-Systems Nova GmbH
Wolfgang Lehner, Dresden University of Technology
Materialized views are a common means of speeding up queries, mostly aggregation queries, in a data warehouse installation. The problems that accompany materialized aggregate views have triggered a wide variety of proposals, such as picking the optimal set of aggregation combinations, transparently rewriting user queries to take advantage of the summary data, or synchronizing pre-computed summary data as soon as the base data changes. This paper focuses on the problem of view selection in the context of distributed data warehouse architectures. While much research has been done on the view selection problem in the central case, we are not aware of any other work discussing the problem of view selection in distributed data warehouse systems. The paper proposes an extension of the concept of an aggregation lattice to capture the distributed semantics. Moreover, we extend a greedy selection algorithm, based on an adequate cost model, to the distributed case. In a performance study, we finally compare our findings with the approach of applying a selection algorithm locally to each node in a distributed warehouse environment.
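To make the central-case starting point concrete, the following is a minimal sketch of a generic greedy, benefit-based view selection over an aggregation lattice. It is not the distributed algorithm or cost model proposed in the paper. The dimension names (p, s, c), the row counts, the linear cost model (query cost equals the size of the smallest selected view that can answer it), and the function names `answers`, `query_cost`, and `greedy_select` are all assumptions made for illustration.

```python
def answers(view, query):
    """A view can answer a query if it keeps all of the query's grouping dimensions."""
    return query <= view


def query_cost(query, selected, sizes):
    """Assumed linear cost model: rows of the smallest selected view that answers the query."""
    return min(sizes[v] for v in selected if answers(v, query))


def greedy_select(sizes, top, k):
    """Greedily pick up to k views in addition to the base view `top`.

    Each round adds the candidate with the largest total cost reduction
    over all lattice nodes, each node treated as one equally likely query.
    """
    selected = {top}
    for _ in range(k):
        best, best_benefit = None, 0
        for cand in sizes:
            if cand in selected:
                continue
            benefit = sum(
                query_cost(q, selected, sizes) - query_cost(q, selected | {cand}, sizes)
                for q in sizes
            )
            if benefit > best_benefit:
                best, best_benefit = cand, benefit
        if best is None:  # no remaining candidate reduces the cost
            break
        selected.add(best)
    return selected


# Hypothetical lattice over three dimensions with assumed row counts per view.
raw_sizes = {
    "psc": 6_000_000, "pc": 6_000_000, "ps": 800_000, "sc": 6_000_000,
    "p": 200_000, "s": 10_000, "c": 100_000, "": 1,
}
sizes = {frozenset(dims): rows for dims, rows in raw_sizes.items()}
top = frozenset("psc")

chosen = greedy_select(sizes, top, k=2)
print(sorted("".join(sorted(v)) for v in chosen))
```

With these assumed sizes, the first round picks the view grouped by p and s, and the second round picks the view grouped by c. A distributed variant would additionally have to decide on which node each view is placed and account for communication cost, which this sketch deliberately leaves out.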
Andreas Bauer, Wolfgang Lehner, "On Solving the View Selection Problem in Distributed Data Warehouse Architectures", Scientific and Statistical Database Management, International Conference on, p. 43, 2003, doi:10.1109/SSDM.2003.1214953