First IEEE International Symposium on Cluster Computing and the Grid (CCGrid'01)
A Cluster Architecture for Parallel Data Warehousing
Brisbane, Australia
May 15-18, 2001
ISBN: 0-7695-1010-8
We describe the parallel, cluster-based implementation of an algorithm for the computation of a database operator known as the datacube. Though a number of efficient sequential algorithms have recently been proposed for this problem, very little research effort has been expended upon cost-effective parallelization techniques. Our approach builds directly upon the existing sequential proposals and is designed to be both load balanced and communication efficient. We also provide experimental results that demonstrate the viability of our technique under a variety of test conditions. Ultimately, we show that parallel performance relative to the underlying sequential algorithm (speedup) is near optimal.
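For readers unfamiliar with the operator, the following is a minimal sequential sketch of what a datacube computes: one aggregate table per subset of the dimension attributes, giving 2^d group-bys for d dimensions. It is a naive illustration, not the algorithm described in the paper; the function name `datacube`, the sample rows, and the choice of sum as the aggregate are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import combinations


def datacube(rows, dims, measure):
    """Naive datacube: sum `measure` for every subset of `dims`.

    Returns a dict mapping each tuple of grouping dimensions to a dict
    of {group key: summed measure}. Hypothetical helper for illustration.
    """
    cube = {}
    for k in range(len(dims) + 1):
        for subset in combinations(dims, k):
            totals = defaultdict(int)
            for row in rows:
                # The group key is the row's values on the chosen dimensions.
                key = tuple(row[d] for d in subset)
                totals[key] += row[measure]
            cube[subset] = dict(totals)
    return cube


# Hypothetical sample data: two dimensions and one measure.
rows = [
    {"store": "A", "product": "x", "sales": 10},
    {"store": "A", "product": "y", "sales": 5},
    {"store": "B", "product": "x", "sales": 7},
]
cube = datacube(rows, ("store", "product"), "sales")
```

In this sketch each of the 2^d group-bys is computed by an independent scan of the input, which illustrates why the problem can be spread across cluster nodes. How the paper actually partitions and balances that work is not shown here.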
Citation:
Frank Dehne, Todd Eavis, Andrew Rau-Chaplin, "A Cluster Architecture for Parallel Data Warehousing," First IEEE International Symposium on Cluster Computing and the Grid (CCGrid'01), p. 161, 2001