Eighth International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2007)
Parallel Database Join Operations in Heterogeneous Grids
Adelaide, Australia
December 3-6, 2007
ISBN: 0-7695-3049-4
This paper presents an analytical comparison of parallel join algorithms in a generalized multiprocessor framework and a simplified, heterogeneous Grid environment. We develop a concise but comprehensive analytical model for the well-known Hash Join algorithm and compare it to the Nested-Loop and Sort-Merge Join algorithms. We concentrate on a limited number of characteristic parameters to keep the analytical model clear and focused. We argue that a meaningful model can be built upon only three characteristic parameter sets, describing node processing performance, the I/O and the disk bandwidth, which are the parameters for the optimization of the Grid workflow by a smart brokerage mechanism. Based on these results, the paper proves that, with a smart enhancement of the algorithms that exploits the specifics of the Grid, the known performance results for a homogeneous multiprocessor architecture have to be revised for a heterogeneous Grid environment.
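For readers unfamiliar with the algorithms named in the abstract, the following is a minimal, single-node Python sketch of a textbook nested-loop join and an in-memory hash join. It is an illustration only and is not the paper's analytical model or its Grid-aware variants; the function names, the tuple-based relations, and the positional key arguments are assumptions made for this example.

```python
from collections import defaultdict

def nested_loop_join(r, s, key_r, key_s):
    """Compare every tuple of r with every tuple of s: |r| * |s| comparisons."""
    return [(a, b) for a in r for b in s if a[key_r] == b[key_s]]

def hash_join(r, s, key_r, key_s):
    """Build a hash table on r, then probe it once per tuple of s."""
    table = defaultdict(list)
    for a in r:                      # build phase
        table[a[key_r]].append(a)
    # probe phase: only tuples of r with a matching key are touched
    return [(a, b) for b in s for a in table.get(b[key_s], [])]
```

Both functions return the same set of matching pairs, though not necessarily in the same order. The hash join avoids the quadratic number of comparisons by paying for the build phase up front, which is the kind of trade-off an analytical cost model has to capture.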
Citation:
Werner Mach, Erich Schikuta, "Parallel Database Join Operations in Heterogeneous Grids," Eighth International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2007), pp. 236-243, 2007