Subscribe

Issue No.04 - April (2002 vol.13)

pp: 413-431

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.995821

ABSTRACT

<p>In this paper, we present four efficient parallel algorithms for computing a nonequijoin, called <it> range-join</it>, of two relations on N\hbox{-}{\rm dimensional} mesh-connected computers. Range-joins of relations R and S are an important generalization of conventional equijoins and band-joins and are solved by permutation-based approaches in all proposed algorithms. In general, after sorting all subsets of both relations, the proposed algorithms permute every sorted subset of relation S to each processor in turn, where it is joined with the local subset of relation R. To permute the subsets of S efficiently, we propose two data permutation approaches, namely, the <it>shifting</it> approach which permutes the data recursively from lower dimensions to higher dimensions and the <it>Hamiltonian-cycle</it> approach which first constructs a Hamiltonian cycle on the mesh and then permutes the data along this cycle by repeatedly transferring data from each processor to its successor. We apply the shifting approach to meshes with different storage capacities which results in two different join algorithms. The <it>Basic Shifting Join</it> (BASHJ) algorithm can minimize the number of subsets stored temporarily at a processor, but requires a large number of data transmissions, while the <it>Buffering Shifting Join</it> (BUSHJ) algorithm can achieve a high parallelism and minimize the number of data transmissions, but requires a large number of subsets stored at each processor. For constructing a Hamiltonian cycle on a mesh, we propose two different methods which also result in two different join algorithms. The <it>Recursive Hamiltonian-Cycle Join</it> (REHCJ) algorithm uses a single processor to construct a Hamiltonian cycle recursively, while the <it>Parallel Hamiltonian-Cycle Join</it> (PAHCJ) algorithm uses all processors to construct a Hamiltonian cycle in parallel. We analyze and compare these algorithms. The results shows that both Hamiltonian cycle algorithms require less storage and local join operations than the shifting algorithms, but more data movement steps.</p>

INDEX TERMS

analysis of algorithms, data permutation, N\hbox{-}{\rm dimensional} meshes, relational databases, parallel processing, performance, range-join operations

CITATION

Shao Dong Chen, Hong Shen, Rodney Topor, "Permutation-Based Range-Join Algorithms on N-Dimensional Meshes",

*IEEE Transactions on Parallel & Distributed Systems*, vol.13, no. 4, pp. 413-431, April 2002, doi:10.1109/71.995821