Fourth IEEE International Conference on Cluster Computing (CLUSTER'02)
Cluster Based Hybrid Hash Join: Analysis and Evaluation
Chicago, Illinois
September 23-September 26
ISBN: 0-7695-1745-5
The join is the most important, but also the most time consuming operation in relational database systems. We implemented the parallel Hybrid Hash Join algorithm on a PC-cluster architecture and analyzed its performance behavior. We show that off-the-shelf, cost saving cluster systems can build a viable platform for parallel database systems.