Third International Symposium on Parallel and Distributed Computing/Third International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks (ISPDC/HeteroPar'04) Extending Collective Operations with Application Semantics for Improving Multi-Cluster Performance Cork, Ireland July 05-July 07 ISBN: 0-7695-2210-6
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ISPDC.2004.24
We identify two ways of increasing the performance of allreduce-style of collective operations in a multi-cluster with large WAN latencies: (i) hiding latency in system noise, and (ii) conditional-allreduce where knowledge about the application is used to reduce the number of WAN messages. In our multicluster, system noise was not large enough to hide the WAN latency. But, the latency could be hidden using conditional-allreduce, since on many iterations only cluster-local values were needed, and many of the values needed from other clusters were prefetched. A speedup of 2.4 was achieved for a microbenchmark. Prefetching introduced a small overhead in the cluster with the slowest hosts.
Citation:
Lars Ailo Bongo, Otto Anshus, John Markus Bjorndalen, Tore Larsen, "Extending Collective Operations with Application Semantics for Improving Multi-Cluster Performance," ispdc, pp.320-327, Third International Symposium on Parallel and Distributed Computing/Third International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks (ISPDC/HeteroPar'04), 2004 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||