Proceedings Fifth International Symposium on High-Performance Computer Architecture (1999)
Jan. 9, 1999 to Jan. 12, 1999
Yuanyuan Yang, University of Vermont
Jianchao Wang, GTE Laboratories
All-to-all communication is one of the densest communication patterns and occurs in many important applications in parallel computing. In this paper, we present a new all-to-all broadcast algorithm for all-port mesh and torus networks. Unlike existing all-to-all broadcast algorithms, the new algorithm overlaps message switching time with transmission time and achieves optimal transmission time for all-to-all broadcast. In addition, in most cases the total communication delay is within a small constant of the lower bound for all-to-all broadcast. Finally, the algorithm is conceptually simple and symmetric with respect to every message and every node, so it can be implemented easily in hardware and can achieve the optimum in practice.
Parallel computing, collective communication, all-to-all communication, all-to-all broadcast, all-to-all personalized exchange, routing, interprocessor communication.
J. Wang and Y. Yang, "Efficient All-to-All Broadcast in All-Port Mesh and Torus Networks," Proceedings Fifth International Symposium on High-Performance Computer Architecture (HPCA), Orlando, Florida, 1999, p. 290.
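
To make the notion of a transmission-time lower bound in the abstract concrete, the following is a minimal Python sketch. It is not the algorithm of the paper. It assumes a 2D torus in which every node has four ports, unit-size messages, and a cost model that charges one time unit per message unit sent over a link while ignoring startup and switching time. Under those assumptions each node must receive N - 1 messages through four ports, which gives the port-based bound computed by the hypothetical helper `port_lower_bound`. The hypothetical function `two_phase_broadcast` models a naive scheme (row rings first, then column rings carrying bundled row data) so the gap to that bound can be seen.

```python
from math import ceil


def port_lower_bound(num_nodes, ports):
    """Each node must receive num_nodes - 1 unit messages and can accept
    at most `ports` of them per time unit (assumed cost model)."""
    return ceil((num_nodes - 1) / ports)


def ring_steps(n):
    """Steps needed to cover a bidirectional ring of n nodes."""
    return ceil((n - 1) / 2)


def two_phase_broadcast(rows, cols):
    """Naive two-phase all-to-all broadcast on a rows x cols torus.

    Illustrative baseline only, not the paper's algorithm.
    Returns (known, cost): known[node] is the set of source nodes whose
    message that node holds; cost is transmission time in message units.
    """
    known = {(r, c): {(r, c)} for r in range(rows) for c in range(cols)}
    cost = 0

    # Phase 1: row rings. At step t a node receives the unit message that
    # originated t hops away in each direction along its row.
    for t in range(1, ring_steps(cols) + 1):
        for r in range(rows):
            for c in range(cols):
                known[(r, c)].add((r, (c - t) % cols))
                known[(r, c)].add((r, (c + t) % cols))
        cost += 1  # one unit message per link per step

    # Phase 2: column rings. Each node now forwards a bundle holding its
    # whole row (cols unit messages), so each step costs cols time units.
    row_sets = {node: frozenset(s) for node, s in known.items()}
    for t in range(1, ring_steps(rows) + 1):
        for r in range(rows):
            for c in range(cols):
                known[(r, c)] |= row_sets[((r - t) % rows, c)]
                known[(r, c)] |= row_sets[((r + t) % rows, c)]
        cost += cols

    return known, cost


if __name__ == "__main__":
    known, cost = two_phase_broadcast(4, 4)
    print(all(len(s) == 16 for s in known.values()))  # True
    print(cost)                                       # 10
    print(port_lower_bound(16, 4))                    # 4
```

On a 4 x 4 torus the naive scheme uses only two of the four ports at a time and spends 10 time units under this model, against a port-based bound of 4. A scheme that keeps all ports busy in every step would be needed to close that gap.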