2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA) (1999)
Jan. 9, 1999 to Jan. 12, 1999
Jianchao Wang , GTE Laboratories
Yuanyuan Yang , University of Vermont
All-to-all communication is one of the most dense communication patterns and occurs in many important applications in parallel computing. In this paper, we present a new all-to-all broadcast algorithm in all-port mesh and torus networks. Unlike existing all-to-all broadcast algorithms, the new algorithm takes advantage of overlapping of message switching time and transmission time, and achieves optimal transmission time for all-to-all broadcast. In addition, in most cases, the total communication delay is close to the lower bound of all-to-all broadcast within a small constant range. Finally, the algorithm is conceptually simple, and symmetrical for every message and every node so that it can be easily implemented in hardware and achieves the optimum in practice.
Parallel computing, collective communication, all-to-all communication, all-to-all broadcast, all-to-all personalized exchange, routing, interprocessor communication.
Jianchao Wang, Yuanyuan Yang, "Efficient All-to-All Broadcast in All-Port Mesh and Torus Networks", 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA), vol. 00, no. , pp. 290, 1999, doi:10.1109/HPCA.1999.744382