The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.10 - Oct. (2013 vol.62)
pp: 1959-1971
Zhemin Zhang , Stony Brook University, Stony Brook
Zhiyang Guo , Stony Brook University, Stony Brook
Yuanyuan Yang , Stony Brook University, Stony Brook
ABSTRACT
With the development of multiprocessor system on chips (MPSoCs), it is expected that hundreds of computing cores will be operating on a single chip in the near future. This will require high-performance on-chip networks with very low latency to provide a communication substrate for the increasing number of cores. In this paper, we consider Gaussian on-chip networks that are of significant topological advantages over traditional mesh and torus networks in terms of diameter and average hop distance. Many applications on MPSoCs need global data movement and global control to exchange data and synchronize the execution among cores, which require all-to-all broadcast communication. In this paper, we propose an all-to-all broadcast algorithm suitable for on-chip implementation on the Gaussian network topology. The algorithm utilizes controlled message flooding based on a broadcast pattern, which can be described in a formal, generic way for each node in terms of a few simple operations and can be easily built into router hardware. Furthermore, the generic broadcast pattern also ensures a balanced traffic load in all dimensions in the network so that minimum total latency for all-to-all broadcast can be achieved. The algorithm overlaps message switching time with transmission time in a pipelined fashion to further reduce the total communication latency of all-to-all broadcast. Comparison results demonstrate the topological merits of Gaussian networks and ultralow latency of the proposed all-to-all broadcast algorithm.
INDEX TERMS
System-on-a-chip, Network topology, Algorithm design and analysis, Delay, Topology, Clocks, Routing, Gaussian network, Network on chips, all-to-all broadcasting, hardware-based, pipeline, routing, multiprocessor system on chip
CITATION
Zhemin Zhang, Zhiyang Guo, Yuanyuan Yang, "Efficient All-to-All Broadcast in Gaussian On-Chip Networks", IEEE Transactions on Computers, vol.62, no. 10, pp. 1959-1971, Oct. 2013, doi:10.1109/TC.2012.126
REFERENCES
[1] W.J. Dally and B. Towles, "Route Packets, Not Wires: On-Chip Interconnection Networks," Proc. Design Automation Conf. (DAC), pp. 683-689, 2001.
[2] R. Marculescu, U.Y. Ogras, L. Peh, N.E. Jerger, and Y. Hoskote, "Outstanding Research Problems in NoC Design: System, Microarchitecture, and Circuit Perspectives." IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, vol. 28, no. 1, pp. 3-21, Jan. 2009.
[3] D. Bertozzi et al., "NoC Synthesis Flow for Customized Domain Specific Multiprocessor Systems-on-Chip," IEEE Trans. Parallel Distributed Systems, vol. 16, no. 2, pp. 113-129, Feb. 2005.
[4] S.B. Akers and B. Krishnamurthy, "A Group-Theoretic Model for Symmetric Interconnection Networks," IEEE Trans. Computers, vol. 38, no. 4, pp. 555-566, Apr. 1989.
[5] F. Karim, A. Nguyen, and S. Dey, "An Interconnect Architecture for Networking Systems on Chip," IEEE Micro, vol. 22, no. 5, pp. 36-45, Sept. 2002.
[6] Y. Pan, P. Kumar, J. Kim, G. Memik, Y. Zhang, and A. Choudhary, "Firefly: Illuminating Future Network-on-Chip with Nanophotonics Categories and Subject Descriptors," Proc. Int'l Symp. Computer Architecture (ISCA), 2009.
[7] C. Mart, R. Beivide, E. Stafford, M. Moret, and E.M. Gabidulin, "Modeling Toroidal Networks with the Gaussian Integers," IEEE Trans. Computers, vol. 57, no. 8, pp. 1046-1056, Aug. 2008.
[8] C.H. Sequin, "Doubly Twisted Torus Networks for VLSI Processor Arrays," Proc. Eighth Ann. Int'l Symp. Computer Architecture, pp. 471-480, 1981.
[9] J. Duato, S. Yalamanchili, and L. Ni, Interconnection Networks: An Engineering Approach. Morgan Kaufmann, 2003.
[10] C. Calvin, S. Perennes, and D. Trystram, "All-to-All Broadcast in Torus with Wormhole-Like Routing," Proc. Seventh IEEE Symp. Parallel and Distributed Processing, pp. 130-137, 1995.
[11] S. Fujita and M. Yamashita, "Fast Gossiping on Mesh-Bus Computers," IEEE Trans. Computers, vol. 45, no. 11, pp. 1326-1330, Nov. 1996.
[12] B.H. Juurlink, J.F. Sibeyn, and P.S. Rao, "Gossiping on Meshes and Tori," IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 6, pp. 513-525, June 1998.
[13] U. Meyer and J.F. Sibeyn, "Time-Independent Gossiping on Full-Port Tori," Technical Report MPI-I-98-1-014, Max-Planck Institutfur Informatik, Sept. 1998.
[14] M. Soch and P. Tvrdik, "Time-Optimal Gossip of Large Packets in Noncombining 2D Tori and Meshes," IEEE Trans. Parallel and Distributed Systems, vol. 10, no. 12, pp. 1252-1261, Dec. 1999.
[15] Y. Yang and J. Wang, "Pipelined All-to-All Broadcast in All-Port Meshes and Tori," IEEE Trans. Computers, vol. 50, no. 10, pp. 1020-1032, Oct. 2001.
[16] Y. Yang and J. Wang, "Near-Optimal All-to-All Broadcast in Multidimensional All-Port Meshes and Tori," IEEE Trans. Parallel and Distributed Systems, vol. 13, no. 2, pp. 128-141, Feb. 2002.
[17] M. Flahive and B. Bose, "The Topology of Gaussian and Eisenstein-Jacobi Interconnection Networks," IEEE Trans. Parallel and Distributed Systems, vol. 21, no. 8, pp. 1132-1142, Aug. 2010.
[18] T. Schonwald, J. Zimmermann, O. Bringmann, and W. Rosenstiel, "Network-on-Chip Architecture Exploration Framework," Proc. 12th Euromicro Conf. Digital System Design, Architectures, Methods and Tools, pp. 375-382, 2009.
[19] S. Hassoun, C.J. Alpert, and M. Thiagarajan, "Optimal Buffered Routing Path Constructions for Single and Multiple Clock Domain Systems," Proc. IEEE/ACM Int'l Conf. Computer-Aided Design, pp. 247-253, Nov. 2002.
[20] P. Bogdan, T. Dumitras, and R. Marculescu, "Stochastic Communication: A New Paradigm for Fault-Tolerant Networks-on-Chip," VLSI Design, vol. 2007, article 95348, 2007.
[21] U.Y. Ogras and R. Marculescu, "It's a Small World After All": Noc Performance Optimization via Long-Range Link Insertion," IEEE Trans. Very Large Scale Integration Systems, vol. 14, no. 7, pp. 693-706, July 2006.
[22] G. Michelogiannakis, D. Sanchez, W.J. Dally, and C. Kozyrakis, "Evaluating Bufferless Flow Control for On-Chip Networks," Proc. Fourth ACM/IEEE Int'l Symp. Networks-on-Chip (NOCS), pp. 9-16, May 2010.
47 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool