Issue No. 01 - January/February (1998 vol. 18)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/40.653035
This article describes a protocol for a general-purpose cluster communication system that supports multiprogramming with virtual networks, direct and protected network access, reliable message delivery using message time-outs and retransmissions, a powerful return-to-send error model for applications, and automatic network mapping. The protocols use simple, low-cost mechanisms that exploit properties of our interconnect without limiting flexibility, usability, or robustness. We have implemented the protocols in an active message communication system that runs a network of 100+ Sun UltraSPARC workstations interconnected with 40 Myrinet switches. A progression of microbenchmarks demonstrate good performance - 42 microsecond round-trip times and 31 MB/s node-to-node bandwidth - as well as scalability under heavy load and graceful performance degradation in the presence of high contention.
Network transport protocols, Myrinet LAN, cluster communications, multiprogramming
B. N. Chun, A. M. Mainwaring and D. E. Culler, "Virtual Network Transport Protocols for Myrinet," in IEEE Micro, vol. 18, no. , pp. 53-63, 1998.