2008 IEEE International Conference on Cluster Computing (2008)
Sept. 29, 2008 to Oct. 1, 2008
E. Brunet , LaBRI, Univ. Bordeaux 1, Talence
F. Trahay , LaBRI, Univ. Bordeaux 1, Talence
A. Denis , LaBRI, INRIA Bordeaux - Sud-Ouest, Talence
The current trend in clusters architecture leads toward a massive use of multicore chips. This hardware evolution raises bottleneck issues at the network interface level. The use of multiple parallel networks allows to overcome this problem as it provides an higher aggregate bandwidth. But this bandwidth remains theoretical as only a few communication libraries are able to exploit multiple networks. In this paper, we present an optimization strategy for the NEWMADELEINE communication library. This strategy is able to efficiently exploit parallel interconnect links. By sampling each networkpsilas capabilities, it is possible to estimate a transfer duration a priori. Splitting messages and sending chunks of messages over parallel links can thus be performed efficiently to reach the theoretical aggregate bandwidth. NEWMADELEINE is multithreaded and exploits multicore chips to send small packets, that involve CPU-consuming copies, in parallel.
parallel processing, multiprocessing systems, multi-threading
E. Brunet, F. Trahay and A. Denis, "A multicore-enabled multirail communication engine," 2008 IEEE International Conference on Cluster Computing(CLUSTER), Tsukuba Japan, 2009, pp. 316-321.