The Community for Technology Leaders
2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (2013)
Belfast, United Kingdom United Kingdom
Feb. 27, 2013 to Mar. 1, 2013
ISSN: 1066-6192
ISBN: 978-1-4673-5321-2
pp: 249-253
This paper describes a delegation based high throughput MPIcommunication mechanism under tough memory utilization constrains on a many-core oriented hybrid parallel computer. Towards the Exascale era, hybrid parallel computers consisting of many-core and multi-core architectures both on the same node are focused. Although many-core architectures such as GPU or Intel MIC has high potential in computing power by the large number of computing cores, per-core computing power is lower than that of multi-core CPUs. Furthermore, available memory resources for the many-core CPUs are quite smaller than those for multi-core CPUs. Thus we may have a sort of penalty in memory utilization in MPI communications when we utilize a normal MPI library. Here we deploy a delegatee process on each node to merge MPI communications and minimize memory utilization for an MPI communicator. Another advantage of the delegatee process scheme is minimization of memory utilization on many-core CPUs by delegating MPI requests to associated delegatee process on multi-core CPUs. In this paper, we show performance advantages and effective resource utilization by our proposed scheme compared with the original MPI implementation.
light-weight OS kernel, many-core architecture, MPI delegation, MPI communicator, collective communication

K. Yoshinaga, Y. Tsujita, A. Hori, M. Sato, M. Namiki and Y. Ishikawa, "A Delegation Mechanism on Many-Core Oriented Hybrid Parallel Computers for Scalability of Communicators and Communications in MPI," 2013 21st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2013)(PDP), Belfast, 2013, pp. 249-253.
168 ms
(Ver 3.3 (11022016))