loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers
A Hardware Acceleration Unit for MPI Queue Processing
Denver, Colorado
April 04-April 08
ISBN: 0-7695-2312-9
Keith D. Underwood, Sandia National Laboratories, Albuquerque, NM
K. Scott Hemmert, Sandia National Laboratories, Albuquerque, NM
Arun Rodrigues, Sandia National Laboratories, Albuquerque, NM
Richard Murphy, Sandia National Laboratories, Albuquerque, NM
Ron Brightwell, Sandia National Laboratories, Albuquerque, NM
With the heavy reliance of modern scientific applications upon the MPI Standard, it has become critical for the implementation of MPI to be as capable and as fast as possible. This has led some of the fastest modern networks to introduce the capability to offload aspects of MPI processing to an embedded processor on the network interface. With this important capability has come significant performance implications. Most notably, the time to process long queues of posted receives or unexpected messages is substantially longer on embedded processors. This paper presents an associative list matching structure to accelerate the processing of moderate length queues in MPI. Simulations are used to compare the performance of an embedded processor augmented with this capability to a baseline implementation. The proposed enhancement significantly reduces latency for moderate length queues while adding virtually no overhead for extremely short queues.
Citation:
Keith D. Underwood, K. Scott Hemmert, Arun Rodrigues, Richard Murphy, Ron Brightwell, "A Hardware Acceleration Unit for MPI Queue Processing," ipdps, vol. 1, pp.96b, 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers, 2005
Usage of this product signifies your acceptance of the Terms of Use.