14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05)
HUNTing the Overlap
St. Louis, Missouri
September 17-September 21
ISBN: 0-7695-2429-X
Costin Iancu, Computational Research Division, Lawrence Berkeley National Laboratory
Parry Husbands, Computational Research Division, Lawrence Berkeley National Laboratory
Paul Hargrove, Computational Research Division, Lawrence Berkeley National Laboratory

Hiding communication latency is an important optimization for parallel programs. Programmers or compilers achieve this by using non-blocking communication primitives and overlapping communication with computation or other communication operations. Using non-blocking communication raises two issues: performance and programmability. In terms of performance, optimizers need to find a good communication schedule and are sometimes constrained by lack of full application knowledge. In terms of programmability, efficiently managing non-blocking communication can prove cumbersome for complex applications. In this paper we present the design principles of HUNT, a runtime system designed to search for and exploit some of the available overlap present at execution time in UPC programs. Using virtual memory support, our runtime implements demand-driven synchronization for data involved in communication operations. It also employs message decomposition and scheduling heuristics to transparently improve the non-blocking behavior of applications. We provide a user-level implementation of HUNT on a variety of modern high-performance computing systems. Results indicate that our approach is successful in finding some of the overlap available at execution time. While system and application characteristics influence performance, perhaps the determining factor is the time taken by the CPU to execute a signal handler. Demand-driven synchronization at execution time eliminates the need for the explicit management of non-blocking communication. Besides increasing programmer productivity, this feature also simplifies compiler analysis for communication optimizations.
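
The idea of demand-driven synchronization combined with message decomposition can be illustrated with a small simulation. This is only a sketch under stated assumptions and not HUNT's implementation: the abstract describes a UPC runtime that relies on virtual memory support and a signal handler, whereas the Python below replaces that trap with an explicit check in `__getitem__` and uses a thread pool to stand in for the network. The names `LazyBuffer`, `get_nb`, and `demo` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


class LazyBuffer:
    """Destination buffer whose chunks are synchronized on first access.

    Simulation only: a real runtime would trap the first touch through
    memory protection; here the check is explicit in __getitem__.
    """

    def __init__(self, size, chunk, pool):
        self.data = [0] * size
        self.chunk = chunk
        self.pool = pool
        self.pending = {}  # chunk index -> Future of the in-flight transfer
        self.syncs = 0     # number of demand-driven waits performed

    def _copy(self, start, piece):
        # Same-length slice assignment, so the list never changes size.
        self.data[start:start + len(piece)] = piece

    def get_nb(self, source):
        """Start a non-blocking transfer, decomposed into one task per chunk."""
        assert len(source) <= len(self.data)
        for start in range(0, len(source), self.chunk):
            piece = source[start:start + self.chunk]
            future = self.pool.submit(self._copy, start, piece)
            self.pending[start // self.chunk] = future

    def __getitem__(self, i):
        # Synchronize only the chunk that holds element i, and only once.
        future = self.pending.pop(i // self.chunk, None)
        if future is not None:
            future.result()
            self.syncs += 1
        return self.data[i]


def demo():
    with ThreadPoolExecutor(max_workers=2) as pool:
        buf = LazyBuffer(size=8, chunk=4, pool=pool)
        buf.get_nb(list(range(10, 18)))  # returns without waiting
        first = buf[1]                   # waits for chunk 0 only
        syncs_after_first = buf.syncs
        second = buf[5]                  # waits for chunk 1
        return first, syncs_after_first, second, buf.syncs
```

In this sketch the caller never issues an explicit wait: the first read of an element synchronizes only the chunk that contains it, so computation on early chunks can proceed while later chunks are still in flight.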

Citation:
Costin Iancu, Parry Husbands, Paul Hargrove, "HUNTing the Overlap," 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05), pp. 279-290, 2005