IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems (TPDS) is a scholarly archival journal published monthly. Parallelism and distributed computing are foundational research and technology to rapidly advance computer systems and their applications. Read the full scope of TPDS
IEEE Transactions on Parallel and Distributed Systems (TPDS) has moved to the OnlinePlus publication model.
From the May 2015 Issue
Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications
By Javier Cabezas, Isaac Gelado, John E. Stone, Nacho Navarro, David B. Kirk, and Wen-mei Hwu
Heterogeneous parallel computing applications often process large data sets that require multiple GPUs to jointly meet their needs for physical memory capacity and compute throughput. However, the lack of high-level abstractions in previous heterogeneous parallel programming models force programmers to resort to multiple code versions, complex data copy steps and synchronization schemes when exchanging data between multiple GPU devices, which results in high software development cost, poor maintainability, and even poor performance. This paper describes the HPE runtime system, and the associated architecture support, which enables a simple, efficient programming interface for exchanging data between multiple GPUs through either interconnects or cross-node network interfaces. The runtime and architecture support presented in this paper can also be used to support other types of accelerators. We show that the simplified programming interface reduces programming complexity. The research presented in this paper started in 2009. It has been implemented and tested extensively in several generations of HPE runtime systems as well as adopted into the NVIDIA GPU hardware and drivers for CUDA 4.0 and beyond since 2011. The availability of real hardware that support key HPE features gives rise to a rare opportunity for studying the effectiveness of the hardware support by running important benchmarks on real runtime and hardware. Experimental results show that in a exemplar heterogeneous system, peer DMA and double-buffering, pinned buffers, and software techniques can improve the inter-accelerator data communication bandwidth by 2$\times$ . They can also improve the execution speed by 1.6$\times$ for a 3D finite difference, 2.5 $\times$ for 1D FFT, and 1.6$\times$ for merge sort, all measured on real hardware. The proposed architecture support enables the HPE runtime to transparently deploy these optimizations under simple portable user code, allowing system designers to freely employ devices of different capabilities. We further argue that simple interfaces such as HPE are needed for most applications to benefit from advanced hardware features in practice.
Editorials and Announcements
- EICs Undergoing Reappointment for 2016-2017 Terms: IEEE Computer Society publications have editors in chief who are currently standing for reappointment to a second two-year term. The Publications Board invites comments upon the tenures of the individual editors. Please click here for more details.
- According to Thomson Reuters' 2013 Journal Citation Report, TPDS has an impact factor of 2.173.
- TPDS celebrates its 25th Anniversary. Editor-in-Chief David A. Bader says, "Congratulations to TPDS on its Silver Jubilee! For 25 years, TPDS has been the parallel and distributed computing community's flagship journal for research breakthroughs!"
- Get Your Journals as eBooks for Free
- Editor's Note (Jan 2015)
- State of the Journal (Jan 2014)
- Editor's Note: EIC Farewell and New EIC Introduction (Dec 2013)
- Editor's Note (Nov 2013)
- Editor's Note (Jan 2013)
- Editor's Note (April 2012)
- Editor's Note (January 2012)
- Editorial: Media Center (November 2011)
- Editor's Note: How to Write Research Articles in Computing and Engineering Disciplines by Ivan Stojmenovic
- Full Supplemental PDF of Editor's Note: How to Write Research Articles in Computing and Engineering Disciplines by Ivan Stojmenovic and Veljko Milutinovic (PDF)
- Special Issue on Trust, Security, and Privacy in Parallel and Distributed Systems (Feb 2014)
- Special Issue on Cloud Computing (June 2013)
- Special Issue on Cyber-Physical Systems (CPS) (Sept 2012)
- Special Section on Many-Task Computing (June 2011)
Access recently published TPDS articles
Subscribe to the RSS feed of latest TPDS content added to the digital library.
Sign up for the Transactions Connection newsletter.
Listen to the OnlinePlus Podcast: Computer Society Publishing—two more titles migrate to OnlinePlus™ in 2012.
In this podcast, VP of Publications, David Alan Grier talks about Transactions on Mobile Computing and Transactions on Parallel and Distributed Systems migrating to OnlinePlus™.
TPDS is indexed in ISI