IEEE Transactions on Parallel and Distributed Systems

IEEE Transactions on Parallel and Distributed Systems (TPDS) is a scholarly archival journal published monthly. Parallelism and distributed computing are foundational research and technology to rapidly advance computer systems and their applications. Read the full scope of TPDS.


Expand your horizons with Colloquium, a monthly survey of abstracts from all CS transactions! Replaces OnlinePlus in January 2017.


From the March 2019 Issue

Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast 

By Ching-Hsiang Chu, Xiaoyi Lu, Ammar A. Awan, Hari Subramoni, Bracy Elton, and Dhabaleswar K. Panda

Free Featured Article
Broadcast is a widely used operation in many streaming and deep learning applications to disseminate large amounts of data on emerging heterogeneous High-Performance Computing (HPC) systems. However, traditional broadcast schemes do not fully utilize hardware features for Graphics Processing Unit (GPU)-based applications. In this paper, a model-oriented analysis is presented to identify performance bottlenecks of existing broadcast schemes on GPU clusters. Next, streaming-based broadcast schemes are proposed to exploit InfiniBand hardware multicast (IB-MCAST) and NVIDIA GPUDirect technology for efficient message transmission. The proposed designs are evaluated in the context of using Message Passing Interface (MPI) based benchmarks and applications. The experimental results indicate improved scalability and up to 82 percent reduction of latency compared to the state-of-the-art solutions in
the benchmark-level evaluation. Furthermore, compared to the state-of-the-art, the proposed design yields stable higher throughput for a synthetic streaming workload, and 1.3x faster training time for a deep learning framework.
 

download PDF View the PDF of this article      csdl View this issue in the digital library


Editorials and Announcements

Announcements

  • We are pleased to announce that Manish Parashar, a Distinguished Professor of Computer Science at Rutgers, The State University of New Jersey University, has been selected as the new Editor-in-Chief of the IEEE Transactions on Parallel and Distributed Systems starting in 2018.
  • We are pleased to announce that Xian-He Sun, a Distinguished Professor of Computer Science at The Illinois Institute of Technology, has been selected as the new Associate Editor-in-Chief of the IEEE Transactions on Parallel and Distributed Systems starting in 2018.
  • TPDS now offers authors access to Code Ocean. Code Ocean is a cloud-based executable research platform that allows authors to share their algorithms in an effort to make the world’s scientific code more open and reproducible. Learn more or sign up for free.
  • According to Clarivate Analytics' 2016 Journal Citation Report, TPDS has an impact factor of 4.181.

Editorials


Guest Editorials


Reviewers List


Annual Index


Access recently published TPDS articles

RSS Subscribe to the RSS feed of recently published TPDS content

mail icon Sign up for e-mail notifications through IEEE Xplore Content Alerts

preprints icon View TPDS preprints in the Computer Society Digital Library


TPDS is indexed in ISI