From the March 2014 Issue
An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs
By Jing Wu, Joseph JaJa, and Elias Balaras
A highly multithreaded FFT-based direct Poisson solver that makes effective use of the capabilities of the current NVIDIA graphics processing units (GPUs) is presented. Our algorithms carefully manage the multiple layers of the memory hierarchy of the GPUs such that almost all the global memory accesses are coalesced into 128-byte device memory transactions, and all computations are carried out directly on the registers. A new strategy to interleave the FFT computation along each dimension with other computations is used to minimize the total number of accesses to the 3D grid. We illustrate the performance of our algorithms on the NVIDIA Tesla and Fermi architectures for a wide range of grid sizes, up to the largest size that can fit on the device memory ( $512\times 512\times 512$ on the Tesla C1060/C2050 and $512\times 256\times 256$ on the GeForce GTX 280/480). We achieve up to 140 GFLOPS and a bandwidth of 70 GB/s on the Tesla C1060, and up to 375 GFLOPS with a bandwidth of 120GB/s on the GTX 480. The performance of our algorithms is superior to what can be achieved using the CUDA FFT library in combination with well-known parallel algorithms for solving tridiagonal linear systems of equations.
Editorials and Announcements
- TPDS celebrates its 25th Anniversary. Editor-in-Chief David A. Bader says, "Congratulations to TPDS on its Silver Jubilee! For 25 years, TPDS has been the parallel and distributed computing community's flagship journal for research breakthroughs!"
- Get Your Journals as eBooks for Free
- Print on Demand is Now Available for OnlinePlus Titles
- eBooks of issues of TPDS can now be downloaded from the Computer Society Digital Library
- State of the Journal (Jan 2014)
- Editor's Note: EIC Farewell and New EIC Introduction (Dec 2013)
- Editor's Note (Nov 2013)
- Editor's Note (Jan 2013)
- Editor's Note (April 2012)
- Editor's Note (January 2012)
- Editorial: Media Center (November 2011)
- Editor's Note: How to Write Research Articles in Computing and Engineering Disciplines by Ivan Stojmenovic
- Full Supplemental PDF of Editor's Note: How to Write Research Articles in Computing and Engineering Disciplines by Ivan Stojmenovic and Veljko Milutinovic (PDF)
- Special Issue on Trust, Security, and Privacy in Parallel and Distributed Systems (Feb 2014)
- Special Issue on Cloud Computing (June 2013)
- Special Issue on Cyber-Physical Systems (CPS) (Sept 2012)
- Special Section on Many-Task Computing (June 2011)
Access recently published TPDS articles
Subscribe to the RSS feed of latest TPDS content added to the digital library.
Sign up for the Transactions Connection newsletter.
A word from the Editor-in-Chief Ivan Stojmenovic
Listen to the OnlinePlus Podcast: Computer Society Publishing—two more titles migrate to OnlinePlus™ in 2012.
In this podcast, VP of Publications, David Alan Grier talks about Transactions on Mobile Computing and Transactions on Parallel and Distributed Systems migrating to OnlinePlus™.
TPDS is indexed in ISI
IEEE Transactions on Parallel and Distributed Systems (TPDS) is a scholarly archival journal published monthly. Parallelism and distributed computing are foundational research and technology to rapidly advance computer systems and their applications.
Read the full scope of TPDS