Issue No. 01 - January (2011 vol. 22)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2010.125
Rick Weber , University of Tennessee, Knoxville
Akila Gothandaraman , University of Pittsburgh, Pittsburgh
Robert J. Hinde , University of Tennessee, Knoxville
Gregory D. Peterson , University of Tennessee, Knoxville
Multicore processors and a variety of accelerators have allowed scientific applications to scale to larger problem sizes. We present a performance, design methodology, platform, and architectural comparison of several application accelerators executing a Quantum Monte Carlo application. We compare the application's performance and programmability on a variety of platforms including CUDA with Nvidia GPUs, Brook+ with ATI graphics accelerators, OpenCL running on both multicore and graphics processors, C++ running on multicore processors, and a VHDL implementation running on a Xilinx FPGA. We show that OpenCL provides application portability between multicore processors and GPUs, but may incur a performance cost. Furthermore, we illustrate that graphics accelerators can make simulations involving large numbers of particles feasible.
Accelerator, OpenCL, FPGA, GPU, multicore, CUDA, computational science.
G. D. Peterson, A. Gothandaraman, R. Weber and R. J. Hinde, "Comparing Hardware Accelerators in Scientific Applications: A Case Study," in IEEE Transactions on Parallel & Distributed Systems, vol. 22, no. , pp. 58-68, 2010.