Rome
May 23, 2009 to May 29, 2009
ISBN: 978-1-4244-3751-1
pp: 1-10
Olaf Schenk, High Performance and Web Computing Group, Computer Science Dept., University of Basel, Switzerland
Esra Neufeld, IT'IS Foundation, ETH Zurich, Switzerland
Peter Messmer, Tech-X Corporation, Boulder CO, USA
Matthias Christen, High Performance and Web Computing Group, Computer Science Dept., University of Basel, Switzerland
ABSTRACT
Novel micro-architectures, including the Cell Broadband Engine Architecture and graphics processing units, are attractive platforms for compute-intensive simulations. This paper focuses on stencil computations arising in the context of a biomedical simulation. It presents performance benchmarks on both the Cell BE and GPUs and contrasts them with a benchmark on a traditional CPU system. Because stencil computations have low arithmetic intensity, they typically reach only a fraction of the peak performance of the compute hardware. An algorithm is presented that reduces the bandwidth requirements, and thereby improves performance, by exploiting temporal locality of the data. We report on performance improvements over CPU implementations.
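The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of exploiting temporal locality in a stencil computation. It is not the authors' method. It assumes a 1D three-point averaging stencil with fixed end points, and the function names (`step`, `naive`, `time_blocked`) and parameters (`steps`, `tile`) are hypothetical. Each tile is loaded once together with a halo of `steps` cells per side, advanced several time steps locally, and then written back, so the full array is read once instead of once per time step.

```python
def step(u):
    """One Jacobi sweep of a 3-point averaging stencil; end points stay fixed."""
    n = len(u)
    v = u[:]
    for i in range(1, n - 1):
        v[i] = (u[i - 1] + u[i] + u[i + 1]) / 3.0
    return v


def naive(u, steps):
    """Reference: sweep the whole array once per time step."""
    for _ in range(steps):
        u = step(u)
    return u


def time_blocked(u, steps, tile):
    """Overlapped temporal blocking (an assumed, generic scheme).

    For each output tile, copy the tile plus a halo of `steps` cells on
    each side, run all `steps` sweeps on the local copy, and keep only
    the tile interior. Halo cells are recomputed redundantly, trading
    extra arithmetic for fewer passes over the full array.
    """
    n = len(u)
    out = u[:]
    for start in range(0, n, tile):
        end = min(start + tile, n)
        lo = max(start - steps, 0)   # clamp halo at the left boundary
        hi = min(end + steps, n)     # clamp halo at the right boundary
        local = u[lo:hi]
        for _ in range(steps):
            local = step(local)
        # Stale values creep inward one cell per sweep from each cut edge,
        # so after `steps` sweeps only the halo is contaminated.
        out[start:end] = local[start - lo:end - lo]
    return out


if __name__ == "__main__":
    data = [float(i % 7) for i in range(50)]
    a = naive(data, 4)
    b = time_blocked(data, 4, 8)
    print(max(abs(x - y) for x, y in zip(a, b)))
```

The halo width must be at least the number of fused time steps. Larger `steps` values cut memory traffic further but increase the redundant work per tile, which is the trade-off any such scheme has to balance on bandwidth-limited hardware.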
CITATION
Olaf Schenk, Esra Neufeld, Peter Messmer, Matthias Christen, "Parallel data-locality aware stencil computations on modern micro-architectures", 2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2009, pp. 1-10, doi:10.1109/IPDPS.2009.5161031