Performance Evaluation of Parallel Large-Scale Lattice Boltzmann Applications on Three Supercomputing Architectures
SC Conference (2004)
Nov. 6, 2004 to Nov. 12, 2004
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/SC.2004.37
Thomas Pohl , University of Erlangen
Frank Deserno , University of Erlangen
Nils Thürey , University of Erlangen
Ulrich Rüde , University of Erlangen
Peter Lammers , Höchstleistungsrechenzentrum Stuttgart
Gerhard Wellein , Regionales Rechenzentrum Erlangen
Thomas Zeiser , Regionales Rechenzentrum Erlangen
Computationally intensive programs with moderate communication requirements such as CFD codes suffer from the standard slow interconnects of commodity "off the shelf" (COTS) hardware. We will introduce different large-scale applications of the Lattice Boltzmann Method (LBM) in fluid dynamics, material science, and chemical engineering and present results of the parallel performance on different architectures. It will be shown that a high speed communication network in combination with an efficient CPU is mandatory in order to achieve the required performance. An estimation of the necessary CPU count to meet the performance of 1 TFlop/s will be given as well as a prediction as to which architecture is the most suitable for LBM. Finally, ratios of costs to application performance for tailored HPC systems and COTS architectures will be presented.
U. Rüde et al., "Performance Evaluation of Parallel Large-Scale Lattice Boltzmann Applications on Three Supercomputing Architectures," SC Conference(SC), Pittsburgh, Pennsylvania, 2004, pp. 21.