Parallel and Distributed Systems, International Conference on (2010)
Dec. 8, 2010 to Dec. 10, 2010
The recent advent of multicore processors, and especially the introduction of many-core GPUs, opens new horizons to large-scale, high-resolution, simulations for a broad range of scientific fields. Among them, the scientific area of CFD appears to be one of the candidates that could significantly benefit from the utilization of many-core GPUs. In o rder to investigate such a potential, we evaluate the performance of a high-order accurate method for the simulation of compressible flows. Current implementation is taking place on a GPU cluster. Nevertheless, a novel approach is followed concerning the utilization of GPU clusters that does not involve explicit message passing. Instead, the presented implementation resides on Software Distributed Shared Memory (SDSM) to propagate changes across the simulation phases. The first results prove to be emboldening and lay grounds for further research along the use of shared memory abstraction in order to utilize future GPU clusters.
GPU Clusters, SDSM, OpenMP, CUDA, WENO
E. D. Polychronopoulos, J. A. Ekaterinaris and K. I. Karantasis, "Acceleration of a High Order Accurate Method for Compressible Flows on SDSM Based GPU Clusters," 2010 IEEE 16th International Conference on Parallel and Distributed Systems (ICPADS 2010)(ICPADS), Shanghai, 2010, pp. 460-467.