16th Annual International Symposium on High Performance Computing Systems and Applications
Numerical Applications and Sub-Word Parallelism: The NAS Benchmarks on a Pentium 4
Moncton, NB, Canada
June 16-June 19
ISBN: 0-7695-1626-2
We examine the impact of Pentium 4 SIMD instructions on the Fortran and C versions of the NAS benchmarks, either by compiler vectorization or by assembly code in-lining. If few functions generally profit from the SIMD operations, the ones using complex numbers or random number generators can be efficiently accelerated.