2006 IEEE International Conference on Cluster Computing The Influence of Operating Systems on the Performance of Collective Operations at Extreme Scale Barcelona September 25-September 28 ISBN: 1-4244-0327-8
We investigate operating system noise, which we identify as one of the main reasons for a lack of synchronicity in parallel applications. Using a microbenchmark, we measure the noise on several contemporary platforms and find that, even with a general-purpose operating system, noise can be limited if certain precautions are taken. We then inject artificially generated noise into a massively parallel system and measure its influence on the performance of collective operations. Our experiments indicate that on extreme-scale platforms, the performance is correlated with the largest interruption to the application, even if the probability of such an interruption is extremely small. We demonstrate that synchronizing the noise can significantly reduce its negative influence
Index Terms:
extreme-scale platforms, operating system noise, parallel applications, microbenchmark
Citation:
P. Beckman, K. Iskra, K. Yoshii, S. Coghlan, "The Influence of Operating Systems on the Performance of Collective Operations at Extreme Scale," cluster, pp.1-12, 2006 IEEE International Conference on Cluster Computing, 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||