|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms
Oct. 2012 (vol. 23 no. 10)
pp. 1915-1922
| ASCII Text | x | ||
| Kamesh Madduri, Jimmy Su, Samuel Williams, Leonid Oliker, Stéphane Ethier, Katherine Yelick, "Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms," IEEE Transactions on Parallel and Distributed Systems, vol. 23, no. 10, pp. 1915-1922, Oct., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TPDS.2012.28, author = {Kamesh Madduri and Jimmy Su and Samuel Williams and Leonid Oliker and Stéphane Ethier and Katherine Yelick}, title = {Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms}, journal ={IEEE Transactions on Parallel and Distributed Systems}, volume = {23}, number = {10}, issn = {1045-9219}, year = {2012}, pages = {1915-1922}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPDS.2012.28}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Parallel and Distributed Systems TI - Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms IS - 10 SN - 1045-9219 SP1915 EP1922 EPD - 1915-1922 A1 - Kamesh Madduri, A1 - Jimmy Su, A1 - Samuel Williams, A1 - Leonid Oliker, A1 - Stéphane Ethier, A1 - Katherine Yelick, PY - 2012 KW - Heart KW - Synchronization KW - Optimization KW - Interpolation KW - Multicore processing KW - Kernel KW - Computational modeling KW - lock free KW - Particle mesh KW - particle-to-grid interpolation KW - multicore performance tuning KW - synchronization KW - atomic VL - 23 JA - IEEE Transactions on Parallel and Distributed Systems ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2012.28
Web Extra: View Supplemental Material(PDF)
We are now in the multicore revolution which is witnessing a rapid evolution of architectural designs due to power constraints and correspondingly limited microprocessor clock speeds. Understanding how to efficiently utilize these systems in the context of demanding numerical algorithms is an urgent challenge to meet the ever growing computational needs of high-end computing. In this work, we examine multicore parallel optimization of the particle-to-grid interpolation step in particle-mesh methods, an inherently complex optimization problem due to its low computation intensity, irregular data accesses, and potential fine-grained data hazards. Our evaluated kernels are derived from two important numerical computations: a biological simulation of the heart using the Immersed Boundary (IB) method, and a Gyrokinetic Particle-in-Cell (PIC)-based application for studying fusion plasma microturbulence. We develop several novel synchronization and grid decomposition schemes, as well as low-level optimization techniques to maximize performance on three modern multicore platforms: Intel's Xeon X5550 (Nehalem), AMD's Opteron 2356 (Barcelona), and Sun's UltraSparc {\rm T}2{+} (Niagara). Results show that our optimizations lead to significant performance improvements, achieving up to a 5.6{\times} speedup compared to the reference parallel implementation. Our work also provides valuable insight into the design of future autotuning frameworks for particle-to-grid interpolation on next-generation systems.
Index Terms:
Heart,Synchronization,Optimization,Interpolation,Multicore processing,Kernel,Computational modeling,lock free,Particle mesh,particle-to-grid interpolation,multicore performance tuning,synchronization,atomic
Citation:
Kamesh Madduri, Jimmy Su, Samuel Williams, Leonid Oliker, Stéphane Ethier, Katherine Yelick, "Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms," IEEE Transactions on Parallel and Distributed Systems, vol. 23, no. 10, pp. 1915-1922, Oct. 2012, doi:10.1109/TPDS.2012.28
Usage of this product signifies your acceptance of the Terms of Use.

