Fifth IEEE International Conference on Cluster Computing (CLUSTER'03)
Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost
Hong Kong
December 1-4, 2003
ISBN: 0-7695-2066-9
BibTeX:
@inproceedings{10.1109/CLUSTR.2003.1253341,
  author    = {Surendra Byna and William Gropp and Xian-He Sun and Rajeev Thakur},
  title     = {Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost},
  booktitle = {Fifth IEEE International Conference on Cluster Computing (CLUSTER'03)},
  year      = {2003},
  isbn      = {0-7695-2066-9},
  pages     = {412},
  doi       = {http://doi.ieeecomputersociety.org/10.1109/CLUSTR.2003.1253341},
  publisher = {IEEE Computer Society},
  address   = {Los Alamitos, CA, USA},
}
The MPI Standard supports derived datatypes, which allow users to describe noncontiguous memory layout and communicate noncontiguous data with a single communication function. This feature enables an MPI implementation to optimize the transfer of noncontiguous data. In practice, however, few MPI implementations implement derived datatypes in a way that performs better than what the user can achieve by manually packing data into a contiguous buffer and then calling an MPI function. In this paper, we present a technique for improving the performance of derived datatypes by automatically using packing algorithms that are optimized for memory-access cost. The packing algorithms use memory-optimization techniques that the user cannot apply easily without advanced knowledge of the memory architecture. We present performance results for a matrix-transpose example that demonstrate that our implementation of derived datatypes significantly outperforms both manual packing by the user and the existing derived-datatype code in the MPI implementation (MPICH).
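To make the contrast in the abstract concrete, the following is a minimal sketch in C of what "manual packing" looks like for a matrix transpose, next to a cache-blocked variant. It is an illustration under stated assumptions, not the packing algorithm from the paper: the function names, the `TILE` value, and the choice of simple loop tiling are all hypothetical. MPI calls are omitted so the sketch compiles without an MPI installation; in a real program the packed buffer would be handed to a send routine, or the layout would instead be described once with a derived datatype (for example via `MPI_Type_vector`) and the packing left to the MPI library.

```c
#include <stddef.h>

/* Hypothetical tile edge (in elements). A real implementation would
 * choose this from the cache parameters of the target machine. */
#define TILE 32

/* Naive manual pack: copy the transpose of an n x n row-major matrix
 * into a contiguous buffer. Each read from src jumps n doubles ahead,
 * so for large n nearly every read touches a different cache line. */
void pack_transpose_naive(const double *src, double *buf, size_t n)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            buf[i * n + j] = src[j * n + i];
}

/* Blocked pack: produces the same buffer, but visits the matrix in
 * TILE x TILE tiles so that the strided reads and the contiguous
 * writes both stay inside a small working set. */
void pack_transpose_blocked(const double *src, double *buf, size_t n)
{
    for (size_t ii = 0; ii < n; ii += TILE)
        for (size_t jj = 0; jj < n; jj += TILE) {
            size_t imax = (ii + TILE < n) ? ii + TILE : n;
            size_t jmax = (jj + TILE < n) ? jj + TILE : n;
            for (size_t i = ii; i < imax; i++)
                for (size_t j = jj; j < jmax; j++)
                    buf[i * n + j] = src[j * n + i];
        }
}
```

Both functions fill `buf` identically; only the order of memory accesses differs. Whether the blocked version is actually faster, and by how much, depends on the matrix size and the memory hierarchy, which is the kind of architecture-specific tuning the abstract argues is better placed inside the MPI implementation than in user code.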
Citation:
Surendra Byna, William Gropp, Xian-He Sun, Rajeev Thakur, "Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost," Fifth IEEE International Conference on Cluster Computing (CLUSTER'03), p. 412, 2003.
