|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2011 International Conference on Parallel Architectures and Compilation Techniques
Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications
Galveston, Texas USA
October 10-October 14
ISBN: 978-0-7695-4566-0
| ASCII Text | x | ||
| Wenjing Ma, Sriram Krishnamoorthy, Gagan Agrawal, "Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications," Parallel Architectures and Compilation Techniques, International Conference on, pp. 181-182, 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011. | |||
| BibTex | x | ||
| @article{ 10.1109/PACT.2011.30, author = {Wenjing Ma and Sriram Krishnamoorthy and Gagan Agrawal}, title = {Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications}, journal ={Parallel Architectures and Compilation Techniques, International Conference on}, volume = {0}, year = {2011}, issn = {1089-795X}, pages = {181-182}, doi = {http://doi.ieeecomputersociety.org/10.1109/PACT.2011.30}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Parallel Architectures and Compilation Techniques, International Conference on TI - Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications SN - 1089-795X SP181 EP182 A1 - Wenjing Ma, A1 - Sriram Krishnamoorthy, A1 - Gagan Agrawal, PY - 2011 KW - auto-tuning KW - GPU KW - optimization VL - 0 JA - Parallel Architectures and Compilation Techniques, International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PACT.2011.30
Auto-tuning has emerged as an important practical method for creating highly optimized code. However, the growing complexity of architectures and applications has resulted in a prohibitively large search space that preclude empirical auto-tuning. Here, we focus on the challenge to auto-tuning presented by applications that require auto-tuning of not just a small number of distinct kernels, but a large number of kernels that exhibit similar computation and memory access characteristics and require optimization over similar problem spaces. We propose an auto-tuning method for tensor contraction functions on GPUs, based on parameterized micro-benchmarks. Using our parameterized micro-benchmarking approach, we obtain a speedup of up to 2 over the version that used default optimizations without auto-tuning.
Index Terms:
auto-tuning, GPU, optimization
Citation:
Wenjing Ma, Sriram Krishnamoorthy, Gagan Agrawal, "Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications," pact, pp.181-182, 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011
Usage of this product signifies your acceptance of the Terms of Use.
