|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2011 International Conference on Parallel Processing
A Comprehensive Performance Comparison of CUDA and OpenCL
Taipei City, Taiwan
September 13-September 16
ISBN: 978-0-7695-4510-3
| ASCII Text | x | ||
| Jianbin Fang, Ana Lucia Varbanescu, Henk Sips, "A Comprehensive Performance Comparison of CUDA and OpenCL," 2012 41st International Conference on Parallel Processing, pp. 216-225, 2011 International Conference on Parallel Processing, 2011. | |||
| BibTex | x | ||
| @article{ 10.1109/ICPP.2011.45, author = {Jianbin Fang and Ana Lucia Varbanescu and Henk Sips}, title = {A Comprehensive Performance Comparison of CUDA and OpenCL}, journal ={2012 41st International Conference on Parallel Processing}, volume = {0}, year = {2011}, issn = {0190-3918}, pages = {216-225}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICPP.2011.45}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - 2012 41st International Conference on Parallel Processing TI - A Comprehensive Performance Comparison of CUDA and OpenCL SN - 0190-3918 SP216 EP225 A1 - Jianbin Fang, A1 - Ana Lucia Varbanescu, A1 - Henk Sips, PY - 2011 KW - Performance Comparison KW - CUDA KW - OpenCL VL - 0 JA - 2012 41st International Conference on Parallel Processing ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPP.2011.45
This paper presents a comprehensive performance comparison between CUDA and OpenCL. We have selected 16 benchmarks ranging from synthetic applications to real-world ones. We make an extensive analysis of the performance gaps taking into account programming models, ptimization strategies, architectural details, and underlying compilers. Our results show that, for most applications, CUDA performs at most 30\% better than OpenCL. We also show that this difference is due to unfair comparisons: in fact, OpenCL can achieve similar performance to CUDA under a fair comparison. Therefore, we define a fair comparison of the two types of applications, providing guidelines for more potential analyses. We also investigate OpenCL's portability by running the benchmarks on other prevailing platforms with minor modifications. Overall, we conclude that OpenCL's portability does not fundamentally affect its performance, and OpenCL can be a good alternative to CUDA.
Index Terms:
Performance Comparison, CUDA, OpenCL
Citation:
Jianbin Fang, Ana Lucia Varbanescu, Henk Sips, "A Comprehensive Performance Comparison of CUDA and OpenCL," icpp, pp.216-225, 2011 International Conference on Parallel Processing, 2011
Usage of this product signifies your acceptance of the Terms of Use.
