2013 International Conference on Computing, Networking and Communications (ICNC) (2012)
Okinawa, Japan Japan
Dec. 5, 2012 to Dec. 7, 2012
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICNC.2012.39
The CUDA programming model provides better abstraction for GPU programming. However, it is still hard to write programs with CUDA because both some specific techniques and knowledge about GPU architecture are required. Hence, many programming frameworks for CUDA have been developed. OpenMPC is one of them based on OpenMP. OpenMPC is an easy-to-write framework for programmers familiar with traditional OpenMP, but still requires programmers to use the special directives for utilizing fast device memories. To solve this problem, this paper proposes a method for allocating appropriate device memories automatically. This paper also proposes a method for automatically allocating page locked memory for the data which are transferred between host and device. The evaluation results with several programs show that proposed methods can reduce 52% execution time in maximum.
memory allocation, GPGPU, CUDA, OpenMPC
Hiroaki Uchiyama, Tomoaki Tsumura, Hiroshi Matsuo, "An Automatic Host and Device Memory Allocation Method for OpenMPC", 2013 International Conference on Computing, Networking and Communications (ICNC), vol. 00, no. , pp. 208-214, 2012, doi:10.1109/ICNC.2012.39