|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Tianyi David Han, Tarek S. Abdelrahman, "hiCUDA: High-Level GPGPU Programming," IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 1, pp. 78-90, January, 2011. | |||
| BibTex | x | ||
| @article{ 10.1109/TPDS.2010.62, author = {Tianyi David Han and Tarek S. Abdelrahman}, title = {hiCUDA: High-Level GPGPU Programming}, journal ={IEEE Transactions on Parallel and Distributed Systems}, volume = {22}, number = {1}, issn = {1045-9219}, year = {2011}, pages = {78-90}, doi = {http://doi.ieeecomputersociety.org/10.1109/TPDS.2010.62}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Parallel and Distributed Systems TI - hiCUDA: High-Level GPGPU Programming IS - 1 SN - 1045-9219 SP78 EP90 EPD - 78-90 A1 - Tianyi David Han, A1 - Tarek S. Abdelrahman, PY - 2011 KW - CUDA KW - GPGPU KW - data-parallel programming KW - directive-based language KW - source-to-source compiler. VL - 22 JA - IEEE Transactions on Parallel and Distributed Systems ER - | |||
[1] NVIDIA, "NVIDIA GeForce 8800 GPU Architecture Overview," http://www.nvidia.com/objectIO_37100.html , Nov. 2006.
[2] NVIDIA, "NVIDIA CUDA Programming Guide v1.1," http://developer.download.nvidia.com/compute/ cuda/1_1NVIDIA_ CUDA_Programming_Guide_1.1.pdf , Nov. 2007.
[3] I. Buck et al., "Brook for GPUs: Stream Computing on Graphics Hardware," Proc. ACM SIGGRAPH, pp. 777-786, 2004.
[4] "Open Computing Language (OpenCL)," http://www.khronos. orgopencl/, 2010.
[5] S. Ryoo et al., "Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA," Proc. Symp. Principles and Practice of Parallel Programming, pp. 73-82, 2008.
[6] NVIDIA, "The CUDA Compiler Driver NVCC v1.1," http://www.nvidia.com/objectcuda_programming_tools.html , 2007.
[7] S. Ryoo et al., "Program Optimization Space Pruning for a Multithreaded GPU," Proc. Int'l Symp. Code Generation and Optimization, pp. 195-204, 2008.
[8] ISO( )14882:2003, "Information Technology—Programming Languages—C++," ISO, 2003.
[9] T.D. Han, "Directive-Based General-Purpose GPU Programming," master's thesis, Univ. of Toronto, Sept. 2009.
[10] C. Liao et al., "Effective Source-to-Source Outlining to Support Whole Program Empirical Optimization," Proc. Int'l Workshop Languages and Compilers for Parallel Computing, Oct. 2009.
[11] S. Muchnick, Advanced Compiler Design and Implementation. Morgan Kaufmann, 1997.
[12] J. Fabri, "Automatic Storage Optimization," Proc. Symp. Compiler Construction, pp. 83-91, 1979.
[13] A. Clementson and C. Elphick, "Approximate Coloring Algorithms for Composite Graphs," J. Operational Research Soc., vol. 34, no. 6, pp. 503-509, 1983.
[14] "Open64 Research Compiler," http:/open64.sourceforge.net, 2010.
[15] IMPACT Research Group, "The Parboil Benchmark Suite," http://www.crhc.uiuc.edu/IMPACTparboil.php , 2007.
[16] L. Wang, S. Jacques, and L. Zheng, "MCML—Monte Carlo Modeling of Light Transport in Multi-Layered Tissues," Computer Methods and Programs in Biomedicine, vol. 47, no. 2, pp. 131-146, 1995.
[17] A. Klockner, "Pycuda v0.94beta Documentation," http:// documen.tician.depycuda/, 2010.
[18] GASS, "jCUDA: Java for CUDA," http://www.gass-ltd.co.il/en/productsjcuda /, 2010.
[19] The Portland Group, "CUDA Fortran Programming Guide and Reference v0.9," http://www.pgroup.com/lit/whitepaperspgi_spec_cuda_fortran_0.9.pdf , June 2009.
[20] S. Lee, S.-J. Min, and R. Eigenmann, "OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization," Proc. Symp. Principles and Practice of Parallel Programming, pp. 101-110, 2009.
[21] OpenMP ARB, "OpenMP Specification v3.0," http://openmp. org/wpopenmp-specifications /, May 2008.
[22] The Portland Group, "PGI Fortran and C Accelerator Programming Model," http://www.pgroup.com/lit/whitepaperspgi_ accel_prog_model_1.0.pdf , June 2009.
[23] S.-Z. Ueng et al., "CUDA-lite: Reducing GPU Programming Complexity," Proc. Int'l Workshop Languages and Compilers for Parallel Computing, pp. 1-15, 2008.
[24] C.-K. Luk, S. Hong, and H. Kim, "Qilin: Exploiting Parallelism on Heterogeneous Multiprocessors with Adaptive Mapping," Proc. Int'l Symp. Microarchitecture, pp. 45-55, 2009.
[25] M.M. Baskaran et al., "A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs," Proc. Int'l Conf. Supercomputing, pp. 225-234, 2008.

