Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming (PPoPP '13)
By Bo Wu, Eddy Zheng Zhang, Xipeng Shen, Yunlian Jiang, Zhijia Zhao
Issue Date: February 2013
The performance of Graphics Processing Units (GPUs) is sensitive to irregular memory references. Recent work shows the promise of data reorganization for eliminating the non-coalesced memory accesses that irregular references cause. However, all pre...