Combining Program Recovery, Auto-Parallelisation and Locality Analysis for C Programs on Multi-Processor Embedded Systems
Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques (2003)
New Orleans, Louisiana
Sept. 27, 2003 to Oct. 1, 2003
Björn Franke , University of Edinburgh
M.F.P. O?Boyle , University of Edinburgh
This paper develops a complete auto-parallelisation approach for multiple-address space digital signal processors (DSPs). It combines a pointer conversion technique with a new modulo elimination transformation. This is followed by a combined parallelisation and address resolution approach which maps array references without introducing message-passing. Furthermore, as DSPs do not possess any cache structure, an optimisation is presented which transforms the program to both exploit remote data locality and local memory bandwidth. This parallelisation approach is applied to the DSPstone and UTDSP benchmark suites, giving an average speedup of 3.78 on a four processor Analog Devices TigerSHARC.
Björn Franke, M.F.P. O?Boyle, "Combining Program Recovery, Auto-Parallelisation and Locality Analysis for C Programs on Multi-Processor Embedded Systems", Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, vol. 00, no. , pp. 104, 2003, doi:10.1109/PACT.2003.1238007