International Conference on Parallel Processing, 2004. ICPP 2004. (2004)
Montreal, Quebec, Canada
Aug. 15, 2004 to Aug. 18, 2004
Qingfeng Zhuge , University of Texas at Dallas
Zili Shao , University of Texas at Dallas
Edwin H.-M. Sha , University of Texas at Dallas
Software pipelining for nested loops remains a challenging problem for embedded system design. The existing software pipelining techniques for single loops can only explore the parallelism of the innermost loop, so the final timing performance is inferior. While multi-dimensional (MD) retiming can explore the outer loop parallelism, it introduces large overheads in loop index generation and code size due to transformation. In this paper, we use MD retiming to model the software pipelining problem of nested loops. We show that the computation time and code size of a software-pipelined loop nest is affected by execution sequence and retiming function. The algorithm of Software PIpelining for NEsted loops technique (SPINE) is proposed to generate fully parallelized loops efficiently with the overheads as small as possible. The experimental results show that our technique outperforms both the standard software pipelining and MD retiming significantly.
E. H. Sha, Q. Zhuge and Z. Shao, "Timing Optimization of Nested Loops Considering Code Size for DSP Applications," International Conference on Parallel Processing, 2004. ICPP 2004.(ICPP), Montreal, Quebec, Canada, 2004, pp. 475-482.