14th Symposium on Computer Architecture and High Performance Computing (SCAB-PAD'02) Exploiting Loop-Level Parallelism with the Shift Architecture Vit?ria, ES, Brazil October 28-October 30 ISBN: 0-7695-1772-2
The limited amount of instruction-level parallelism inherent in applications is a limiting factor for improving the performance of most conventional microprocessors. A promising solution to overcome this problem is to exploit coarser granularities of parallelism. In this paper, we propose exploiting loop-level parallelism in a multithreaded fashion. We use the Shift Architecture [9] as a baseline architecture, with improved compiler support and register file. The compiler converts iterations of a loop into threads, to be executed by multiple processing elements. The hardware provides a selective register shifting mechanism in order to allow the execution of loops containing loop-carried data dependences, which are very difficult to execute by using conventional architectures.In this paper, we simulate and discuss the parameters of major importance for the implementation of this architectural approach. Our initial results show that, on two simple numerical benchmarks, a considerable amount of iteration overlapping can be potentially achieved by an implementation of the Shift Architecture, in comparison with a multi-processor machine.
Citation:
C. Lima, T. Nakamura, "Exploiting Loop-Level Parallelism with the Shift Architecture," sbac-pad, pp.0184, 14th Symposium on Computer Architecture and High Performance Computing (SCAB-PAD'02), 2002 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||