loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers
Effective Instruction Prefetching via Fetch Prestaging
Denver, Colorado
April 04-April 08
ISBN: 0-7695-2312-9
Ayose Falc?, Barcelona Research Office, HP Labs
Alex Ramirez, Universitat Polit?cnica de Catalunya
Mateo Valero, Universitat Polit?cnica de Catalunya
As technological process shrinks and clock rate increases, instruction caches can no longer be accessed in one cycle. Alternatives are implementing smaller caches (with higher miss rate) or large caches with a pipelined access (with higher branch misprediction penalty). In both cases, the performance obtained is far from the obtained by an ideal large cache with one-cycle access.
In this paper we present Cache Line Guided Prestaging (CLGP), a novel mechanism that overcomes the limitations of current instruction cache implementations. CLGP employs prefetching to charge future cache lines into a set of fast prestage buffers. These buffers are managed efficiently by the CLGP algorithm, trying to fetch from them as much as possible. Therefore, the number of fetches served by the main instruction cache is highly reduced, and so the negative impact of its access latency on the overall performance.
With the best CLGP configuration using a 4 KB I-cache, speedups of 3.5% (at 0.09?m) and 12.5% (at 0.045?m) are obtained over an equivalent Fetch Directed Prefetching configuration, and 39% (at 0.09?m) and 48% (at 0.045?m) over using a pipelined instruction cache without prefetching. Moreover, our results show that CLGP with a 2.5 KB of total cache budget can obtain a similar performance than using a 64 KB pipelined I-cache without prefetching, that is equivalent performance at 6.4X our hardware budget.
Citation:
Ayose Falc?, Alex Ramirez, Mateo Valero, "Effective Instruction Prefetching via Fetch Prestaging," ipdps, vol. 1, pp.20b, 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers, 2005
Usage of this product signifies your acceptance of the Terms of Use.