1999 International Conference on Parallel Processing (ICPP'99)
Optimization of Instruction Fetch for Decision Support Workloads
Wakamatsu, Japan
September 21-September 24
ISBN: 0-7695-0350-0
Instruction fetch bandwidth is feared to be a major limiting factor in the performance of future wide-issue, aggressive superscalar processors. In this paper, we focus on database applications running Decision Support workloads. We characterize the locality patterns of a database kernel and find its frequently executed paths. Using this information, we propose an algorithm to lay out the basic blocks for improved I-fetch. Our results show a miss reduction of 60-98% for realistic I-cache sizes and a doubling of the number of instructions executed between taken branches. As a consequence, we increase the fetch bandwidth provided by an aggressive sequential fetch unit from 5.8 instructions per cycle for the original code to 10.6 with our proposed layout. Our software scheme combines well with hardware schemes such as a Trace Cache, providing up to 12.1 instructions per cycle, which suggests that commercial workloads may be amenable to the aggressive I-fetch of future superscalars.
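The abstract does not spell out the layout algorithm, so the following is only a minimal sketch of the general idea of profile-guided basic block placement, not the authors' method. It assumes a hypothetical profile given as edge counts between named basic blocks, chains blocks greedily along the hottest unplaced successor, and counts how many dynamic control transfers are not fall-throughs under a given layout. The names `layout_blocks`, `taken_transfers`, and the example profile are illustrative assumptions.

```python
def layout_blocks(edge_counts, entry):
    """Greedy chaining sketch: from each seed, follow the hottest unplaced successor.

    edge_counts maps (src, dst) block pairs to profiled execution counts
    (a hypothetical profile format, assumed for illustration).
    """
    succs = {}    # src -> list of (count, dst)
    weight = {}   # dst -> total incoming count, used to order seeds
    blocks = {entry}
    for (src, dst), count in edge_counts.items():
        succs.setdefault(src, []).append((count, dst))
        weight[dst] = weight.get(dst, 0) + count
        blocks.update((src, dst))

    # Seeds: the entry block first, then the remaining blocks from hottest to coldest.
    seeds = [entry] + sorted(blocks - {entry}, key=lambda b: (-weight.get(b, 0), b))

    placed = set()
    order = []
    for seed in seeds:
        cur = seed
        while cur is not None and cur not in placed:
            placed.add(cur)
            order.append(cur)
            candidates = [(c, d) for c, d in succs.get(cur, []) if d not in placed]
            cur = max(candidates)[1] if candidates else None
    return order


def taken_transfers(order, edge_counts):
    """Count dynamic transfers whose target is not the next block in the layout.

    Simplification: every non-adjacent edge is treated as one taken branch.
    """
    position = {block: i for i, block in enumerate(order)}
    return sum(
        count
        for (src, dst), count in edge_counts.items()
        if position[dst] != position[src] + 1
    )


# Hypothetical profile of a diamond-shaped control-flow graph.
profile = {("A", "B"): 90, ("A", "C"): 10, ("B", "D"): 90, ("C", "D"): 10}
original = ["A", "C", "B", "D"]
optimized = layout_blocks(profile, "A")
print(optimized, taken_transfers(original, profile), taken_transfers(optimized, profile))
```

In this toy profile the hot path A, B, D becomes contiguous, so fewer dynamic transfers leave the sequential fetch stream. That is the effect the abstract describes as more instructions executed between taken branches.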
Index Terms:
High performance fetch, compiler optimization, trace cache, profiling, databases
Citation:
Alex Ramirez, Josep Ll. Larriba-Pey, Carlos Navarro, Xavi Serrano, Mateo Valero, Josep Torrellas, "Optimization of Instruction Fetch for Decision Support Workloads," icpp, pp.238, 1999 International Conference on Parallel Processing (ICPP'99), 1999