1999 International Conference on Parallel Processing (ICPP'99) Optimization of Instruction Fetch for Decision Support Workloads Wakamatsu, Japan September 21-September 24 ISBN: 0-7695-0350-0
Instruction fetch bandwidth is feared to be a major limiting factor to the performance of future wide-issue aggressive superscalars.In this paper, we focus on Database applications running Decision Support workloads. We characterize the locality patterns of ia database kernel and find frequently executed paths. Using this information, we propose an algorithm to lay out the basic blocks for improved I-fetch.Our results show a miss reduction of 60-98% for realistic I-cache sizes and a doubling of the number of instructions executed between taken branches. As a consequence, we increase the fetch bandwith provided by an aggressive sequential fetch unit from 5.8 for the original code to 10.6 using our proposed layout. Our software scheme combines well with hardware schemes like a Trace Cache providing up to 12.1 instruction per cycle, suggesting that commercial workloads may be amenable to the aggressive I-fetch of future superscalars.
Index Terms:
High performance fetch, compiler optimization, trace cache, profiling, databases
Citation:
Alex Ramirez, Josep Ll. Larriba-Pey, Carlos Navarro, Xavi Serrano, Mateo Valero, Josep Torrellas, "Optimization of Instruction Fetch for Decision Support Workloads," icpp, pp.238, 1999 International Conference on Parallel Processing (ICPP'99), 1999 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||