Eighth International Conference on Parallel Architectures and Compilation Techniques (PACT'99) Exploring Last n Value Prediction Newport Beach, California October 12-October 16 ISBN: 0-7695-0425-6
Most load value predictors retain a large number of previously loaded values for making future predictions. In this paper we evaluate the trade-off between tall and slim versus short and wide predictors of the same total size, i.e., between retaining a few values for a large number of load instructions and many values for a proportionately smaller number of loads. Our results show, for example, that even modest predictors holding sixteen kilobytes of values benefit from retaining four values per load instruction when running SPECint95.A detailed comparison of eight load value predictors on a cycle-accurate simulator of a superscalar out-of-order microprocessor shows that our implementation of a last four value predictor outperforms other predictors from the literature, often significantly. With 21kB of state, it yields a harmonic mean speedup of 12.5% with existing re-fetch misprediction recovery hardware and 13.7% with a not yet realized re-execution recovery mechanism.
Index Terms:
value prediction, processor performance, value locality, predictor design, behavior prediction
Citation:
Martin Burtscher, Benjamin G. Zorn, "Exploring Last n Value Prediction," pact, pp.66, Eighth International Conference on Parallel Architectures and Compilation Techniques (PACT'99), 1999 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||