34th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'01) Direct Load: Dependence-Linked Dataflow Resolution of Load Address and Cache Coordinate Austin, Texas December 01-December 05 ISBN: 0-7695-1369-7
An increasing cache latency in future processors incurs profound performance impacts in spite of advanced out-of-order execution techniques. In this paper, we de- scribe an early address resolution mechanism that accurately resolves both regular and irregular load addresses. The basic idea is to build dynamic dependence links from the instruction that updates the base register to the consumer load instructions. Once a new base address is available, it triggers calculations of the new load addresses for dependent loads. Furthermore, the exact cache location of the requested data is predicted based on the newly resolved load address. As a result, this direct load can access the data cache directly to achieve a zero- cycle load latency. Performance evaluation using SPEC integer programs shows that the dynamic dependence links can be established accurately. Combined with a stride-based predictor, the proposed early address resolution achieves about 97% average accuracy with less than 1% misprediction. Based on a modified Simple Scalar model, the proposed method can potentially improve the IPC by about 18%.
Citation:
Byung Chung, Jinsuo Zhang, Jih-Kwon Peir, Shih Lai, Konrad Lai, "Direct Load: Dependence-Linked Dataflow Resolution of Load Address and Cache Coordinate," micro, pp.76, 34th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'01), 2001 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||