2008 International Symposium on Computer Architecture
Counting Dependence Predictors
June 21-June 25
ISBN: 978-0-7695-3174-8
BibTeX:
@article{ 10.1109/ISCA.2008.6,
  author = {Franziska Roesner and Doug Burger and Stephen W. Keckler},
  title = {Counting Dependence Predictors},
  journal = {Computer Architecture, International Symposium on},
  volume = {0},
  year = {2008},
  issn = {1063-6897},
  pages = {215-226},
  doi = {http://doi.ieeecomputersociety.org/10.1109/ISCA.2008.6},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ISCA.2008.6
Modern processors rely on memory dependence prediction to execute load instructions as early as possible, speculating that they are not dependent on an earlier, unissued store. To date, the most sophisticated dependence predictors, such as Store Sets, have been tightly coupled to the fetch and execution streams, requiring global knowledge of the in-flight stream of stores to synchronize loads with specific stores. This paper proposes a new dependence predictor design, called a Counting Dependence Predictor (CDP). The key feature of CDPs is that the prediction mechanism predicts some set of events for which a particular dynamic load should wait, which may include some number of matching stores. By waiting for local events only, this dependence predictor can work effectively in a distributed microarchitecture where centralized fetch and execution streams are infeasible or undesirable. We describe and evaluate a distributed Counting Dependence Predictor and protocol that achieves 92% of the performance of perfect memory disambiguation. It outperforms a load-wait table, similar to the Alpha 21264, by 11%. Idealized, centralized implementations of Store Sets and the Exclusive Collision Predictor, both of which would be difficult to implement in a distributed microarchitecture, achieve 97% and 94% of oracular performance, respectively.
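The counting idea in the abstract can be illustrated with a small sketch. This is only a toy model under stated assumptions, not the authors' design: the abstract does not give the table organization, the training policy, or the full set of events a load may wait for. The class `ToyCountingPredictor`, its `train` method, the table size, and the helper `may_issue` are all hypothetical names introduced here for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ToyCountingPredictor:
    """Toy model: maps a load PC to a predicted count of matching stores.

    Hypothetical structure; the table size and modulo indexing are assumptions.
    """
    size: int = 1024
    table: dict = field(default_factory=dict)

    def _index(self, load_pc: int) -> int:
        # Simple modulo hash of the load's program counter (an assumption).
        return load_pc % self.size

    def predict(self, load_pc: int) -> int:
        # Untrained entries predict 0: the load issues without waiting.
        return self.table.get(self._index(load_pc), 0)

    def train(self, load_pc: int, stores_needed: int) -> None:
        # After a dependence violation, remember how many matching stores
        # the load should have waited for (hypothetical training policy).
        self.table[self._index(load_pc)] = stores_needed


def may_issue(predicted_count: int, matching_stores_seen: int) -> bool:
    """A load may issue once it has locally observed enough matching stores."""
    return matching_stores_seen >= predicted_count


predictor = ToyCountingPredictor()
first_guess = predictor.predict(0x40)    # untrained load: no waiting predicted
predictor.train(0x40, 2)                 # learn that two matching stores precede it
second_guess = predictor.predict(0x40)
```

In this sketch the load only counts events it can observe itself (stores with a matching address), which mirrors the abstract's point that waiting on local events avoids tracking the global in-flight store stream.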
Index Terms:
memory systems, dependence prediction, multiprocessor and multicore architectures
Citation:
Franziska Roesner, Doug Burger, Stephen W. Keckler, "Counting Dependence Predictors," 2008 International Symposium on Computer Architecture (ISCA), pp. 215-226, 2008.
