16th International Conference on Parallel Architecture and Compilation Techniques (PACT 2007) Brasov, Romania September 15-September 19 ISBN: 0-7695-2944-5
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PACT.2007.10
Out-of-order superscalar processors require the ability to issue loads while older stores are in-flight. Forcing loads to wait for all older stores, including those on which they may not be dependent on, to retire and write to the cache would reduce IPC and take away almost all the benefit of out-of-order execution. On the other hand, maintaining functional correctness while allowing loads to execute in the presence of stores in-flight requires the ability to forward data from the most recent older inflight store to the same address. Such forwarding typically involves a CAM match of the 64 bit physical address field of each store queue entry. The store queue data forwarding logic is thus a significantly high-latency circuit and could limit the frequency of the design [2].
Citation:
Rajesh Vivekanandharn, R. Govindarajan, "A Scalable Low Power Store Queue for Large InstructionWindow Processors," pact, pp.430, 16th International Conference on Parallel Architecture and Compilation Techniques (PACT 2007), 2007 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||