|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Thomas F. Wenisch, Michael Ferdman, Anastasia Ailamaki, Babak Falsafi, Andreas Moshovos, "Making Address-Correlated Prefetching Practical," IEEE Micro, vol. 30, no. 1, pp. 50-59, January/February, 2010. | |||
| BibTex | x | ||
| @article{ 10.1109/MM.2010.21, author = {Thomas F. Wenisch and Michael Ferdman and Anastasia Ailamaki and Babak Falsafi and Andreas Moshovos}, title = {Making Address-Correlated Prefetching Practical}, journal ={IEEE Micro}, volume = {30}, number = {1}, issn = {0272-1732}, year = {2010}, pages = {50-59}, doi = {http://doi.ieeecomputersociety.org/10.1109/MM.2010.21}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - MGZN JO - IEEE Micro TI - Making Address-Correlated Prefetching Practical IS - 1 SN - 0272-1732 SP50 EP59 EPD - 50-59 A1 - Thomas F. Wenisch, A1 - Michael Ferdman, A1 - Anastasia Ailamaki, A1 - Babak Falsafi, A1 - Andreas Moshovos, PY - 2010 KW - cache memories KW - address-correlated prefetching VL - 30 JA - IEEE Micro ER - | |||
Despite a decade of research demonstrating its efficacy, address-correlated prefetching has never been implemented in a shipping processor because it requires megabytes of metadata—too large to store practically on chip. New storage-, latency-, and bandwidth-efficient mechanisms for storing metadata off chip yield a practical design that achieves 90 percent of the performance potential of idealized on-chip metadata storage.
1. N. Hardavellas et al., "Database Servers on Chip Multiprocessors: Limitations and Opportunities," Proc. 3rd Biennial Conf. Innovative Data Systems Research, 2007, www.cidrdb.org/cidr2007/paperscidr07p08.pdf .
2. Y. Chou, "Low-Cost Epoch-Based Correlation Prefetching for Commercial Applications," Proc. 40th Ann. IEEE/ACM Int'l Symp. Microarchitecture (MICRO 07), IEEE CS Press, 2007, pp. 301-313.
3. T.F. Wenisch et al., "Temporal Streaming of Shared Memory," Proc. 32nd Ann. Int'l Symp. Computer Architecture, IEEE CS Press, 2005, pp. 222-233.
4. T.M. Chilimbi and M. Hirzel, "Dynamic Hot Data Stream Prefetching for General-Purpose Programs," Proc. ACM SIGPLAN 2002 Conf. Programming Language Design and Implementation, ACM Press, 2002, pp. 199-209.
5. M. Ferdman and B. Falsafi, "Last-Touch Correlated Data Streaming," IEEE Int'l Symp. Performance Analysis of Systems and Software (ISPASS 07), IEEE CS Press, 2007, pp. 105-115.
6. M. Ferdman et al., "Temporal Instruction Fetch Streaming," Proc. 41st Ann. IEEE/ACM Int'l Symp. Microarchitecture (MICRO 08), IEEE CS Press, 2008, pp. 1-10.
7. D. Joseph and D. Grunwald, "Prefetching Using Markov Predictors," Proc. 24th Ann. Int'l Symp. Computer Architecture, ACM Press, 1997, pp. 252-263.
8. Z. Hu, M. Martonosi, and S. Kaxiras, "TCP: Tag Correlating Prefetchers," Proc. 9th IEEE Symp. High-Performance Computer Architecture (HPCA 9), IEEE CS Press, 2003, pp. 317-326.
9. A.-C. Lai, C. Fide, and B. Falsafi, "Dead-Block Prediction and Dead-Block Correlating Prefetchers," Proc. 28th Ann. Int'l Symp. Computer Architecture, ACM Press, 2001, pp. 144-154.
10. K.J. Nesbit and J.E. Smith, "Data Cache Prefetching Using a Global History Buffer," Proc. 10th Int'l Symp. High-Performance Computer Architecture, IEEE CS Press, 2004, p. 96-105.
11. Y. Solihin, J. Lee, and J. Torrellas, "Using a User-Level Memory Thread for Correlation Prefetching," Proc. 29th Ann. Int'l Symp. Computer Architecture, IEEE CS Press, 2002, pp. 171-182.
12. T.F. Wenisch et al., "Practical Off-chip Metadata for Temporal Memory Streaming," Proc. IEEE 15th Int'l Symp. High Performance Computer Architecture, IEEE CS Press, 2009, pp. 79-90.
1. T.F. Wenisch et al., "Temporal Streams in Commercial Server Applications," IEEE Int'l Symp. Workload Characterization (IISWC 2008), IEEE CS Press, 2008, pp. 99-108.

