|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
| ASCII Text | x | ||
| Salvador Petit Martí, Julio Sahuquillo Borrás, Pedro López Rodríguez, Rafael Ubal Tena, José Duato Marín, "A Complexity-Effective Out-of-Order Retirement Microarchitecture," IEEE Transactions on Computers, vol. 58, no. 12, pp. 1626-1639, December, 2009. | |||
| BibTex | x | ||
| @article{ 10.1109/TC.2009.95, author = {Salvador Petit Martí and Julio Sahuquillo Borrás and Pedro López Rodríguez and Rafael Ubal Tena and José Duato Marín}, title = {A Complexity-Effective Out-of-Order Retirement Microarchitecture}, journal ={IEEE Transactions on Computers}, volume = {58}, number = {12}, issn = {0018-9340}, year = {2009}, pages = {1626-1639}, doi = {http://doi.ieeecomputersociety.org/10.1109/TC.2009.95}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Computers TI - A Complexity-Effective Out-of-Order Retirement Microarchitecture IS - 12 SN - 0018-9340 SP1626 EP1639 EPD - 1626-1639 A1 - Salvador Petit Martí, A1 - Julio Sahuquillo Borrás, A1 - Pedro López Rodríguez, A1 - Rafael Ubal Tena, A1 - José Duato Marín, PY - 2009 KW - Instruction-level parallelism KW - out-of-order commit KW - long latency operations KW - control dependencies KW - exception handling. VL - 58 JA - IEEE Transactions on Computers ER - | |||
[1] J. Smith and A. Pleszkun, “Implementation of Precise Interrupts in Pipelined Processors,” Proc. 12th Ann. Int'l Symp. Computer Architecture, pp. 36-44, June 1985.
[2] S. Palacharla, N. Jouppi, and J. Smith, “Complexity-Effective Superscalar Processor,” Proc. 24th Ann. Int'l Symp. Computer Architecture, June 1997.
[3] N. Kirman, M. Kirman, M. Chaudhuri, and J. Martínez, “Checkpointed Early Load Retirement,” Proc. Int'l Symp. High Performance Architecture, Feb. 2005.
[4] H. Akkary, R. Rajwar, and S.T. Srinivasan, “Checkpoint Processing and Recovery: Towards Scalable Large Instruction Window Processors,” Proc. 36th Int'l Symp. Microarchitecture, Dec. 2003.
[5] A. Cristal, D. Ortega, J. Llosa, and M. Valero, “Out-of-Order Commit Processors,” Proc. Int'l Symp. High Performance Architecture, Feb. 2004.
[6] G. Bell and M. Lipasti, “Deconstructing Commit,” Proc. Int'l Symp. Performance Analysis of Systems and Software, Mar. 2004.
[7] R.E. Kessler, “The Alpha 21264 Microprocessor,” IEEE Micro, vol. 19, no. 2, pp. 24-36, Mar. 1999.
[8] J.M. Tendler, S. Dodson, S. Fields, H. Le, and B. Sinharoy, “Power4 System Microarchitecture,” technical white paper, IBM Server Group, Oct. 2001.
[9] G. Hinton, D. Sager, M. Upton, D. Upton, D. Boggs, D. Carmean, A. Kyker, and P. Rousell, “The Microarchitecture of the Pentium 4 Processor,” Intel Technology J., vol. 5, no. 1, 2001.
[10] D. Burger and T.M. Austin, “The Simplescalar Tool Set, Version 2.0.,” Computer Architecture News, vol. 25, no. 3, pp.13-25, 1997.
[11] J. Smith and G. Sohi, “The Microarchitecture of Superscalar Processors,” Proc. IEEE, vol. 83, no. 2, pp. 1609-1624, Dec. 1995.
[12] M. Moudgill, K. Pingali, and S. Vassiliadis, “Register Renaming and Dynamic Speculation: An Alternative Approach,” Proc. 26th Int'l Symp. Microarchitecture, pp. 202-213, Dec. 1993.
[13] K. Yeager, “The mips r10000 Superscalar Microprocessor,” IEEE Micro, vol. 16, no. 2, pp. 28-40, Apr. 1996.
[14] J. Shen and M. Lipasti, Modern Processor Design. McGraw-Hill, 2005.
[15] K. Gharachorloo, A. Gupta, J.H. Singhal, D. Broniarczyk, F.M. Cerauskis, J. Price, L. Yuan, G. Cheng, D. Doblar, S. Fosth, N. Agarwal, K. Harvey, and E. Hagersten, “Two Techniques to Enhance the Performance of Memory Consistency Modelsgigaplane: A High Performance Bus for Large Smps,” Proc. Symp. High Performance Interconnects IV, pp. 41-52, 1996.
[16] D.J. Sorin, M. Plakal, M.D. Hill, and A.E. Condon, “Lamport Clocks: Reasoning about Shared Memory Correctness,” Technical Report CS-TR-1998-1367, Computer Sciences Dept., University of Wisconsin, Madison, 1998.
[17] M. Plakal, D.J. Sorin, A.E. Condon, and M.D. Hill, “Lamport Clocks: Verifying a Directory Cache-Coherency Protocol,” Proc. 10th ACM Ann. Symp. Parallel Algorithms and Architectures (SPAA '98), pp. 67-76, 1998.
[18] K. Gharachorloo, A. Gupta, and J. Hennessy, “Two Techniques to Enhance the Performance of Memory Consistency Models,” Proc. Int'l Conf. Parallel Processing, pp. I-355-I-364, 1991.
[19] Standard Performance Evaluation Corporation, http://www. spec.orgcpu2000/, 2009.
[20] T. Sherwood, E. Perelman, G. Hamerly, and B. Calder, “Automatically Characterizing Large Scale Program Behavior,” Proc. 10th Int'l Conf. Architectural Support for Programming Languages and Operating Systems (ASPLOS-X), Oct. 2002.
[21] Free Software Foundation, GCC Online Documentation, http://www.gnu.org/software/gcconlinedocs /, 2006.
[22] O. Mutlu, J. Stark, C. Wilkerson, and Y. Patt, “Runahead Execution: An Alternative to Very Large Instruction Window for Out-of-Order Processors,” Proc. Int'l Symp. High Performance Architecture, Feb. 2003.
[23] M. Kirman, N. Kirman, and J. Martínez, “Cherry-mp: Correctly Integrating Checkpointed Early Resource Recycling in Chip Multiprocessors,” Proc. Int'l Symp. Microarchitecture, Nov. 2005.
[24] E. Vallejo, M. Galluzzi, A. Cristal, F. Vallejo, R. Beivide, P. Stenstrom, J.E. Smith, and M. Valero, “Implementing Kilo-Instruction Multiprocessors,” Proc. IEEE Conf. Pervasive Services, Invited lecture, pp. 325-336, July 2005.
[25] J. Martinez, J. Renau, M. Huang, M. Prvulovic, and J. Torrellas, “Cherry: Checkpointed Early Resource Recycling in Out-of-Order Processors,” Proc. 35th Int'l Symp. Microarchitecture, Nov. 2002.
[26] S.E. Raasch, N.L. Binkert, and S.K. Reinhardt, “A Scalable Instruction Queue Design Using Dependence Chains,” Proc. 29th Ann. Int'l Symp. Computer Architecture, May 2002.
[27] R. Balasubramonian, S. Dwarkadas, and D. Albonesi, “Reducing the Complexity of the Register File in Dynamic Superscalar Processors,” Proc. 34th Int'l Symp. Microarchitecture, Dec. 2001.
[28] I. Park, C. Ooi, and T. Vijaykumar, “Reducing Design Complexity of the Load/Store Queue,” Proc. 36th Int'l Symp. Microarchitecture, Dec. 2003.

