39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06)
Reunion: Complexity-Effective Multicore Redundancy
Orlando, Florida, USA
December 9-13, 2006
ISBN: 0-7695-2732-9
Jared C. Smolens, Carnegie Mellon University
Brian T. Gold, Carnegie Mellon University
Babak Falsafi, Carnegie Mellon University
James C. Hoe, Carnegie Mellon University

To protect processor logic from soft errors, multicore redundant architectures execute two copies of a program on separate cores of a chip multiprocessor (CMP). Maintaining identical instruction streams is challenging because the redundant cores operate independently, yet must still receive the same inputs (e.g., load values and shared-memory invalidations). Past proposals strictly replicate load values across the two cores, requiring significant changes to the highly optimized core.

We make the key observation that, in the common case, both cores load identical values without special hardware. When the cores do receive different load values (e.g., due to a data race), the same mechanisms employed for soft error detection and recovery can correct the difference. This observation permits designs that relax input replication, while still providing correct redundant execution. In this paper, we present Reunion, an execution model that provides relaxed input replication and preserves the existing memory interface, coherence protocols, and consistency models. We evaluate a CMP-based implementation of the Reunion execution model with full-system, cycle-accurate simulation. We show that the performance overhead of relaxed input replication is only 5% and 6% for commercial and scientific workloads, respectively.
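The idea of relaxed input replication can be illustrated with a toy model. The sketch below is not the Reunion hardware or the authors' implementation; it is a minimal Python illustration under stated assumptions. The names `Core` and `run_redundant` are hypothetical, each core's state is reduced to a running sum of loaded values, and recovery is assumed to roll both cores back to the last agreed state and hand them a single agreed value.

```python
from dataclasses import dataclass


@dataclass
class Core:
    """Toy core (hypothetical): state is a running sum of loaded values."""
    acc: int = 0          # speculative state
    checkpoint: int = 0   # last state both cores agreed on

    def execute_load(self, value: int) -> None:
        self.acc += value

    def commit(self) -> None:
        self.checkpoint = self.acc

    def rollback(self) -> None:
        self.acc = self.checkpoint


def run_redundant(loads_a, loads_b):
    """Run two redundant cores over the load values each one observed.

    loads_a[i] and loads_b[i] are the values the two cores saw for load i.
    They normally match; they differ only if a race (or a fault) intervened.
    Returns (final_state, number_of_recoveries).
    """
    a, b = Core(), Core()
    recoveries = 0
    for va, vb in zip(loads_a, loads_b):
        # Common case: each core loads independently, no special hardware.
        a.execute_load(va)
        b.execute_load(vb)
        if a.acc != b.acc:
            # Divergence is handled like a detected soft error:
            # roll back, then replay with one agreed input value.
            a.rollback()
            b.rollback()
            agreed = va  # assumption: arbitrarily pick the first core's value
            a.execute_load(agreed)
            b.execute_load(agreed)
            recoveries += 1
        a.commit()
        b.commit()
    return a.acc, recoveries


if __name__ == "__main__":
    print(run_redundant([1, 2, 3], [1, 2, 3]))  # identical inputs
    print(run_redundant([1, 2, 3], [1, 5, 3]))  # second load diverges
```

In this model the comparison and rollback path is exercised only when the two cores observe different values, which mirrors the abstract's claim that the common case needs no strict replication of load values.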

Citation:
Jared C. Smolens, Brian T. Gold, Babak Falsafi, and James C. Hoe, "Reunion: Complexity-Effective Multicore Redundancy," in 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06), 2006, pp. 223-234.