loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
12th Pacific Rim International Symposium on Dependable Computing (PRDC'06)
Design Trade-Offs and Deadlock Prevention in Transient Fault-Tolerant SMT Processors
Riverside, California
December 18-December 20
ISBN: 0-7695-2724-8
Xiaobin Li, Enterprise Microprocessor Group Intel Corporation
Jean-Luc Gaudiot, University of California, Irvine
Since the very concept of Simultaneous Multi-Threading (SMT) entails inherent redundancy, some proposals have been made to run two copies of the same thread on top of SMT platforms in order to detect and correct soft errors. This allows, upon detection of an error, for the rolling back of the processor state to a known safe point, and then a retry of the instructions, thereby resulting in a completely error-free execution. This paper focuses on two crucial implementation issues introduced by this concept: (i) the design trade-off between the fault detection coverage versus the design costs; (ii) the possible occurrence of deadlock situations. To achieve the largest possible fault detection coverage, we replicate the instructions fetched in order to generate the redundant thread copies. Further, we apply the SMT thread scheduling at the the instruction dispatch stage so as to lower the performance overhead. As a result, when compared to the baseline processor, our simulation results show that by using our two new schemes, the performance overhead can be reduced down to as little as 34% on the average, down from 42%. Finally, in the faulttolerant execution mode, since the two copied threads are cooperating with one another, deadlock situations could be quite common. We thus present a detailed deadlock analysis and then conclude that allocating some entries of ROB, LQ, and SQ for the trailing thread is sufficient to prevent such deadlocks.
Citation:
Xiaobin Li, Jean-Luc Gaudiot, "Design Trade-Offs and Deadlock Prevention in Transient Fault-Tolerant SMT Processors," prdc, pp.315-322, 12th Pacific Rim International Symposium on Dependable Computing (PRDC'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.