The Community for Technology Leaders
Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques (2011)
Galveston, Texas USA
Oct. 10, 2011 to Oct. 14, 2011
ISSN: 1089-795X
ISBN: 978-0-7695-4566-0
pp: 199-200
ABSTRACT
Fault-tolerance has become an essential concern for processor designers due to increasing transient and permanent fault rates. In this study we propose Symptom TM, a symptom-based error detection technique that recovers from errors by leveraging the abort mechanism of Transactional Memory (TM). To the best of our knowledge, this is the first architectural fault-tolerance proposal using Hardware Transactional Memory (HTM). Symptom TM can recover from 86% and 65% of catastrophic failures caused by transient and permanent errors respectively with no performance overhead in error-free executions.
INDEX TERMS
Fault Tolerance, Hardware Transactional Memory
CITATION
Gulay Yalcin, Osman S. Unsal, Adrian Cristal, Mateo Valero, Ibrahim Hur, "SymptomTM: Symptom-Based Error Detection and Recovery Using Hardware Transactional Memory", Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, vol. 00, no. , pp. 199-200, 2011, doi:10.1109/PACT.2011.39
166 ms
(Ver 3.3 (11022016))