loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 Parallel, Distributed and Network-based Processing
Proactive Fault Tolerance Using Preemptive Migration
Weimar, Germany
February 18-February 20
ISBN: 978-0-7695-3544-9
Proactive fault tolerance (FT) in high-performance computing is a concept that prevents compute node failures from impacting running parallel applications by preemptively migrating application parts away from nodes that are about to fail. This paper provides a foundation for proactive FT by defining its architecture and classifying implementation options. This paper further relates prior work to the presented architecture and classification, and discusses the challenges ahead for needed supporting technologies.
Index Terms:
fault tolerance, high-performance computing, preemptive migration
Citation:
Christian Engelmann, Geoffroy R. Vallee, Thomas Naughton, Stephen L. Scott, "Proactive Fault Tolerance Using Preemptive Migration," pdp, pp.252-257, 2009 Parallel, Distributed and Network-based Processing, 2009
Usage of this product signifies your acceptance of the Terms of Use.