loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
30th Annual Simulation Symposium (SS '97)
Execution-Driven Simulation of Error Recovery Techniques for Multicomputers
Atlanta, GA
April 07-April 09
ISBN: 0-8186-7934-4
Tiffany M. Frazier, Computer Science Department UCLA
Yuval Tamir, Computer Science Department UCLA
DERT (Distributed Error Recovery Testbed) is a testbed for simulation and performance evaluation of several classes of application-transparent distributed error recovery schemes. DERT is built on top of an event-driven, message-passing, object-oriented, multithreaded simulation kernel. Actual compiled distributed applications are instrumented for data collection and executed on the simulated multicomputer. Checkpointing is implemented in full detail, including associated overhead per message, additional messages, and changes to the memory system. DERT allows easy modification of a wide variety of system parameters, thus offering a level of flexibility not easily achieved by experimentation on a particular real machine. This paper describes the design, functionality, and performance of DERT. The main problems encountered in DERT's development are discussed, as well as examples of its use in evaluating recovery schemes.
Citation:
Tiffany M. Frazier, Yuval Tamir, "Execution-Driven Simulation of Error Recovery Techniques for Multicomputers," ss, pp.4, 30th Annual Simulation Symposium (SS '97), 1997
Usage of this product signifies your acceptance of the Terms of Use.