This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fault Tolerance and Configurability in DSM Coherence Protocols
April-June 2000 (vol. 8 no. 2)
pp. 10-21

Potentially malfunctioning components in large distributed shared memory systems require highly available services that can be configured according to expected failure rates in the environment. Although several coherence protocols have been developed for DSM systems,1 few address configurability and fault tolerance. To make complex computer systems more robust and fault tolerant, data must be replicated for high availability, and the level of replication must be configurable to control overhead costs. Using an application suite, the authors test several distributed shared memory coherence protocols under different workloads and analyze the operation costs, fault tolerance, and configurability of each.

Index Terms:
distributed shared memory, coherence protocols, software fault tolerance, replication, data availability, configurability
Citation:
Brett D. Fleisch, Heiko Michel, Sachin K. Shah, Oliver E. Theel, "Fault Tolerance and Configurability in DSM Coherence Protocols," IEEE Concurrency, vol. 8, no. 2, pp. 10-21, April-June 2000, doi:10.1109/4434.846189
Usage of this product signifies your acceptance of the Terms of Use.