Steward: Scaling Byzantine Fault-Tolerant Replication to Wide Area Networks
January-March 2010 (vol. 7 no. 1)
pp. 80-93
Yair Amir, Johns Hopkins University, Baltimore
Claudiu Danilov, Boeing Phantom Works, Seattle
Danny Dolev, Hebrew University of Jerusalem, Jerusalem
Jonathan Kirsch, Johns Hopkins University, Baltimore
John Lane, Johns Hopkins University, Baltimore
Cristina Nita-Rotaru, Purdue University, West Lafayette
Josh Olsen, University of California, Irvine, Irvine
David Zage, Purdue University, West Lafayette
This paper presents the first hierarchical Byzantine fault-tolerant replication architecture suitable for systems that span multiple wide-area sites. The architecture confines the effects of any malicious replica to its local site, reduces the message complexity of wide-area communication, and allows read-only queries to be performed locally within a site, at the cost of additional standard hardware. We present proofs that our algorithm provides safety and liveness properties. A prototype implementation is evaluated over several network topologies and compared with a flat Byzantine fault-tolerant approach. The experimental results show considerable improvement over flat Byzantine replication algorithms, bringing the performance of Byzantine replication closer to that of existing benign fault-tolerant replication techniques over wide-area networks.

Fault tolerance, scalability, wide area networks.
