Principal Features of the VOLTAN Family of Reliable Node Architectures for Distributed Systems
May 1992 (vol. 41 no. 5)
pp. 542-549

A VOLTAN node is composed of a number of conventional processors on which application-level processes are replicated to achieve fault-tolerance. The architecture of a family of such nodes with differing functionalities is presented. These include failure-masking, fail-signal, and fail-silent nodes. The software architectures of a three-processor failure-masking and a two-processor fail-silent node are discussed in detail. The suitability of VOLTAN nodes as building blocks of reliable distributed systems is also discussed.
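
The abstract names the node types but does not say how replica outputs are combined. As a rough illustration only, the sketch below assumes that a failure-masking node takes a majority vote over replica outputs and that a fail-silent node emits output only when all replicas agree. Both mechanisms, and the function names `mask_failures` and `fail_silent`, are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter
from typing import Optional, Sequence


def mask_failures(outputs: Sequence[bytes]) -> Optional[bytes]:
    """Hypothetical failure-masking step: return the value produced by a
    strict majority of replicas, or None if no such majority exists."""
    if not outputs:
        return None
    value, count = Counter(outputs).most_common(1)[0]
    return value if count > len(outputs) // 2 else None


def fail_silent(outputs: Sequence[bytes]) -> Optional[bytes]:
    """Hypothetical fail-silent step: return the common value only if every
    replica agrees; otherwise produce nothing (None)."""
    if outputs and all(o == outputs[0] for o in outputs):
        return outputs[0]
    return None


# Three replicas, one faulty: the majority value is still delivered.
masked = mask_failures([b"ok", b"ok", b"bad"])

# Two replicas that disagree: the node stays silent.
silent = fail_silent([b"ok", b"bad"])
```

Under these assumptions, three replicas can hide a single faulty output, while two replicas can only detect a disagreement and stop, which matches the three-processor and two-processor configurations the abstract mentions.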

Index Terms:
reliable node architectures; VOLTAN node; application-level processes; fault-tolerance; failure-masking; fail-signal; fail-silent; software architectures; reliable distributed systems; distributed processing; fault tolerant computing.
Citation:
S.K. Shrivastava, P.D. Ezhilchelvan, N.A. Speirs, S. Tao, A. Tully, "Principal Features of the VOLTAN Family of Reliable Node Architectures for Distributed Systems," IEEE Transactions on Computers, vol. 41, no. 5, pp. 542-549, May 1992, doi:10.1109/12.142680