This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A Reconfigurable Modular Fault-Tolerant Hypercube Architecture
October 1994 (vol. 5 no. 10)
pp. 1018-1032

We propose a new fault-tolerant design of a hypercube system. We first build thefault-tolerant modules (FTM's), then we interconnect these FTM's as the modularhypercube. Finally, we obtain our proposed system by augmenting links, called thespare-sharing links (SSL's), in the modular hypercube, which forms a ring connection inour architecture. The characteristic of our system is that the spare nodes in an FTM canbe used as local spares to replace the faulty nodes in the FTM, or as remote spares toreplace the faulty nodes in other FTM's via the spare-sharing links in the architecture.Thus, the use of spare nodes in any FTM will increase, and the proposed system reliability will improve. In the system, the switch and link failures are also considered. The modular diagnosis and modular reconfiguration are proposed to identify and reconfigure the failure of nodes, switches, and links.

[1] M. S. Alam and R. G. Melhem, "An efficient modular spare allocation scheme and its application to fault tolerant binary hypercube,"IEEE Trans. Parallel Distrib. Syst., vol. 2, pp. 117-126, 1991.
[2] S-C. Chau and A. L. Liestman, "A proposal for a fault-tolerant binary hypercube," inProc. Nineteenth Fault Tolerant Comput. Symp., Chicago, IL, June 1989, pp. 323-330.
[3] T. F. Chan and Y. Saad, "Multigrid algorithms on the hypercube multiprocessor,"IEEE Trans. Comput., vol. C-35, no. 11, pp. 969-977, Nov. 1986.
[4] C.Y.R. Chen and Y.C. Chung, "Embedding networks with ring connections in hypercube machines,"Int. Conf. Parallel Processing, 1990, pp. III-327-III-334.
[5] S. Dutt and J. P. Hayes, "An automorphic approach to the design of fault-tolerant multiprocessors," inProc. Nineteenth Fault Tolerant Comput. Symp., June 1989, Chicago, IL, pp. 496-503.
[6] R. Hamming,Coding and Information Theory. Englewood Cliffs, NJ: Prentice-Hall, 1980.
[7] F. Harary,Graph Theory. Reading, MA: Addison-Wesley, 1968.
[8] D. A. Rennels, "On implementing fault-tolerance in binary hypercube,"Proc. IEEE Fault Tolerant Computing, 1985, pp. 344-349.
[9] F. Barsi, F. Grandoni, and P. Maestrini, "A theory of diagnosability of digital systems,"IEEE Trans. Comput, vol. C-25, no. 6, pp. 585-593, June 1976.

Index Terms:
Index Termsfault tolerant computing; reliability; reconfigurable architectures; hypercube networks;parallel architectures; system recovery; reconfigurable modular fault-tolerant hypercube;hypercube architecture; fault-tolerant design; hypercube system; modular hypercube;spare-sharing links; ring connection; local spares; faulty nodes; FTM; system reliability;link failures; switch failures; modular reconfiguration; nodes; switches; links
Citation:
C.S. Yang, L.P. Zu, Y.N. Wu, "A Reconfigurable Modular Fault-Tolerant Hypercube Architecture," IEEE Transactions on Parallel and Distributed Systems, vol. 5, no. 10, pp. 1018-1032, Oct. 1994, doi:10.1109/71.313119
Usage of this product signifies your acceptance of the Terms of Use.