Issue No. 02 - February (1995 vol. 6)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/71.342122
<p><it>Abstract—</it>Fault-tolerant, real-time communication in distributed systems is important yet difficult to achieve. Traditional protocols such as TCP/IP achieve reliable communication through acknowledgment and retransmission schemes, which gain reliability at the cost of performance. In this paper, we discuss how both the timeliness and the fault tolerance of communication can be achieved by using the concept of a <it>real-time channel</it> [<ref type="bib" rid="D01131">1</ref>] and exploiting the inherent spatial redundancy of a given network topology. Specifically, we show how <it>isolated failure immune</it> real-time channels can be established in wrapped hexagonal mesh networks, thus ensuring timely delivery of messages in the presence of network component failures as long as the failures are isolated. This kind of fault tolerance cannot be achieved with other commonly known topologies such as rings, rectangular meshes, and hypercubes. The proposed approach is to be implemented in an experimental distributed real-time system, called HARTS [<ref type="bib" rid="D01132">2</ref>], whose construction is underway.</p><p><it>Index Terms—</it>Distributed computing systems, fault-tolerant real-time communications, wrapped hexagonal mesh, isolated failure immune networks, real-time channels.</p>
Q. Zheng and K. G. Shin, "Establishment of Isolated Failure Immune Real-Time Channels in HARTS," in IEEE Transactions on Parallel & Distributed Systems, vol. 6, no. 2, pp. 113-119, 1995.