11th Annual IEEE Symposium on Field-Programmable Custom Computing Machines Adaptive Fault Recovery for Networked Reconfigurable Systems Napa, California April 09-April 11 ISBN: 0-7695-1979-2
The device-level size and complexity of reconfigurable architectures makes fault tolerance an important concern in system design. In this paper, we introduce a fully-automated fault recovery system for networked systems which contain FPGAs. If a fault is detected that can not be addressed locally, fault information is transferred to a reconfiguration server. Following design recompilation to avoid the fault, a new FPGA configuration is returned to the remote system and computation is reinitiated. To illustrate the benefit of this approach, we have implemented a complete fault recovery system which requires no manual intervention. An important part of the system is a timing-driven incremental router for Xilinx Virtex devices. This router is directly interfaced to Xilinx JBits and uses no CAD tools from the standard Xilinx Alliance tool flow. Our completed system has been applied to three benchmark designs and exhibits complete fault recovery in up to 12 \times less time than the standard incremental Xilinx PAR flow.
Citation:
Weifeng Xu, Ramshankar Ramanarayanan, Russell Tessier, "Adaptive Fault Recovery for Networked Reconfigurable Systems," fccm, pp.143, 11th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, 2003 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||