Fault-Tolerant Communication in Embedded Supercomputing
September/October 1998 (vol. 18 no. 5)
pp. 42-52
BibTeX:
@article{10.1109/40.735943,
  author    = {Giorgos Efthivoulidis and Evangelos A. Verentziotis and Apostolos N. Meliones and Theodora A. Varvarigou and Antonios Kontizas and Geert Deconinck and Vincenzo De Florio},
  title     = {Fault-Tolerant Communication in Embedded Supercomputing},
  journal   = {IEEE Micro},
  volume    = {18},
  number    = {5},
  issn      = {0272-1732},
  year      = {1998},
  pages     = {42-52},
  doi       = {http://doi.ieeecomputersociety.org/10.1109/40.735943},
  publisher = {IEEE Computer Society},
  address   = {Los Alamitos, CA, USA}
}
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/40.735943
A framework is developed to integrate fault tolerance flexibly and easily into embedded parallel high-performance computing (HPC) applications. The framework consists of a variety of reusable fault tolerance modules that act at different levels and address common requirements. It relieves application developers of the burden of ad hoc fault tolerance programming while avoiding the mediocre fault tolerance support typically provided at the operating system level. Integrating this functionality into real embedded applications validates the approach and yields promising results. This article focuses on fault tolerance mechanisms for synchronous and asynchronous communication between application threads running on system nodes.
Index Terms:
Fault tolerance, embedded systems, communications, parallel systems
Citation:
Giorgos Efthivoulidis, Evangelos A. Verentziotis, Apostolos N. Meliones, Theodora A. Varvarigou, Antonios Kontizas, Geert Deconinck, Vincenzo De Florio, "Fault-Tolerant Communication in Embedded Supercomputing," IEEE Micro, vol. 18, no. 5, pp. 42-52, Sept.-Oct. 1998, doi:10.1109/40.735943

