loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07)
An Efficient and Reliable Scientific Workflow System
Rio De Janeiro, Brazil
May 14-May 17
ISBN: 0-7695-2833-3
Tulio Tavares, Universidade Federal de Minas
George Teodoro, Universidade Federal de Minas
Tahsin Kurc, Ohio State University, USA
Renato Ferreira, Universidade Federal de Minas
Dorgival Guedes, Universidade Federal de Minas
Wagner Jr. Meira, Universidade Federal de Minas
Umit Catalyurek, Ohio State University, USA
Shannon Hastings, Ohio State University, USA
Scott Oster, Ohio State University, USA
Steve Langella, Ohio State University, USA
Joel Saltz, Ohio State University, USA
This paper presents a fault tolerance framework for applications that process data using a distributed network of user-defined operations in a pipelined fashion. The framework saves intermediate results and messages exchanged among application components in a distributed data management system to facilitate quick recovery from failures. The experimental results show that the framework scales well and our approach introduces very little overhead to application execution.
Citation:
Tulio Tavares, George Teodoro, Tahsin Kurc, Renato Ferreira, Dorgival Guedes, Wagner Jr. Meira, Umit Catalyurek, Shannon Hastings, Scott Oster, Steve Langella, Joel Saltz, "An Efficient and Reliable Scientific Workflow System," ccgrid, pp.445-452, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07), 2007
Usage of this product signifies your acceptance of the Terms of Use.