Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Phoenix: Making Data-Intensive Grid Applications Fault-Tolerant
Pittsburgh, PA
November 8, 2004
ISBN: 0-7695-2256-4
George Kola, University of Wisconsin-Madison
Tevfik Kosar, University of Wisconsin-Madison
Miron Livny, University of Wisconsin-Madison
A major hurdle facing data-intensive grid applications is the appropriate handling of failures that occur in the grid environment. Implementing fault tolerance transparently at the grid-middleware level would make different data-intensive applications fault-tolerant without each having to pay a separate cost, and would shorten the time to a grid-based solution for many scientific problems. We analyzed the failures encountered by four real-life production data-intensive applications: the NCSA image processing pipeline, the WCER video processing pipeline, the US-CMS pipeline, and the BMRB BLAST pipeline. Taking the results of this analysis into account, we designed and implemented Phoenix, a transparent middleware-level fault-tolerance layer that detects failures early, classifies them as transient or permanent, and handles the transient failures appropriately. We applied our fault-tolerance layer to a prototype of the NCSA image processing pipeline, where it considerably improved failure handling, and we report on the insights gained in the process.
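The classify-then-handle idea in the abstract can be illustrated with a minimal sketch. This is not Phoenix's implementation, which the abstract does not describe in detail. The names `FailureKind`, `classify`, and `run_with_retries` are hypothetical, and so is the assumption that timeouts and connection errors count as transient while everything else counts as permanent.

```python
import time
from enum import Enum


class FailureKind(Enum):
    TRANSIENT = "transient"
    PERMANENT = "permanent"


# Assumption for illustration only: these exception types stand in for
# transient failures. The abstract does not say how Phoenix classifies.
TRANSIENT_ERRORS = (TimeoutError, ConnectionError)


def classify(exc):
    """Label a failure as transient (worth retrying) or permanent."""
    if isinstance(exc, TRANSIENT_ERRORS):
        return FailureKind.TRANSIENT
    return FailureKind.PERMANENT


def run_with_retries(task, max_attempts=3, delay=0.0):
    """Run task(); retry transient failures, re-raise permanent ones at once."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except Exception as exc:
            # Permanent failures, or transient ones that have used up the
            # retry budget, are passed back to the caller unchanged.
            if classify(exc) is FailureKind.PERMANENT or attempt == max_attempts:
                raise
            time.sleep(delay)
```

In this sketch a task that raises `ConnectionError` twice and then succeeds completes on the third attempt, while a task that raises `ValueError` fails immediately without being retried.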
Citation:
George Kola, Tevfik Kosar, Miron Livny, "Phoenix: Making Data-Intensive Grid Applications Fault-Tolerant," Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04), pp. 251-258, 2004