This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fifth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'05) - Volume 2
ReGS: user-level reliability in a grid environment
Cardiff, Wales, UK
May 09-May 12
ISBN: 0-7803-9074-1
J.A.L. Sanches, COPPE, Univ. Fed. do Rio de Janeiro, Brazil
P.K. Vargas, COPPE, Univ. Fed. do Rio de Janeiro, Brazil
I. de Castro Dutra, COPPE, Univ. Fed. do Rio de Janeiro, Brazil
V.S. Costa, COPPE, Univ. Fed. do Rio de Janeiro, Brazil
C.F.R. Geyer, Dept. of Comput. Sci., Nat. Univ. of Ireland, Cork, Ireland
Grid environments are ideal for executing applications that require a huge amount of computational work, both due to the big number of tasks to execute and to the large amount of data to be analysed. Unfortunately, current tools may require that users deal themselves with corrupted outputs or early termination of tasks. This becomes inconvenient as the number of parallel runs grows to easily exceed the thousands. ReGS is a user-level software designed to provide automatic detection and restart of corrupted or early terminated tasks. ReGS uses a Web interface to allow the setup and control of grid execution, and provides automatic input data setup. ReGS allows the automatic detection of job dependencies, through the GRID-ADL task management language. Our results show that besides automatically and effectively managing a huge number of tasks in grid environments, ReGS is also a good monitoring tool to spot grid nodes pitfalls.
Citation:
J.A.L. Sanches, P.K. Vargas, I. de Castro Dutra, V.S. Costa, C.F.R. Geyer, "ReGS: user-level reliability in a grid environment," ccgrid, vol. 2, pp.718-725, Fifth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'05) - Volume 2, 2005
Usage of this product signifies your acceptance of the Terms of Use.