Cluster Computing and the Grid, IEEE International Symposium on (2008)
May 19, 2008 to May 22, 2008
Grid Computing has proven to be a solution for big workflow execution, especially in Bioinformatics. However, Grid nature itself introduces overheads that make its use in many cases an unfeasible solution if considering wall-time. Different heuristics such as list scheduling, agglomeration and replication are available for optimizing workflow execution. In particular, the replication heuristics have been previously used in heterogeneous environments with good results. In this work, we analyze their use for workflow scheduling on Grid infrastructures. In particular, we study its applications to an intree workflow, generated by the distribution of the CD-HIT application. The experiments were conducted on a testbed made of resources from two different grids and results show a significant reduction of the workflow execution time.
Grid Computing, Bioinformatics, CD-HIT, Workflows, Replication Heuristics
I. M. Llorente, R. S. Montero, J. L. V?zquez-Poletti and E. Huedo, "CD-HIT Workflow Execution on Grids Using Replication Heuristics," 2008 8th International Symposium on Cluster Computing and the Grid (CCGRID '08)(CCGRID), Lyon, 2008, pp. 735-740.