|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2008 Fourth IEEE International Conference on eScience
User Friendly Management of Workflow Results: From Provenance Information to Grid Logical File Names
December 07-December 12
ISBN: 978-0-7695-3535-7
| ASCII Text | x | ||
| Tristan Glatard, Silvia D. Olabarriaga, "User Friendly Management of Workflow Results: From Provenance Information to Grid Logical File Names," eScience, IEEE International Conference on, pp. 103-110, 2008 Fourth IEEE International Conference on eScience, 2008. | |||
| BibTex | x | ||
| @article{ 10.1109/eScience.2008.31, author = {Tristan Glatard and Silvia D. Olabarriaga}, title = {User Friendly Management of Workflow Results: From Provenance Information to Grid Logical File Names}, journal ={eScience, IEEE International Conference on}, volume = {0}, year = {2008}, isbn = {978-0-7695-3535-7}, pages = {103-110}, doi = {http://doi.ieeecomputersociety.org/10.1109/eScience.2008.31}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - eScience, IEEE International Conference on TI - User Friendly Management of Workflow Results: From Provenance Information to Grid Logical File Names SN - 978-0-7695-3535-7 SP103 EP110 A1 - Tristan Glatard, A1 - Silvia D. Olabarriaga, PY - 2008 KW - Workflow KW - grid KW - large result sets KW - logical file catalog KW - provenance VL - 0 JA - eScience, IEEE International Conference on ER - | |||
Grid workflows can produce thousands of results that should be properly organised to enable further analysis. Typically results are stored on locations hard-coded in the workflow or in the components, limiting reusability. In this paper we present an approach to (re)organise the output files generated by a grid workflow in a distributed storage environment. We propose to perform a post-mortem mapping of workflow results into a directory structure. This mapping is based on data provenance information and exploits grid catalog features, namely logical file names, to avoid data replication. By defining different mappings, users can generate their own semantic view of results generated during a workflow execution, which fosters user-friendliness, whereas preserving workflow reusability. An implementation on the Virtual Resource Browser (VBrowser) framework is detailed and evaluated on neuroimaging workflows. Results show that the complex directory structure of an image analysis application cane properly generated by our system. An initial performance evaluation of the mapping resolution and directory structure creation indicates that this approach provides a practical, simple, yet powerful solution to an important roadblock for the adoption of workflows to implement complex image analysis pipelines.
Index Terms:
Workflow, grid, large result sets, logical file catalog, provenance
Citation:
Tristan Glatard, Silvia D. Olabarriaga, "User Friendly Management of Workflow Results: From Provenance Information to Grid Logical File Names," escience, pp.103-110, 2008 Fourth IEEE International Conference on eScience, 2008
Usage of this product signifies your acceptance of the Terms of Use.
