The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - May/June (2008 vol.10)
pp: 11-21
Juliana Freire , University of Utah
David Koop , University of Utah
Emanuele Santos , University of Utah
Cl?udio T. Silva , University of Utah
ABSTRACT
The problem of systematically capturing and managing provenance for computational tasks has recently received significant attention because of its relevance to a wide range of domains and applications. The authors give an overview of important concepts related to provenance management, so that potential users can make informed decisions when selecting or designing a provenance solution.
INDEX TERMS
provenance, data visualization, VisTrails
CITATION
Juliana Freire, David Koop, Emanuele Santos, Cl?udio T. Silva, "Provenance for Computational Tasks: A Survey", Computing in Science & Engineering, vol.10, no. 3, pp. 11-21, May/June 2008, doi:10.1109/MCSE.2008.79
REFERENCES
1. R. Bose and J. Frew, "Lineage Retrieval for Scientific Data Processing: A Survey," ACM Computing Surveys, vol. 37, no. 1, 2005, pp. 1–28.
2. Y.L. Simmhan, B. Plale, and D. Gannon, "A Survey of Data Provenance in E-Science," SIGMOD Record, vol. 34, no. 3, 2005, pp. 31–36.
3. W.C. Tan, "Provenance in Databases: Past, Current, and Future," IEEE Data Eng. Bulletin, vol. 30, no. 4, 2007, pp. 3–12.
4. B. Clifford et al., "Tracking Provenance in a Virtual Data Grid," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 565–575.
5. P. Groth, The Origin of Data: Enabling the Determination of Provenance in Multi-Institutional Scientific Systems through the Documentation of Processes, PhD thesis, Univ. of Southampton, 2007.
6. J. Frew, D. Metzger, and P. Slaughter, "Automatic Capture and Reconstruction of Computational Provenance," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 485–496.
7. K.-K. Muniswamy-Reddy, D.A. Holland, and U.B.M.I. Seltzer, "Provenance-Aware Storage Systems," Proc. USENIX Conf., Usenix, 2006, pp. 43–56.
8. I. Altintas, O. Barney, and E. Jaeger-Frank, "Provenance Collection Support in the Kepler Scientific Workflow System," Proc. Int'l Provenance and Annotation Workshop (IPAW), LNCS 4145, Springer, 2006, pp. 118–132.
9. S. Cohen, S.C. Boulakia, and S.B. Davidson, "Towards a Model of Provenance and User Views in Scientific Workflows," Data Integration in the Life Sciences, LNCS 4075, Springer, 2006, pp. 264–279.
10. J. Golbeck and J. Hendler, "A Semantic Web Approach to the Provenance Challenge," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 431–439.
11. J. Kim et al., "Provenance Trails in the Wings/Pegasus System," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 587–597.
12. J. Freire et al., "Managing Rapidly-Evolving Scientific Workflows," Proc. Int'l Provenance and Annotation Workshop (IPAW), LNCS 4145, Springer, 2006, pp. 10–18.
13. S. Miles et al., "Extracting Causal Graphs from an Open Provenance Data Model," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 577–586.
14. Y.L. Simmhan et al., "Karma2: Provenance Management for Data Driven Workflows," to be published in Int'l J. Web Services Research, vol. 5, no. 1, 2008.
15. J. Zhao et al., "Mining Taverna's Semantic Web of Provenance," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 463–472.
16. R.S. Barga and L.A. Digiampietri, "Automatic Capture and Efficient Storage of e-Science Experiment Provenance," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 419–429.
17. B. Ludäscher et al., "From Computation Models to Models of Provenance: The RWS Approach," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 507–518.
18. T. Oinn et al., "Taverna: Lessons in Creating a Workflow Environment for the Life Sciences," Concurrency and Computation: Practice &Experience, vol. 18, no. 10, 2006, pp. 1067–1100.
19. O. Biton et al., "Querying and Managing Provenance through User Views in Scientific Workflows," to be published in Proc. IEEE Int'l Conf. Data Eng., 2008.
20. S. Bowers, T. McPhillips, and B. Ludaescher, "Provenance in Collection-Oriented Scientific Workflows," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 519–529.
21. C.E. Scheidegger et al., "Querying and Creating Visualizations by Analogy," IEEE Trans. Visualization and Computer Graphics, vol. 13, no. 6, 2007, pp. 1560–1567.
22. J. Futrelle and J. Myers, "Tracking Provenance Semantics in Heterogeneous Execution Systems," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 555–564.
23. Y. Zhao et al., "Swift: Fast, Reliable, Loosely Coupled Parallel Computation," IEEE Int'l Workshop on Sci. Workflows (SWF), IEEE CS Press, 2007, pp. 199–206.
24. J. Freire and C. Silva, "Towards Enabling Social Analysis of Scientific Data," CHI Social Data Analysis Workshop, 2008, (to appear).
13 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool