This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Provenance: The Bridge Between Experiments and Data
May/June 2008 (vol. 10 no. 3)
pp. 38-46
Simon Miles, King's College London
Paul Groth, University of Southern California
Ewa Deelman, University of Southern California
Karan Vahi, University of Southern California
Gaurang Mehta, University of Southern California
Luc Moreau, University of Southampton
Current scientific applications are often structured as workflows and rely on workflow systems to compile abstract experiment designs into enactable workflows that utilize the best available resources. The automation of this step, and of the workflow enactment, hides the details about how results are produced. Knowing how compilation and enactment occurred allows results to be reconnected with the experiment design. The authors investigate how provenance helps scientists connect their results with the actual execution that took place, their original experiment, and its inputs and parameters.

1. B.C. Barish and R. Weiss, "LIGO and the Detection of Gravitational Waves," Physics Today, vol. 52, no. 10, 1999, pp. 44–50.
2. D.A. Brown et al., "A Case Study on the Use of Workflow Technologies for Scientific Analysis: Gravitational Wave Data Analysis," Workflows for e-Science, I. Taylor et al., eds., Springer, 2006, pp. 39–59.
3. D. Bernholdt et al., "The Earth System Grid: Supporting the Next Generation of Climate Modeling Research," Proc. IEEE, vol. 93, no. 3, 2005, pp. 485–495.
4. G.B. Berriman et al., "Montage: A Grid Enabled Engine for Delivering Custom Science-Grade Mosaics On Demand," Proc. SPIE Conf., SPIE, 2004; http://montage.ipac.caltech.edu/publications/ 2004SPIEMontage_SPIE_2004_paper.pdf .
5. E. Deelman et al., "Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems," Scientific Programming J., vol. 13, no. 3, 2005, pp. 219–237.
6. E. Deelman et al., "Pegasus: Mapping Large-Scale Workflows to Distributed Resources," Workflows in e-Science, I. Taylor et al., eds., Springer, 2006, pp. 376–394.
7. S. Miles et al., "Connecting Scientific Data to Scientific Experiments with Provenance" Proc. 3rd IEEE Int'l Conf. e-Science and Grid Computing, IEEE CS Press, 2007, pp. 179–186.
8. Y. Gil et al., "Wings for Pegasus: A Semantic Approach to Creating Very Large Scientific Workflows," Proc. OWL: Experiences and Directions (OWL-ED), CEUR-WS.org, 2006; http://sunsite.informatik.rwth-aachen.de/ Publications/CEUR-WS//Vol-216submission_29.pdf .
9. M. Wieczorek, R. Prodan, and T. Fahringer, "Scheduling of Scientific Workflows in the ASKALON Grid Environment," SIGMOD Record, vol. 34, no. 3, 2005, pp. 56–62.
10. L. Moreau et al., "The First Provenance Challenge," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 400–418.
11. Y. Simmhan, B. Plale, and D. Gannon, "A Survey of Data Provenance in e-Science," SIGMOD Record, vol. 34, no. 3, 2005, pp. 31–36.
12. S. Miles et al., "The Requirements of Using Provenance in e-Science Experiments," J. Grid Computing, vol. 5, no. 1, 2007, pp. 1–25.
13. E. Deelman et al. "Managing Large-Scale Workflow Execution from Resource Provisioning to Provenance Tracking: The CyberShake Example," Proc. e-Science 2006, IEEE Press, 2006, p. 14.
14. A. Lathers et al., "Enabling Parallel Scientific Applications with Workflow Tools," Challenges of Large Applications in Distributed Environments (CLADE), IEEE Press, 2006, pp. 55–60.
15. S. Álvarez et al., "Applying Provenance in Distributed Organ Transplant Management," Proc. Int'l Provenance and Annotation Workshop (IPAW 2006), Springer, 2006, pp. 28–36.
16. G. K. Kloss and A. Schreiber, "Provenance Implementation in a Scientific Simulation Environment," Proc. Int'l Provenance and Annotation Workshop (IPAW 06), Springer, 2006, pp. 37–45.

Index Terms:
provenance, workflow compilation, grid computing, traceability, service-oriented architecture
Citation:
Simon Miles, Paul Groth, Ewa Deelman, Karan Vahi, Gaurang Mehta, Luc Moreau, "Provenance: The Bridge Between Experiments and Data," Computing in Science and Engineering, vol. 10, no. 3, pp. 38-46, May-June 2008, doi:10.1109/MCSE.2008.82
Usage of this product signifies your acceptance of the Terms of Use.