The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.03 - May/June (2008 vol.10)
pp: 38-46
Simon Miles , King's College London
Paul Groth , University of Southern California
Ewa Deelman , University of Southern California
Karan Vahi , University of Southern California
Gaurang Mehta , University of Southern California
Luc Moreau , University of Southampton
ABSTRACT
Current scientific applications are often structured as workflows and rely on workflow systems to compile abstract experiment designs into enactable workflows that utilize the best available resources. The automation of this step, and of the workflow enactment, hides the details about how results are produced. Knowing how compilation and enactment occurred allows results to be reconnected with the experiment design. The authors investigate how provenance helps scientists connect their results with the actual execution that took place, their original experiment, and its inputs and parameters.
INDEX TERMS
provenance, workflow compilation, grid computing, traceability, service-oriented architecture
CITATION
Simon Miles, Paul Groth, Ewa Deelman, Karan Vahi, Gaurang Mehta, Luc Moreau, "Provenance: The Bridge Between Experiments and Data", Computing in Science & Engineering, vol.10, no. 3, pp. 38-46, May/June 2008, doi:10.1109/MCSE.2008.82
REFERENCES
1. B.C. Barish and R. Weiss, "LIGO and the Detection of Gravitational Waves," Physics Today, vol. 52, no. 10, 1999, pp. 44–50.
2. D.A. Brown et al., "A Case Study on the Use of Workflow Technologies for Scientific Analysis: Gravitational Wave Data Analysis," Workflows for e-Science, I. Taylor et al., eds., Springer, 2006, pp. 39–59.
3. D. Bernholdt et al., "The Earth System Grid: Supporting the Next Generation of Climate Modeling Research," Proc. IEEE, vol. 93, no. 3, 2005, pp. 485–495.
4. G.B. Berriman et al., "Montage: A Grid Enabled Engine for Delivering Custom Science-Grade Mosaics On Demand," Proc. SPIE Conf., SPIE, 2004; http://montage.ipac.caltech.edu/publications/ 2004SPIEMontage_SPIE_2004_paper.pdf .
5. E. Deelman et al., "Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems," Scientific Programming J., vol. 13, no. 3, 2005, pp. 219–237.
6. E. Deelman et al., "Pegasus: Mapping Large-Scale Workflows to Distributed Resources," Workflows in e-Science, I. Taylor et al., eds., Springer, 2006, pp. 376–394.
7. S. Miles et al., "Connecting Scientific Data to Scientific Experiments with Provenance" Proc. 3rd IEEE Int'l Conf. e-Science and Grid Computing, IEEE CS Press, 2007, pp. 179–186.
8. Y. Gil et al., "Wings for Pegasus: A Semantic Approach to Creating Very Large Scientific Workflows," Proc. OWL: Experiences and Directions (OWL-ED), CEUR-WS.org, 2006; http://sunsite.informatik.rwth-aachen.de/ Publications/CEUR-WS//Vol-216submission_29.pdf .
9. M. Wieczorek, R. Prodan, and T. Fahringer, "Scheduling of Scientific Workflows in the ASKALON Grid Environment," SIGMOD Record, vol. 34, no. 3, 2005, pp. 56–62.
10. L. Moreau et al., "The First Provenance Challenge," Concurrency and Computation: Practice and Experience, vol. 20, no. 5, 2008, pp. 400–418.
11. Y. Simmhan, B. Plale, and D. Gannon, "A Survey of Data Provenance in e-Science," SIGMOD Record, vol. 34, no. 3, 2005, pp. 31–36.
12. S. Miles et al., "The Requirements of Using Provenance in e-Science Experiments," J. Grid Computing, vol. 5, no. 1, 2007, pp. 1–25.
13. E. Deelman et al. "Managing Large-Scale Workflow Execution from Resource Provisioning to Provenance Tracking: The CyberShake Example," Proc. e-Science 2006, IEEE Press, 2006, p. 14.
14. A. Lathers et al., "Enabling Parallel Scientific Applications with Workflow Tools," Challenges of Large Applications in Distributed Environments (CLADE), IEEE Press, 2006, pp. 55–60.
15. S. Álvarez et al., "Applying Provenance in Distributed Organ Transplant Management," Proc. Int'l Provenance and Annotation Workshop (IPAW 2006), Springer, 2006, pp. 28–36.
16. G. K. Kloss and A. Schreiber, "Provenance Implementation in a Scientific Simulation Environment," Proc. Int'l Provenance and Annotation Workshop (IPAW 06), Springer, 2006, pp. 37–45.
21 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool