Issue No.09 - September (2009 vol.20)
Paul Groth, University of Southern California, Marina del Rey
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPDS.2008.215
Scientific and business communities are adopting large-scale distributed systems to solve a wide range of resource-intensive tasks. These communities also need to know the provenance of the results such systems produce. We define the provenance of a result produced by a distributed system as the process that led to that result. This paper describes a distributed protocol for recording documentation of a distributed system's execution. The protocol guarantees that the recorded documentation has the characteristics needed to determine the provenance of results accurately. These characteristics are confirmed through a number of proofs based on an abstract state machine formalization.
Provenance, lineage, grids, distributed systems, data protocols.
Paul Groth, "Recording Process Documentation for Provenance", IEEE Transactions on Parallel & Distributed Systems, vol. 20, no. 9, pp. 1246-1259, September 2009, doi:10.1109/TPDS.2008.215