Applying Distributed Ledgers to Manage Workflow Provenance
Jay Jay Billings

TL;DR
This paper explores how permissioned distributed ledgers can be used to automatically share and verify workflow provenance across systems, reducing duplication and enhancing data integrity.
Contribution
It presents the idea that distributed ledgers naturally meet many workflow provenance requirements and discusses multiple ways to embed provenance data within ledgers.
Findings
Distributed ledgers can effectively encode provenance information.
Provenance can be stored in multiple formats within the ledger.
Several new research directions are identified based on this approach.
Abstract
Sharing provenance across workflow management systems automatically is not currently possible, but the value of such a capability is high since it could greatly reduce the amount of duplicated workflows, accelerate the discovery of new knowledge, and verify the integrity of past and present analyses. Although numerous technological challenges exist to efficiently share provenance information across workflow management systems, permissioned distributed ledgers could surmount many of them. The primary benefit of permissioned distributed ledgers over other technologies is that their distribution is over a peer-to-peer network that encodes transactions across the network into an immutable hash list and achieves consensus on the validity of the new data through a common consensus mechanism. This work discusses provenance and distributed ledgers on their own and then presents an argument that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlockchain Technology Applications and Security · Scientific Computing and Data Management · Innovative Microfluidic and Catalytic Techniques Innovation
