ProvMark: A Provenance Expressiveness Benchmarking System
Sheung Chi Chan, James Cheney, Pramod Bhatotia, Thomas Pasquier,, Ashish Gehani, Hassaan Irshad, Lucian Carata, and Margo Seltzer

TL;DR
ProvMark is an automated benchmarking system that evaluates the expressiveness of provenance capture tools by analyzing their recorded provenance graphs through subgraph isomorphism, aiding in system comparison and correctness verification.
Contribution
Introduces ProvMark, a novel automated tool for benchmarking and comparing the expressiveness of provenance systems using subgraph isomorphism analysis.
Findings
ProvMark successfully distinguishes different provenance capture systems.
It provides a reliable method for testing provenance correctness.
Demonstrates differences in architecture and design philosophies of tested systems.
Abstract
System level provenance is of widespread interest for applications such as security enforcement and information protection. However, testing the correctness or completeness of provenance capture tools is challenging and currently done manually. In some cases there is not even a clear consensus about what behavior is correct. We present an automated tool, ProvMark, that uses an existing provenance system as a black box and reliably identifies the provenance graph structure recorded for a given activity, by a reduction to subgraph isomorphism problems handled by an external solver. ProvMark is a beginning step in the much needed area of testing and comparing the expressiveness of provenance systems. We demonstrate ProvMark's usefuless in comparing three capture systems with different architectures and distinct design philosophies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
