Causality and the Semantics of Provenance

James Cheney (University of Edinburgh)

arXiv:1006.1429·cs.LO·June 9, 2010·DCM

Causality and the Semantics of Provenance

James Cheney (University of Edinburgh)

PDF

TL;DR

This paper explores the formal relationship between causality and provenance, proposing a mathematical framework based on structural models to better understand and justify provenance mechanisms.

Contribution

It introduces a formal approach using structural causal models to analyze and interpret provenance graphs, bridging causality theory and data provenance.

Findings

01

Causality models can clarify provenance semantics.

02

Formal causality frameworks help evaluate provenance mechanisms.

03

Work in progress on applying causality to provenance graphs.

Abstract

Provenance, or information about the sources, derivation, custody or history of data, has been studied recently in a number of contexts, including databases, scientific workflows and the Semantic Web. Many provenance mechanisms have been developed, motivated by informal notions such as influence, dependence, explanation and causality. However, there has been little study of whether these mechanisms formally satisfy appropriate policies or even how to formalize relevant motivating concepts such as causality. We contend that mathematical models of these concepts are needed to justify and compare provenance techniques. In this paper we review a theory of causality based on structural models that has been developed in artificial intelligence, and describe work in progress on using causality to give a semantics to provenance graphs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.