Why so? or Why no? Functional Causality for Explaining Query Answers
Alexandra Meliou, Wolfgang Gatterbauer, Katherine F. Moore, Dan Suciu

TL;DR
This paper introduces a causality-based framework for explaining query answers and non-answers in databases, extending provenance and missing result explanations with a refined, property-rich definition of causes.
Contribution
It develops functional causes as a refined causality concept, enabling uniform explanations of query results, non-results, and contributions, including aggregate functions, with complexity analysis and algorithms.
Findings
Framework unifies explanations for query answers and non-answers.
Polynomial algorithms for causality evaluation in tractable cases.
Ability to rank causes by responsibility and explain aggregate contributions.
Abstract
In this paper, we propose causality as a unified framework to explain query answers and non-answers, thus generalizing and extending several previously proposed approaches of provenance and missing query result explanations. We develop our framework starting from the well-studied definition of actual causes by Halpern and Pearl. After identifying some undesirable characteristics of the original definition, we propose functional causes as a refined definition of causality with several desirable properties. These properties allow us to apply our notion of causality in a database context and apply it uniformly to define the causes of query results and their individual contributions in several ways: (i) we can model both provenance as well as non-answers, (ii) we can define explanations as either data in the input relations or relational operations in a query plan, and (iii) we can give…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Distributed and Parallel Computing Systems · Advanced Database Systems and Queries
