How Hard is it to Decide if a Fact is Relevant to a Query?
Meghyn Bienvenu, Diego Figueira, Pierre Lafourcade

TL;DR
This paper investigates the complexity of determining fact relevance to queries in databases, revealing that self-joins significantly impact difficulty and identifying classes where relevance can be computed efficiently.
Contribution
It characterizes the complexity of relevance decision problems, highlighting the role of self-joins and providing conditions for efficient relevance computation.
Findings
Relevance is NP-hard for acyclic chain CQs due to self-joins.
Bounding self-joins reduces relevance complexity to that of query evaluation.
In ontology-mediated queries, bounded interaction width ensures relevance is no harder than query answering.
Abstract
We consider the following fundamental problem: given a database D, Boolean conjunctive query (CQ) q, and fact f in D, decide whether f is relevant to q wrt. D, i.e., does f belong to a minimal subset S of D such that S |= q. Despite being of central importance to query answer explanation, the combined complexity of deciding query relevance has not been studied in detail, leaving open what makes this problem hard, and which restrictions can yield lower complexity. Relevance has already been shown to be harder than query evaluation: namely, -complete for CQs, even over a binary signature. We further observe that NP-hardness applies already to (acyclic) chain CQs. Our work identifies self-joins (multiple atoms with the same relation) as the culprit. Indeed, we prove that if we forbid or bound the occurrence of self-joins, then relevance has the same complexity as query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
