# A More General Theory of Static Approximations for Conjunctive Queries

**Authors:** Pablo Barcel\'o, Miguel Romero, Thomas Zeume

arXiv: 1904.00934 · 2019-04-02

## TL;DR

This paper introduces a unified framework for overapproximations and symmetric difference approximations of conjunctive queries, analyzing their computational properties and providing complexity results for evaluation, identification, and existence problems.

## Contribution

It establishes a connection between overapproximations and existential pebble games, enabling systematic study and complexity analysis of these approximations.

## Key findings

- Evaluation and identification of overapproximations are polynomial-time solvable.
- Existence of overapproximations is decidable in 2EXPTIME for acyclic CQs.
- Symmetric difference approximations have well-characterized complexity results.

## Abstract

Conjunctive query (CQ) evaluation is NP-complete, but becomes tractable for fragments of bounded hypertreewidth. Approximating a hard CQ by a query from such a fragment can thus allow for an efficient approximate evaluation. While underapproximations (i.e., approximations that return correct answers only) are well-understood, the dual notion of overapproximations (i.e, approximations that return complete - but not necessarily sound - answers), and also a more general notion of approximation based on the symmetric difference of query results, are almost unexplored. In fact, the decidability of the basic problems of evaluation, identification, and existence of those approximations has been open.   This article establishes a connection between overapproximations and existential pebble games that allows for studying such problems systematically. Building on this connection, it is shown that the evaluation and identification problem for overapproximations can be solved in polynomial time. While the general existence problem remains open, the problem is shown to be decidable in 2EXPTIME over the class of acyclic CQs and in PTIME for Boolean CQs over binary schemata. Additionally we propose a more liberal notion of overapproximations to remedy the known shortcoming that queries might not have an overapproximation, and study how queries can be overapproximated in the presence of tuple generating and equality generating dependencies.   The techniques are then extended to symmetric difference approximations and used to provide several complexity results for the identification, existence, and evaluation problem for this type of approximations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.00934/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1904.00934/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1904.00934/full.md

---
Source: https://tomesphere.com/paper/1904.00934