Dissociation and Propagation for Approximate Lifted Inference with Standard Relational Database Management Systems
Wolfgang Gatterbauer, Dan Suciu

TL;DR
This paper introduces a novel database-based approach for approximate probabilistic inference using multiple query plans to efficiently estimate probabilities, generalizing existing methods and applicable to lifted inference in statistical relational models.
Contribution
It presents a new algorithm that evaluates conjunctive queries in relational databases by generating minimal necessary plans, extending PTIME query classification, and adapting ranking techniques from graphs to hypergraphs.
Findings
Algorithm efficiently enumerates minimal plans based on schema information.
Approach generalizes PTIME classification for self-join-free conjunctive queries.
Experimental results demonstrate effectiveness and speed of the method.
Abstract
Probabilistic inference over large data sets is a challenging data management problem since exact inference is generally #P-hard and is most often solved approximately with sampling-based methods today. This paper proposes an alternative approach for approximate evaluation of conjunctive queries with standard relational databases: In our approach, every query is evaluated entirely in the database engine by evaluating a fixed number of query plans, each providing an upper bound on the true probability, then taking their minimum. We provide an algorithm that takes into account important schema information to enumerate only the minimal necessary plans among all possible plans. Importantly, this algorithm is a strict generalization of all known PTIME self-join-free conjunctive queries: A query is in PTIME if and only if our algorithm returns one single plan. Furthermore, our approach is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Management and Algorithms · Data Quality and Management
