Probabilistic Databases with an Infinite Open-World Assumption
Martin Grohe, Peter Lindner

TL;DR
This paper introduces a mathematically rigorous model for infinite probabilistic databases with an open-world assumption, allowing for more realistic representations of uncertain data over infinite domains like integers or strings.
Contribution
It extends the theory of finite probabilistic databases to infinite, countable domains, providing a construction and characterization for such models with an open-world perspective.
Findings
Existence of countable, tuple-independent infinite PDBs demonstrated
Framework for extending finite PDB query evaluation algorithms to infinite cases
Provides a foundation for more realistic probabilistic data modeling in infinite domains
Abstract
Probabilistic databases (PDBs) introduce uncertainty into relational databases by specifying probabilities for several possible instances. Traditionally, they are finite probability spaces over database instances. Such finite PDBs inherently make a closed-world assumption: non-occurring facts are assumed to be impossible, rather than just unlikely. As convincingly argued by Ceylan et al. (KR '16), this results in implausibilities and clashes with intuition. An open-world assumption, where facts not explicitly listed may have a small positive probability can yield more reasonable results. The corresponding open-world model of Ceylan et al., however, assumes that all entities in the PDB come from a fixed finite universe. In this work, we take one further step and propose a model of "truly" open-world PDBs with an infinite universe. This is natural when we consider entities from typical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Logic, Reasoning, and Knowledge · Semantic Web and Ontologies
