Tuple-Independent Representations of Infinite Probabilistic Databases
Nofar Carmeli, Martin Grohe, Peter Lindner, Christoph Standke

TL;DR
This paper investigates the conditions under which infinite probabilistic databases can be represented as views over tuple-independent PDBs, extending the theory beyond finite cases and exploring the limits of such representations.
Contribution
It provides necessary and sufficient criteria for the representability of infinite PDBs as first-order views over TI-PDBs, advancing the understanding of infinite probabilistic database models.
Findings
A necessary condition for PDB representability is established.
A sufficient criterion based on probability distributions is proposed.
Conditioning on first-order properties does not increase expressivity.
Abstract
Probabilistic databases (PDBs) are probability spaces over database instances. They provide a framework for handling uncertainty in databases, as occurs due to data integration, noisy data, data from unreliable sources or randomized processes. Most of the existing theory literature investigated finite, tuple-independent PDBs (TI-PDBs) where the occurrences of tuples are independent events. Only recently, Grohe and Lindner (PODS '19) introduced independence assumptions for PDBs beyond the finite domain assumption. In the finite, a major argument for discussing the theoretical properties of TI-PDBs is that they can be used to represent any finite PDB via views. This is no longer the case once the number of tuples is countably infinite. In this paper, we systematically study the representability of infinite PDBs in terms of TI-PDBs and the related block-independent disjoint PDBs. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
