Probabilistic Data with Continuous Distributions
Martin Grohe, Benjamin Lucien Kaminski, Joost-Pieter Katoen and, Peter Lindner

TL;DR
This paper develops a mathematical framework for infinite probabilistic databases involving continuous distributions, extending existing models and languages to handle continuous probability distributions.
Contribution
It introduces a general framework for probabilistic databases with continuous distributions and extends probabilistic programming languages to support continuous probabilistic models.
Findings
Queries have well-defined semantics in the new framework.
Generative Datalog is extended to continuous distributions.
The approach enables modeling of infinite probabilistic databases.
Abstract
Statistical models of real world data typically involve continuous probability distributions such as normal, Laplace, or exponential distributions. Such distributions are supported by many probabilistic modelling formalisms, including probabilistic database systems. Yet, the traditional theoretical framework of probabilistic databases focusses entirely on finite probabilistic databases. Only recently, we set out to develop the mathematical theory of infinite probabilistic databases. The present paper is an exposition of two recent papers which are cornerstones of this theory. In (Grohe, Lindner; ICDT 2020) we propose a very general framework for probabilistic databases, possibly involving continuous probability distributions, and show that queries have a well-defined semantics in this framework. In (Grohe, Kaminski, Katoen, Lindner; PODS 2020) we extend the declarative probabilistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
