Measure Transformer Semantics for Bayesian Machine Learning
Johannes Borgstr\"om (Uppsala University, Uppsala, Sweden), Andrew D, Gordon (Microsoft Research, Cambridge, UK), Michael Greenberg (University of, Pennsylvania, Philadelphia, PA, USA), James Margetson (Microsoft Research,, Cambridge, UK), Jurgen Van Gael (Microsoft FUSE Labs

TL;DR
This paper introduces a measure-transformer semantics for probabilistic programming in Bayesian machine learning, enabling rigorous reasoning and efficient inference across diverse measure types, including hybrid and zero-probability events.
Contribution
It proposes a core functional calculus with measure-transformer combinators, supporting discrete, continuous, and hybrid measures, and compiles to an imperative language for efficient inference.
Findings
Supports inference on zero-probability events
Achieves thousands of observations per second in large models
Provides a rigorous semantics for probabilistic programming
Abstract
The Bayesian approach to machine learning amounts to computing posterior distributions of random variables from a probabilistic model of how the variables are related (that is, a prior distribution) and a set of observations of variables. There is a trend in machine learning towards expressing Bayesian models as probabilistic programs. As a foundation for this kind of programming, we propose a core functional calculus with primitives for sampling prior distributions and observing variables. We define measure-transformer combinators inspired by theorems in measure theory, and use these to give a rigorous semantics to our core calculus. The original features of our semantics include its support for discrete, continuous, and hybrid measures, and, in particular, for observations of zero-probability events. We compile our core language to a small imperative language that is processed by an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
