ADEV: Sound Automatic Differentiation of Expected Values of   Probabilistic Programs

Alexander K. Lew; Mathieu Huot; Sam Staton; Vikash K. Mansinghka

arXiv:2212.06386·cs.PL·December 14, 2022

ADEV: Sound Automatic Differentiation of Expected Values of Probabilistic Programs

Alexander K. Lew, Mathieu Huot, Sam Staton, Vikash K. Mansinghka

PDF

1 Repo

TL;DR

ADEV is a novel extension of forward-mode automatic differentiation that accurately computes gradients of expected values in probabilistic programs, enabling unbiased Monte Carlo estimation for gradient-based optimization.

Contribution

We introduce ADEV, a source-to-source transformation for probabilistic programs that correctly differentiates expectations, including discrete and continuous distributions, with a formal correctness proof.

Findings

01

ADEV produces unbiased gradient estimates for probabilistic programs.

02

The implementation is concise and easily integrated into existing frameworks.

03

A prototype in Haskell demonstrates practical applicability.

Abstract

Optimizing the expected values of probabilistic processes is a central problem in computer science and its applications, arising in fields ranging from artificial intelligence to operations research to statistical computing. Unfortunately, automatic differentiation techniques developed for deterministic programs do not in general compute the correct gradients needed for widely used solutions based on gradient-based optimization. In this paper, we present ADEV, an extension to forward-mode AD that correctly differentiates the expectations of probabilistic processes represented as programs that make random choices. Our algorithm is a source-to-source program transformation on an expressive, higher-order language for probabilistic computation, with both discrete and continuous probability distributions. The result of our transformation is a new probabilistic program, whose expected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

probcomp/adev
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.