TL;DR
ADEV is a novel extension of forward-mode automatic differentiation that accurately computes gradients of expected values in probabilistic programs, enabling unbiased Monte Carlo estimation for gradient-based optimization.
Contribution
We introduce ADEV, a source-to-source transformation for probabilistic programs that correctly differentiates expectations, including discrete and continuous distributions, with a formal correctness proof.
Findings
ADEV produces unbiased gradient estimates for probabilistic programs.
The implementation is concise and easily integrated into existing frameworks.
A prototype in Haskell demonstrates practical applicability.
Abstract
Optimizing the expected values of probabilistic processes is a central problem in computer science and its applications, arising in fields ranging from artificial intelligence to operations research to statistical computing. Unfortunately, automatic differentiation techniques developed for deterministic programs do not in general compute the correct gradients needed for widely used solutions based on gradient-based optimization. In this paper, we present ADEV, an extension to forward-mode AD that correctly differentiates the expectations of probabilistic processes represented as programs that make random choices. Our algorithm is a source-to-source program transformation on an expressive, higher-order language for probabilistic computation, with both discrete and continuous probability distributions. The result of our transformation is a new probabilistic program, whose expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
