Attention for Inference Compilation

William Harvey; Andreas Munk; At{\i}l{\i}m G\"une\c{s} Baydin,; Alexander Bergholm; Frank Wood

arXiv:1910.11961·cs.LG·October 29, 2019·1 cites

Attention for Inference Compilation

William Harvey, Andreas Munk, At{\i}l{\i}m G\"une\c{s} Baydin,, Alexander Bergholm, Frank Wood

PDF

Open Access

TL;DR

This paper introduces an attention mechanism in inference compilation for probabilistic programming, improving the modeling of long-range dependencies and enhancing posterior approximation accuracy.

Contribution

It proposes a novel attention-based neural network architecture for inference compilation, addressing limitations of previous models in capturing long-range dependencies.

Findings

01

Attention improves posterior approximation in probabilistic programs.

02

Enhanced inference performance over existing methods.

03

Better modeling of long-range dependencies in latent variables.

Abstract

We present a new approach to automatic amortized inference in universal probabilistic programs which improves performance compared to current methods. Our approach is a variation of inference compilation (IC) which leverages deep neural networks to approximate a posterior distribution over latent variables in a probabilistic program. A challenge with existing IC network architectures is that they can fail to model long-range dependencies between latent variables. To address this, we introduce an attention mechanism that attends to the most salient variables previously sampled in the execution of a probabilistic program. We demonstrate that the addition of attention allows the proposal distributions to better match the true posterior, enhancing inference about latent variables in simulators.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification