Learning in Integer Latent Variable Models with Nested Automatic   Differentiation

Daniel Sheldon; Kevin Winner; Debora Sujono

arXiv:1806.03207·stat.ML·June 11, 2018

Learning in Integer Latent Variable Models with Nested Automatic Differentiation

Daniel Sheldon, Kevin Winner, Debora Sujono

PDF

Open Access

TL;DR

This paper introduces advanced nested automatic differentiation algorithms for exact inference and learning in complex integer latent variable models, achieving faster, more stable, and polynomial-time computations for nested derivatives.

Contribution

The paper presents novel, efficient AD algorithms for integer latent variable models, enabling exact gradient computation and improved learning performance.

Findings

01

Faster and more stable AD algorithms for nested derivatives

02

Exact gradient computation for complex models

03

Polynomial-time complexity in nesting levels

Abstract

We develop nested automatic differentiation (AD) algorithms for exact inference and learning in integer latent variable models. Recently, Winner, Sujono, and Sheldon showed how to reduce marginalization in a class of integer latent variable models to evaluating a probability generating function which contains many levels of nested high-order derivatives. We contribute faster and more stable AD algorithms for this challenging problem and a novel algorithm to compute exact gradients for learning. These contributions lead to significantly faster and more accurate learning algorithms, and are the first AD algorithms whose running time is polynomial in the number of levels of nesting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Gaussian Processes and Bayesian Inference