Amortizing intractable inference in diffusion models for vision,   language, and control

Siddarth Venkatraman; Moksh Jain; Luca Scimeca; Minsu Kim; Marcin; Sendera; Mohsin Hasan; Luke Rowe; Sarthak Mittal; Pablo Lemos; Emmanuel; Bengio; Alexandre Adam; Jarrid Rector-Brooks; Yoshua Bengio; Glen Berseth,; Nikolay Malkin

arXiv:2405.20971·cs.LG·January 14, 2025

Amortizing intractable inference in diffusion models for vision, language, and control

Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin, Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel, Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth,, Nikolay Malkin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a theoretically grounded method called relative trajectory balance for amortized inference in diffusion models, enabling efficient posterior sampling in vision, language, and control tasks, with broad practical applications.

Contribution

It proposes a new asymptotically correct learning objective for diffusion-based posterior inference, extending the use of deep reinforcement learning techniques to improve mode coverage.

Findings

01

Unbiased inference of arbitrary posteriors demonstrated in vision, language, and multimodal tasks.

02

Achieved state-of-the-art results in offline reinforcement learning using diffusion-based behavior priors.

03

Theoretical proof of asymptotic correctness for the proposed learning objective.

Abstract

Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $x \sim p^{post} (x) \propto p (x) r (x)$ , in a model that consists of a diffusion generative model prior $p (x)$ and a black-box constraint or likelihood function $r (x)$ . We state and prove the asymptotic correctness of a data-free learning objective, relative trajectory balance, for training a diffusion model that samples from this posterior, a problem that existing methods solve only approximately or in restricted cases. Relative trajectory balance arises from the generative flow network perspective on diffusion models, which allows the use of deep reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gfnorg/diffusion-finetuning
pytorchOfficial

Videos

Amortizing intractable inference in diffusion models for vision, language, and control· slideslive

Taxonomy

TopicsNeural Networks and Applications

MethodsDiffusion