Amortizing intractable inference in diffusion models for vision, language, and control
Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin, Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel, Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth,, Nikolay Malkin

TL;DR
This paper introduces a theoretically grounded method called relative trajectory balance for amortized inference in diffusion models, enabling efficient posterior sampling in vision, language, and control tasks, with broad practical applications.
Contribution
It proposes a new asymptotically correct learning objective for diffusion-based posterior inference, extending the use of deep reinforcement learning techniques to improve mode coverage.
Findings
Unbiased inference of arbitrary posteriors demonstrated in vision, language, and multimodal tasks.
Achieved state-of-the-art results in offline reinforcement learning using diffusion-based behavior priors.
Theoretical proof of asymptotic correctness for the proposed learning objective.
Abstract
Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, , in a model that consists of a diffusion generative model prior and a black-box constraint or likelihood function . We state and prove the asymptotic correctness of a data-free learning objective, relative trajectory balance, for training a diffusion model that samples from this posterior, a problem that existing methods solve only approximately or in restricted cases. Relative trajectory balance arises from the generative flow network perspective on diffusion models, which allows the use of deep reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
