Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review
Masatoshi Uehara, Yulai Zhao, Chenyu Wang, Xiner Li, Aviv Regev,, Sergey Levine, Tommaso Biancalani

TL;DR
This tutorial reviews inference-time alignment methods for diffusion models, highlighting techniques to optimize downstream rewards without fine-tuning, and introduces new algorithms and connections to other AI fields.
Contribution
It provides a unified perspective on inference-time guidance in diffusion models, introduces novel algorithms, and discusses extensions like search algorithms and language model connections.
Findings
Current methods approximate soft optimal denoising processes.
Novel algorithms for inference-time guidance are proposed.
Connections between language models and diffusion models are discussed.
Abstract
This tutorial provides an in-depth guide on inference-time guidance and alignment methods for optimizing downstream reward functions in diffusion models. While diffusion models are renowned for their generative modeling capabilities, practical applications in fields such as biology often require sample generation that maximizes specific metrics (e.g., stability, affinity in proteins, closeness to target structures). In these scenarios, diffusion models can be adapted not only to generate realistic samples but also to explicitly maximize desired measures at inference time without fine-tuning. This tutorial explores the foundational aspects of such inference-time algorithms. We review these methods from a unified perspective, demonstrating that current techniques -- such as Sequential Monte Carlo (SMC)-based guidance, value-based sampling, and classifier guidance -- aim to approximate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks
MethodsSoftmax · Attention Is All You Need · Diffusion
