Inference-Time Alignment in Diffusion Models with Reward-Guided   Generation: Tutorial and Review

Masatoshi Uehara; Yulai Zhao; Chenyu Wang; Xiner Li; Aviv Regev,; Sergey Levine; Tommaso Biancalani

arXiv:2501.09685·cs.AI·January 22, 2025

Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review

Masatoshi Uehara, Yulai Zhao, Chenyu Wang, Xiner Li, Aviv Regev,, Sergey Levine, Tommaso Biancalani

PDF

Open Access

TL;DR

This tutorial reviews inference-time alignment methods for diffusion models, highlighting techniques to optimize downstream rewards without fine-tuning, and introduces new algorithms and connections to other AI fields.

Contribution

It provides a unified perspective on inference-time guidance in diffusion models, introduces novel algorithms, and discusses extensions like search algorithms and language model connections.

Findings

01

Current methods approximate soft optimal denoising processes.

02

Novel algorithms for inference-time guidance are proposed.

03

Connections between language models and diffusion models are discussed.

Abstract

This tutorial provides an in-depth guide on inference-time guidance and alignment methods for optimizing downstream reward functions in diffusion models. While diffusion models are renowned for their generative modeling capabilities, practical applications in fields such as biology often require sample generation that maximizes specific metrics (e.g., stability, affinity in proteins, closeness to target structures). In these scenarios, diffusion models can be adapted not only to generate realistic samples but also to explicitly maximize desired measures at inference time without fine-tuning. This tutorial explores the foundational aspects of such inference-time algorithms. We review these methods from a unified perspective, demonstrating that current techniques -- such as Sequential Monte Carlo (SMC)-based guidance, value-based sampling, and classifier guidance -- aim to approximate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks

MethodsSoftmax · Attention Is All You Need · Diffusion