The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives
Ankur Moitra, Andrej Risteski, Dhruv Rohatgi

TL;DR
This paper explores the computational primitives needed for reward alignment in diffusion models, analyzing how different distance measures like KL and Wasserstein influence the algorithms and reward classes that can be efficiently implemented.
Contribution
It introduces a primitive-based framework for reward alignment, characterizing the necessary algorithms for different distribution distances and reward types.
Findings
KL-based alignment uses exponential tilts that are efficiently sampleable.
Wasserstein-based alignment employs proximal transport oracles for certain reward classes.
Choice of distance measure determines the computational primitive and reward class tractability.
Abstract
Inference-time reward alignment asks how to turn a pre-trained diffusion model with base law into a sampler that favors a reward while remaining close to . Since there is no canonical distributional distance for this closeness constraint, different choices lead to different "reward-aligned" laws and, just as importantly, different algorithmic problems. We develop a primitive-based approach to reward alignment: rather than assuming arbitrary reward-aligned laws can be sampled, we ask which simple algorithmic primitives suffice to implement alignment for non-trivial reward classes. If closeness is measured in KL distance, the target law is . For this setting, we show that linear exponential tilts of the form -- which according to recent work [MRR26] can be efficiently sampled from -- are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
