Steering diffusion models with quadratic rewards: a fine-grained analysis

Ankur Moitra; Andrej Risteski; Dhruv Rohatgi

arXiv:2602.16570·cs.LG·February 19, 2026

Steering diffusion models with quadratic rewards: a fine-grained analysis

Ankur Moitra, Andrej Risteski, Dhruv Rohatgi

PDF

Open Access

TL;DR

This paper analyzes the computational complexity of sampling from reward-tilted diffusion models with quadratic rewards, providing efficient algorithms for certain cases and proving intractability for others.

Contribution

It introduces a detailed analysis of quadratic reward tilts in diffusion models, including a new algorithm for low-rank positive-definite cases and complexity results for negative-definite cases.

Findings

01

Linear quadratic rewards are always efficiently sampleable.

02

New algorithm based on Hubbard-Stratonovich transform for low-rank positive-definite tilts.

03

Negative-definite quadratic tilts are computationally intractable even at rank 1.

Abstract

Inference-time algorithms are an emerging paradigm in which pre-trained models are used as subroutines to solve downstream tasks. Such algorithms have been proposed for tasks ranging from inverse problems and guided image generation to reasoning. However, the methods currently deployed in practice are heuristics with a variety of failure modes -- and we have very little understanding of when these heuristics can be efficiently improved. In this paper, we consider the task of sampling from a reward-tilted diffusion model -- that is, sampling from $p^{⋆} (x) \propto p (x) exp (r (x))$ -- given a reward function $r$ and pre-trained diffusion oracle for $p$ . We provide a fine-grained analysis of the computational tractability of this task for quadratic rewards $r (x) = x^{⊤} A x + b^{⊤} x$ . We show that linear-reward tilts are always efficiently sampleable -- a simple result that seems…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods