Iterative Tilting for Diffusion Fine-Tuning

Jean Pachebat; Giovanni Conforti; Alain Durmus; Yazid Janati

arXiv:2512.03234·stat.ML·December 4, 2025

Iterative Tilting for Diffusion Fine-Tuning

Jean Pachebat, Giovanni Conforti, Alain Durmus, Yazid Janati

PDF

Open Access

TL;DR

This paper presents iterative tilting, a gradient-free approach for fine-tuning diffusion models towards reward-tilted distributions by decomposing large tilts into smaller, tractable steps validated on a Gaussian mixture example.

Contribution

The paper introduces a novel gradient-free method for diffusion model fine-tuning that avoids backpropagation through sampling chains by decomposing reward tilts into smaller steps.

Findings

01

Validated on a Gaussian mixture with linear reward

02

Achieves tractable score updates via Taylor expansion

03

Demonstrates effectiveness without backpropagation

Abstract

We introduce iterative tilting, a gradient-free method for fine-tuning diffusion models toward reward-tilted distributions. The method decomposes a large reward tilt $exp (λ r)$ into $N$ sequential smaller tilts, each admitting a tractable score update via first-order Taylor expansion. This requires only forward evaluations of the reward function and avoids backpropagating through sampling chains. We validate on a two-dimensional Gaussian mixture with linear reward, where the exact tilted distribution is available in closed form.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference