ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule

Yilie Huang; Wenpin Tang; Xunyu Zhou

arXiv:2601.18681·cs.LG·May 11, 2026

ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule

Yilie Huang, Wenpin Tang, Xunyu Zhou

PDF

TL;DR

This paper introduces ART, an adaptive time discretization method for diffusion models, optimized via reinforcement learning, leading to improved sample quality and transferability across datasets.

Contribution

The paper proposes ART and ART-RL, novel methods for adaptive time discretization in diffusion models, bridging deterministic control and reinforcement learning for improved sampling.

Findings

01

ART improves FID scores on CIFAR-10 across various budgets.

02

The deterministic ART schedule transfers effectively to other datasets without retraining.

03

ART-RL enhances diffusion sampling by optimizing timestep schedules via reinforcement learning.

Abstract

We consider time discretization for score-based diffusion models to generate samples from a learned reverse-time dynamic on a finite grid. Uniform and hand-crafted grids can be suboptimal given a budget on the number of time steps. We introduce Adaptive Reparameterized Time (ART), which controls the clock speed of a reparameterized time variable to redistribute computation along the sampling trajectory while preserving the terminal time, with the objective of minimizing the aggregate Euler discretization error. We derive a randomized companion ART-RL that recasts ART as a continuous-time reinforcement learning problem with Gaussian policies, and prove a two-directional bridge between the two: the deterministic ART optimum lifts to an optimal Gaussian policy, and conversely any optimal Gaussian policy must recover the ART control through its mean. This bridge turns continuous-time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.