From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

Julius Berner; Lorenz Richter; Marcin Sendera; Jarrid Rector-Brooks; Nikolay Malkin

arXiv:2501.06148·cs.LG·February 5, 2026

From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

Julius Berner, Lorenz Richter, Marcin Sendera, Jarrid Rector-Brooks, Nikolay Malkin

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a new approach to training neural diffusion models for sampling from Boltzmann distributions, leveraging asymptotic equivalences and optimized discretization to improve efficiency and performance.

Contribution

It establishes theoretical links between entropic RL and continuous-time diffusion models, and demonstrates how coarse discretization enhances training efficiency and reduces computational costs.

Findings

01

Theoretical equivalences between objectives in infinitesimal discretization limit.

02

Coarse discretization improves sample efficiency during training.

03

Achieves competitive sampling performance with lower computational cost.

Abstract

We study the problem of training neural stochastic differential equations, or diffusion models, to sample from a Boltzmann distribution without access to target samples. Existing methods for training such models enforce time-reversal of the generative and noising processes, using either differentiable simulation or off-policy reinforcement learning (RL). We prove equivalences between families of objectives in the limit of infinitesimal discretization steps, linking entropic RL methods (GFlowNets) with continuous-time objects (partial differential equations and path space measures). We further show that an appropriate choice of coarse time discretization during training allows greatly improved sample efficiency and the use of time-local objectives, achieving competitive performance on standard sampling benchmarks with reduced computational cost.

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 5Confidence 3

Strengths

1. The paper is very well written and easy to follow, with clear exposition of the mathematical derivations and the empirical results. 2. The experimental section is thorough and well designed, exploring the effects of different discretization strategies and their impact on performance in detail. The benchmarks used are diverse and represent a wide range of sampling challenges. 3. The work provides strong empirical evidence that non-uniform time discretization (particularly random placement) imp

Weaknesses

1. While the theoretical contributions are valuable and provide an interesting link between discrete-time and continuous-time objectives, they are not completely unexpected and partly already present in the literature. 2. In the experimental results, it is noted that the ELBO gap does not converge to zero as the discretization becomes finer but instead appears to stabilize at a positive value. The authors do not give an explanation for this phenomenon. In particular, the lack of a "benchmark" ma

Reviewer 02Rating 5Confidence 4

Strengths

1. The approach of linking discrete-time policy objectives with continuous-time SDE training is a useful idea, albeit heavily reliant on established results. 2. Authors show that this method potentially reduces computational costs for neural SDE training.

Weaknesses

1. Firstly, I think the presentation of this work remains a major bottleneck for readers. Section 2 is preliminary, and it spans from pages 3 to 7. Such a lengthy preliminary section introduces well-known equations and results (e.g., equations (4)-(6) from GFlowNet papers, (9)-(15) from stochastic control and diffusion models, and (16), (17) as standard Euler-Maruyama discretizations). These derivations, mostly grounded in existing work, dilute the contributions and add an undue burden for read

Reviewer 03Rating 5Confidence 2

Strengths

The paper appears to be mathematically rigorous and experiments appear to give credence to the authors' work.

Weaknesses

I found the paper very difficult to read. The notation is dense, not all appears to be defined, some is non-standard and unclear and I found it a little tricky to understand exactly what the authors wanted to do. It may be that the authors have solved in interesting problem in a genuinely useful way but that was unclear from the paper. All but the very expert reader would, in my view, find the paper a difficult read. A few specific comments are: - Abstract could be more informative and precis

Code & Models

Repositories

gfnorg/gfn-diffusion
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Mathematical Modeling in Engineering · Mathematical Biology Tumor Growth

MethodsDiffusion