Sublinear iterations can suffice even for DDPMs
Matthew S. Zhang, Stephen Huan, Jerry Huang, Nicholas M. Boffi, Sitan Chen, Sinho Chewi

TL;DR
This paper introduces a new randomized midpoint integrator for DDPMs, achieving sublinear complexity in sample generation, and demonstrates its theoretical advantages and practical effectiveness in image synthesis.
Contribution
It presents the first sublinear complexity bound for pure DDPM sampling using a novel randomized midpoint method and the shifted composition rule framework.
Findings
Sublinear $ ilde{O}(\sqrt{d})$ score evaluations suffice for convergence.
The proposed method outperforms traditional discretization approaches.
Experimental results confirm practical efficiency in image synthesis.
Abstract
SDE-based methods such as denoising diffusion probabilistic models (DDPMs) have shown remarkable success in real-world sample generation tasks. Prior analyses of DDPMs have been focused on the exponential Euler discretization, showing guarantees that generally depend at least linearly on the dimension or initial Fisher information. Inspired by works in log-concave sampling (Shen and Lee, 2019), we analyze an integrator -- the denoising diffusion randomized midpoint method (DDRaM) -- that leverages an additional randomized midpoint to better approximate the SDE. Using a recently-developed analytic framework called the "shifted composition rule", we show that this algorithm enjoys favorable discretization properties under appropriate smoothness assumptions, with sublinear score evaluations needed to ensure convergence. This is the first sublinear complexity bound…
Peer Reviews
Decision·Submitted to ICLR 2026
- As far as I know, this is indeed the first $O(\sqrt{d})$ order error bound for a stochastic sampler in the diffusion model area. - The paper is clearly written, and the idea is easy to follow.
- In the paper of Shen & Lee (2019), their analysis requires the target distribution to be a log-concave one. I am not sure if the Assumptions 1,2,3 in this paper can lead to the conclusion that the marginal distribution will be log-concave. Or could you explain how you could surpass this condition? - The experiments validate the usage of the DDRaM method, but it does not involve popular stochastic samplers like DDPM itself, EDM-stochastic, PNDM-Stochastic, and DPM-Solver-Stochastic for comparis
1. The biggest contribution is providing the first sublinear O(sqrt(d)) proof for DDPM (SDE). This is a very important step forward in diffusion model theory. Many prior works (including Li & Jiao, ICLR 2025) achieved sublinear complexity for ODE (DDIM), but they could not solve the problem for DDPM because of its stochasticity. This paper successfully fills this important theoretical gap. 2. Besides the theory, the paper shows that the proposed DDRaM method works well in practice. In experimen
My only one concern is about the decreasing importance of the DDPM sampler itself. In practice, many researchers are trying to develop samplers with very small NFE (like DDIM, DPM-Solvers) to make generation faster. Or, they train the model differently from the beginning (like Consistency Models or Rectified Flow). It is clear that proving sublinear complexity for DDPM was a very difficult problem, but I am a little unsure if solving this problem is as important as prior works on DDIM, which is
(1) The paper analyzes a stochastic sampler with the random point method, and proves that the KL and $W_2$ divergence can be controlled with iteration complexity that has a sublinear dependence on the dimension $d$. (2) It conducts experiments to demonstrate the superiority of the randomized point method.
(1) The equations for reverse SDE seem incorrect. It can be seen from related works, e.g., [1], that there is no $\gamma$ there. (2) The keyword, DDPM, is not very accurate. Actually, it should be termed as sampling diffusion with stochastic samplers, or SDE. This is because DDPM is only a special case of score-based diffusion when taking the limit in the length of time steps. However, the denoising diffusion model introduced in this paper is more related to the score-based SDE, instead of the
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods
