Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity
Haoxuan Chen, Yinuo Ren, Lexing Ying, Grant M. Rotskoff

TL;DR
This paper introduces a novel parallel sampling method for diffusion models that achieves sub-linear time complexity in data dimension, significantly reducing inference costs while maintaining theoretical guarantees.
Contribution
The paper presents the first provably sub-linear time sampling algorithm for diffusion models using parallel Picard iterations and rigorous theoretical analysis.
Findings
Achieves ( ext{poly} \, ext{log} \, d) time complexity
Compatible with SDE and probability flow ODE implementations
Enables fast high-dimensional data sampling on GPU clusters
Abstract
Diffusion models have become a leading method for generative modeling of both image and scientific data. As these models are costly to train and \emph{evaluate}, reducing the inference cost for diffusion models remains a major goal. Inspired by the recent empirical success in accelerating diffusion models via the parallel sampling technique~\cite{shih2024parallel}, we propose to divide the sampling process into blocks with parallelizable Picard iterations within each block. Rigorous theoretical analysis reveals that our algorithm achieves overall time complexity, marking \emph{the first implementation with provable sub-linear complexity w.r.t. the data dimension }. Our analysis is based on a generalized version of Girsanov's theorem and is compatible with both the SDE and probability flow ODE implementations. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Markov Chains and Monte Carlo Methods
MethodsDiffusion · Parsing Incrementally for Constrained Auto-Regressive Decoding
