Accelerated Diffusion Models via Speculative Sampling

Valentin De Bortoli; Alexandre Galashov; Arthur Gretton; Arnaud Doucet

arXiv:2501.05370·cs.LG·July 8, 2025

Accelerated Diffusion Models via Speculative Sampling

Valentin De Bortoli, Alexandre Galashov, Arthur Gretton, Arnaud Doucet

PDF

Open Access

TL;DR

This paper extends speculative sampling to diffusion models, enabling faster sample generation by reducing function evaluations while maintaining exactness, with strategies that do not require additional training.

Contribution

It introduces a novel speculative sampling method for diffusion models, including a training-free drafting strategy that accelerates inference without sacrificing sample quality.

Findings

01

Halves the number of function evaluations needed for sampling

02

Achieves significant speedup in diffusion model inference

03

Generates exact samples from the target diffusion model

Abstract

Speculative sampling is a popular technique for accelerating inference in Large Language Models by generating candidate tokens using a fast draft model and accepting or rejecting them based on the target model's distribution. While speculative sampling was previously limited to discrete sequences, we extend it to diffusion models, which generate samples via continuous, vector-valued Markov chains. In this context, the target model is a high-quality but computationally expensive diffusion model. We propose various drafting strategies, including a simple and effective approach that does not require training a draft model and is applicable out of the box to any diffusion model. Our experiments demonstrate significant generation speedup on various diffusion models, halving the number of function evaluations, while generating exact samples from the target model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference

MethodsDiffusion