Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network

Grigory Bartosh; David Ruhe; Emiel Hoogeboom; Jonathan Heek; Thomas Mensink; Tim Salimans

arXiv:2605.18190·cs.LG·May 19, 2026

Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network

Grigory Bartosh, David Ruhe, Emiel Hoogeboom, Jonathan Heek, Thomas Mensink, Tim Salimans

PDF

TL;DR

Dual-Rate Diffusion accelerates diffusion model sampling by interleaving a heavy context encoder with a lightweight denoising network, reducing computation while maintaining quality.

Contribution

It introduces a novel interleaved heavy-light network approach to significantly speed up diffusion sampling without quality loss.

Findings

01

Reduces computational cost by 2-4 times on ImageNet benchmarks.

02

Maintains sample quality comparable to standard diffusion models.

03

Compatible with distillation techniques for further efficiency.

Abstract

Diffusion models achieve state-of-the-art generative performance but suffer from high computational costs during inference due to the repeated evaluation of a heavy neural network. In this work, we propose Dual-Rate Diffusion, a method to accelerate sampling by interleaving the execution of a heavy high-capacity context encoder and a light efficient denoising model. The context encoder is evaluated sparsely to extract high-dimensional features, which are effectively reused by the light denoising model at every step to refine the sample efficiently. This approach significantly accelerates inference without compromising sample quality. On ImageNet benchmarks, Dual-Rate Diffusion matches the performance of standard baselines while reducing computational cost by a factor of $2$ - $4$ . Furthermore, we demonstrate that our method is compatible with distillation techniques, such as Moment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.