AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

Zigeng Chen; Xinyin Ma; Gongfan Fang; Zhenxiong Tan; Xinchao Wang

arXiv:2406.06911·cs.CV·September 27, 2024·1 cites

AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

Zigeng Chen, Xinyin Ma, Gongfan Fang, Zhenxiong Tan, Xinchao Wang

PDF

Open Access 2 Repos

TL;DR

AsyncDiff introduces an asynchronous, parallel computation scheme for diffusion models, significantly reducing inference latency by dividing the model across devices and exploiting high similarity between diffusion steps.

Contribution

It presents a universal, plug-and-play acceleration method that enables model parallelism in diffusion models by transforming sequential denoising into an asynchronous process.

Findings

01

Achieves 2.7x speedup on Stable Diffusion v2.1 with negligible quality loss.

02

Attains 4.0x speedup with only 0.38 CLIP Score reduction.

03

Successfully applied to video diffusion models with promising results.

Abstract

Diffusion models have garnered significant interest from the community for their great generative ability across various applications. However, their typical multi-step sequential-denoising nature gives rise to high cumulative latency, thereby precluding the possibilities of parallel computation. To address this, we introduce AsyncDiff, a universal and plug-and-play acceleration scheme that enables model parallelism across multiple devices. Our approach divides the cumbersome noise prediction model into multiple components, assigning each to a different device. To break the dependency chain between these components, it transforms the conventional sequential denoising into an asynchronous process by exploiting the high similarity between hidden states in consecutive diffusion steps. Consequently, each component is facilitated to compute in parallel on separate devices. The proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques

MethodsContrastive Language-Image Pre-training · Diffusion