Training-Free Adaptive Diffusion with Bounded Difference Approximation   Strategy

Hancheng Ye; Jiakang Yuan; Renqiu Xia; Xiangchao Yan; Tao Chen; Junchi; Yan; Botian Shi; Bo Zhang

arXiv:2410.09873·cs.CV·October 15, 2024

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Hancheng Ye, Jiakang Yuan, Renqiu Xia, Xiangchao Yan, Tao Chen, Junchi, Yan, Botian Shi, Bo Zhang

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces AdaptiveDiffusion, a method that adaptively skips noise prediction steps in diffusion models, significantly speeding up image and video synthesis without sacrificing quality.

Contribution

The paper presents a novel adaptive skipping strategy guided by latent differences, enabling faster diffusion without loss of output quality.

Findings

01

Achieves 2-5x speedup in denoising process

02

Maintains identical results to full-step diffusion

03

Applicable to both image and video diffusion models

Abstract

Diffusion models have recently achieved great success in the synthesis of high-quality images and videos. However, the existing denoising techniques in diffusion models are commonly based on step-by-step noise predictions, which suffers from high computation cost, resulting in a prohibitive latency for interactive applications. In this paper, we propose AdaptiveDiffusion to relieve this bottleneck by adaptively reducing the noise prediction steps during the denoising process. Our method considers the potential of skipping as many noise prediction steps as possible while keeping the final denoised results identical to the original full-step ones. Specifically, the skipping strategy is guided by the third-order latent difference that indicates the stability between timesteps during the denoising process, which benefits the reusing of previous noise prediction results. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

unimodal4reasoning/adaptivediffusion
jaxOfficial

Datasets

HankYe/Sampled_AIGCBench_text2image_ar_0.625
dataset· 29 dl
29 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Neural Networks and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion