$\Delta$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers
Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao, Chongjun Tu,, Christos-Savvas Bouganis, Yiren Zhao, Tao Chen

TL;DR
This paper introduces $ ext{Delta}$-DiT, a training-free inference acceleration framework for diffusion transformers that leverages block-specific caching to significantly speed up image generation without sacrificing quality.
Contribution
The paper proposes a novel, training-free acceleration method for diffusion transformers, utilizing a cache mechanism tailored to DiT architecture and insights into block-function correlations.
Findings
Achieves 1.6x speedup on 20-step generation
Improves performance in most cases with acceleration
Outperforms existing methods at 4-step generation
Abstract
Diffusion models are widely recognized for generating high-quality and diverse images, but their poor real-time performance has led to numerous acceleration works, primarily focusing on UNet-based structures. With the more successful results achieved by diffusion transformers (DiT), there is still a lack of exploration regarding the impact of DiT structure on generation, as well as the absence of an acceleration framework tailored to the DiT architecture. To tackle these challenges, we conduct an investigation into the correlation between DiT blocks and image generation. Our findings reveal that the front blocks of DiT are associated with the outline of the generated images, while the rear blocks are linked to the details. Based on this insight, we propose an overall training-free inference acceleration framework -DiT: using a designed cache mechanism to accelerate the rear DiT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
