Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition
Zhiyuan Chen, Keyi Li, Yifan Jia, Le Ye, Yufei Ma

TL;DR
This paper introduces an increment-calibrated caching method with channel-aware SVD to accelerate diffusion transformer models, reducing computation while maintaining high image quality without additional training.
Contribution
The proposed training-free acceleration method uses low-rank approximation and channel-aware SVD to improve caching effectiveness in diffusion transformers.
Findings
Achieves over 45% reduction in computation compared to 35-step DDIM.
Improves Inception Score (IS) by 12 points.
Maintains low FID increase of less than 0.06 while accelerating inference.
Abstract
Diffusion transformer (DiT) models have achieved remarkable success in image generation, thanks for their exceptional generative capabilities and scalability. Nonetheless, the iterative nature of diffusion models (DMs) results in high computation complexity, posing challenges for deployment. Although existing cache-based acceleration methods try to utilize the inherent temporal similarity to skip redundant computations of DiT, the lack of correction may induce potential quality degradation. In this paper, we propose increment-calibrated caching, a training-free method for DiT acceleration, where the calibration parameters are generated from the pre-trained model itself with low-rank approximation. To deal with the possible correction failure arising from outlier activations, we introduce channel-aware Singular Value Decomposition (SVD), which further strengthens the calibration effect.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Advanced Data Compression Techniques
MethodsDiffusion
