SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer
Tong Shao, Yusen Fu, Guoying Sun, Jingde Kong, Zhuotao Tian, Jingyong Su

TL;DR
SODA is a novel method that adaptively accelerates diffusion transformers by dynamically balancing caching and pruning based on fine-grained sensitivity analysis, significantly improving inference efficiency without sacrificing quality.
Contribution
SODA introduces a sensitivity-oriented framework that adaptively configures caching and pruning, overcoming limitations of fixed strategies and enhancing diffusion transformer acceleration.
Findings
Achieves state-of-the-art fidelity at high acceleration ratios.
Effectively balances acceleration and quality through sensitivity-aware optimization.
Demonstrates superior performance on multiple diffusion models.
Abstract
Diffusion Transformers have become a dominant paradigm in visual generation, yet their low inference efficiency remains a key bottleneck hindering further advancement. Among common training-free techniques, caching offers high acceleration efficiency but often compromises fidelity, whereas pruning shows the opposite trade-off. Integrating caching with pruning achieves a balance between acceleration and generation quality. However, existing methods typically employ fixed and heuristic schemes to configure caching and pruning strategies. While they roughly follow the overall sensitivity trend of generation models to acceleration, they fail to capture fine-grained and complex variations, inevitably skipping highly sensitive computations and leading to quality degradation. Furthermore, such manually designed strategies exhibit poor generalization. To address these issues, we propose SODA, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis
