Faster Diffusion via Temporal Attention Decomposition
Haozhe Liu, Wentian Zhang, Jinheng Xie, Francesco Faccio, Mengmeng Xu,, Tao Xiang, Mike Zheng Shou, Juan-Manuel Perez-Rua, J\"urgen Schmidhuber

TL;DR
This paper investigates the role of attention mechanisms in diffusion models during inference, revealing a two-phase process and proposing a simple, training-free method called TGATE that accelerates image generation by 10-50%.
Contribution
It introduces TGATE, a novel attention gating technique that improves inference speed without additional training in text-conditional diffusion models.
Findings
Cross-attention converges to a fixed point after several steps.
Inference divides into planning and fidelity phases based on attention roles.
TGATE accelerates diffusion models by 10-50%.
Abstract
We explore the role of attention mechanism during inference in text-conditional diffusion models. Empirical observations suggest that cross-attention outputs converge to a fixed point after several inference steps. The convergence time naturally divides the entire inference process into two phases: an initial phase for planning text-oriented visual semantics, which are then translated into images in a subsequent fidelity-improving phase. Cross-attention is essential in the initial phase but almost irrelevant thereafter. However, self-attention initially plays a minor role but becomes crucial in the second phase. These findings yield a simple and training-free method known as temporally gating the attention (TGATE), which efficiently generates images by caching and reusing attention outputs at scheduled time steps. Experimental results show when widely applied to various existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Radiomics and Machine Learning in Medical Imaging · Image Retrieval and Classification Techniques
MethodsDiffusion
