Timestep-Aware Block Masking for Efficient Diffusion Model Inference
Haodong He, Yuan Gao, Weizhong Zhang, Gui-Song Xia

TL;DR
This paper introduces a timestep-aware block masking framework that dynamically optimizes the computational graph of diffusion models during inference, significantly reducing latency while maintaining high image quality.
Contribution
It proposes a novel, memory-efficient method to learn timestep-specific masks for diffusion models, improving inference speed without sacrificing performance.
Findings
Achieves faster sampling with minimal quality loss.
Demonstrates effectiveness across multiple diffusion architectures.
Introduces a timestep-aware loss and mask rectification strategy.
Abstract
Diffusion Probabilistic Models (DPMs) have achieved great success in image generation but suffer from high inference latency due to their iterative denoising nature. Motivated by the evolving feature dynamics across the denoising trajectory, we propose a novel framework to optimize the computational graph of pre-trained DPMs on a per-timestep basis. By learning timestep-specific masks, our method dynamically determines which blocks to execute or bypass through feature reuse at each inference stage. Unlike global optimization methods that incur prohibitive memory costs via full-chain backpropagation, our method optimizes masks for each timestep independently, ensuring a memory-efficient training process. To guide this process, we introduce a timestep-aware loss scaling mechanism that prioritizes feature fidelity during sensitive denoising phases, complemented by a knowledge-guided mask…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Advanced Neural Network Applications
