ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation
Xiaomeng Yang, Lei Lu, Qihui Fan, Changdi Yang, Juyi Lin, Yanzhi Wang, Xuan Zhang, Shangqian Gao

TL;DR
ALTER introduces a unified approach combining layer pruning and temporal expert routing with a trainable hypernetwork, significantly reducing computational costs of diffusion models while maintaining high image quality.
Contribution
It presents a novel single-stage optimization framework that jointly prunes layers and manages expert routing during diffusion model fine-tuning, addressing prior sub-optimal sequential strategies.
Findings
Achieves 3.64x speedup with 35% sparsity.
Maintains original model fidelity with only 20 inference steps.
Reduces MACs to 25.9% of the original while preserving quality.
Abstract
Diffusion models have demonstrated exceptional capabilities in generating high-fidelity images. However, their iterative denoising process results in significant computational overhead during inference, limiting their practical deployment in resource-constrained environments. Existing acceleration methods often adopt uniform strategies that fail to capture the temporal variations during diffusion generation, while the commonly adopted sequential pruning-then-fine-tuning strategy suffers from sub-optimality due to the misalignment between pruning decisions made on pretrained weights and the model's final parameters. To address these limitations, we introduce ALTER: All-in-One Layer Pruning and Temporal Expert Routing, a unified framework that transforms diffusion models into a mixture of efficient temporal experts. ALTER achieves a single-stage optimization that unifies layer pruning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Advanced Neural Network Applications
