Lightweight Diffusion Models with Distillation-Based Block Neural Architecture Search
Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Yansong Tang, Wenwu zhu

TL;DR
This paper introduces DiffNAS, a neural architecture search method that reduces the size and computational cost of diffusion models by block-wise search and dynamic loss, maintaining or improving performance.
Contribution
It proposes a novel block-wise NAS approach with local search and dynamic loss for diffusion models, significantly reducing their computational complexity.
Findings
Achieves about 50% reduction in MACs and parameters.
Maintains or improves diffusion model performance.
Demonstrates effectiveness on latent diffusion models.
Abstract
Diffusion models have recently shown remarkable generation ability, achieving state-of-the-art performance in many tasks. However, the high computational cost is still a troubling problem for diffusion models. To tackle this problem, we propose to automatically remove the structural redundancy in diffusion models with our proposed Diffusion Distillation-based Block-wise Neural Architecture Search (DiffNAS). Specifically, given a larger pretrained teacher, we leverage DiffNAS to search for the smallest architecture which can achieve on-par or even better performance than the teacher. Considering current diffusion models are based on UNet which naturally has a block-wise structure, we perform neural architecture search independently in each block, which largely reduces the search space. Different from previous block-wise NAS methods, DiffNAS contains a block-wise local search strategy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Modeling in Engineering · Advanced Neuroimaging Techniques and Applications · Model Reduction and Neural Networks
MethodsDiffusion
