Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion
Samarth N Ramesh, Zhixue Zhao

TL;DR
This paper investigates post-training pruning techniques for Stable Diffusion 2, revealing that simple magnitude pruning can effectively reduce model size with minimal quality loss, and uncovers critical insights into information encoding in text-to-image models.
Contribution
It is the first comprehensive study on pruning multi-modal text-to-image models, comparing component-wise pruning strategies and revealing unexpected results about pruning effectiveness.
Findings
Simple magnitude pruning outperforms advanced methods in text-to-image models.
Stable Diffusion 2 can be pruned to 38.5% sparsity with minimal quality loss.
Pruning beyond certain thresholds causes significant performance drops, indicating critical weight importance.
Abstract
As text-to-image models grow increasingly powerful and complex, their burgeoning size presents a significant obstacle to widespread adoption, especially on resource-constrained devices. This paper presents a pioneering study on post-training pruning of Stable Diffusion 2, addressing the critical need for model compression in text-to-image domain. Our study tackles the pruning techniques for the previously unexplored multi-modal generation models, and particularly examines the pruning impact on the textual component and the image generation component separately. We conduct a comprehensive comparison on pruning the model or the single component of the model in various sparsities. Our results yield previously undocumented findings. For example, contrary to established trends in language model pruning, we discover that simple magnitude pruning outperforms more advanced techniques in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Computer Graphics and Visualization Techniques · Video Analysis and Summarization
MethodsPruning · Diffusion
