LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models
Dingkun Zhang, Sijia Li, Chen Chen, Qingsong Xie, Haonan Lu

TL;DR
LAPTOP-Diff introduces an automatic layer pruning and normalized distillation method to efficiently compress diffusion models, maintaining high performance with significantly reduced model size and improved scalability over previous handcrafted approaches.
Contribution
The paper presents a novel one-shot layer pruning criterion and normalized feature distillation, enhancing diffusion model compression with better performance and scalability.
Findings
Achieved only 4.0% decline in PickScore at 50% pruning ratio
Outperformed existing methods with less performance loss
Successfully compressed SDXL and SDM-v1.5 models
Abstract
In the era of AIGC, the demand for low-budget or even on-device applications of diffusion models emerged. In terms of compressing the Stable Diffusion models (SDMs), several approaches have been proposed, and most of them leveraged the handcrafted layer removal methods to obtain smaller U-Nets, along with knowledge distillation to recover the network performance. However, such a handcrafting manner of layer removal is inefficient and lacks scalability and generalization, and the feature distillation employed in the retraining phase faces an imbalance issue that a few numerically significant feature loss terms dominate over others throughout the retraining process. To this end, we proposed the layer pruning and normalized distillation for compressing diffusion models (LAPTOP-Diff). We, 1) introduced the layer pruning method to compress SDM's U-Net automatically and proposed an effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Modeling in Engineering
MethodsConvolution · Concatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Pruning · Max Pooling · U-Net · Knowledge Distillation · Diffusion
