LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing   Diffusion Models

Dingkun Zhang; Sijia Li; Chen Chen; Qingsong Xie; Haonan Lu

arXiv:2404.11098·cs.CV·March 6, 2025·1 cites

LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models

Dingkun Zhang, Sijia Li, Chen Chen, Qingsong Xie, Haonan Lu

PDF

Open Access

TL;DR

LAPTOP-Diff introduces an automatic layer pruning and normalized distillation method to efficiently compress diffusion models, maintaining high performance with significantly reduced model size and improved scalability over previous handcrafted approaches.

Contribution

The paper presents a novel one-shot layer pruning criterion and normalized feature distillation, enhancing diffusion model compression with better performance and scalability.

Findings

01

Achieved only 4.0% decline in PickScore at 50% pruning ratio

02

Outperformed existing methods with less performance loss

03

Successfully compressed SDXL and SDM-v1.5 models

Abstract

In the era of AIGC, the demand for low-budget or even on-device applications of diffusion models emerged. In terms of compressing the Stable Diffusion models (SDMs), several approaches have been proposed, and most of them leveraged the handcrafted layer removal methods to obtain smaller U-Nets, along with knowledge distillation to recover the network performance. However, such a handcrafting manner of layer removal is inefficient and lacks scalability and generalization, and the feature distillation employed in the retraining phase faces an imbalance issue that a few numerically significant feature loss terms dominate over others throughout the retraining process. To this end, we proposed the layer pruning and normalized distillation for compressing diffusion models (LAPTOP-Diff). We, 1) introduced the layer pruning method to compress SDM's U-Net automatically and proposed an effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Mathematical Modeling in Engineering

MethodsConvolution · Concatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Pruning · Max Pooling · U-Net · Knowledge Distillation · Diffusion