Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers

Jian Ma; Qirong Peng; Xujie Zhu; Peixing Xie; Chen Chen; Haonan Lu

arXiv:2511.16156·cs.CV·February 25, 2026

Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers

Jian Ma, Qirong Peng, Xujie Zhu, Peixing Xie, Chen Chen, Haonan Lu

PDF

Open Access 4 Models

TL;DR

This paper introduces PPCL, a structured pruning framework for Diffusion Transformers that reduces parameters by 50% with minimal performance loss, enabling efficient deployment in resource-limited settings.

Contribution

The paper proposes a novel pluggable pruning method with a contiguous layer distillation scheme tailored for Diffusion Transformers, allowing flexible compression without retraining for each ratio.

Findings

01

Achieves 50% parameter reduction with less than 3% performance degradation.

02

Maintains high-quality image generation after pruning.

03

Enables resource-efficient deployment of Diffusion Transformers.

Abstract

Diffusion Transformers (DiTs) have shown exceptional performance in image generation, yet their large parameter counts incur high computational costs, impeding deployment in resource-constrained settings. To address this, we propose Pluggable Pruning with Contiguous Layer Distillation (PPCL), a flexible structured pruning framework specifically designed for DiT architectures. First, we identify redundant layer intervals through a linear probing mechanism combined with the first-order differential trend analysis of similarity metrics. Subsequently, we propose a plug-and-play teacher-student alternating distillation scheme tailored to integrate depth-wise and width-wise pruning within a single training phase. This distillation framework enables flexible knowledge transfer across diverse pruning ratios, eliminating the need for per-configuration retraining. Extensive experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Enhancement Techniques · Advanced Memory and Neural Computing · Generative Adversarial Networks and Image Synthesis