CORP: Closed-Form One-shot Representation-Preserving Structured Pruning for Transformers
Boxiang Zhang, Baijian Yang

TL;DR
CORP is a novel one-shot structured pruning method for transformers that uses closed-form solutions to remove components without retraining, maintaining high accuracy with significant sparsity.
Contribution
It introduces a representation recovery-based pruning approach that operates in a single step without gradients or fine-tuning, unlike prior multi-stage methods.
Findings
CORP effectively prunes 50% of MLP and attention structures with minimal accuracy loss.
Experiments on ImageNet with DeiT show strong redundancy in transformer representations.
On DeiT-Huge, CORP achieves 83.27% Top-1 accuracy after 50% pruning.
Abstract
Transformers achieve strong accuracy but incur high compute and memory cost. Structured pruning reduces inference cost, but most methods rely on retraining or multi-stage optimization, which limits post-training deployment. We propose CORP, a closed-form one-shot structured pruning method that removes MLP dimensions and attention substructures using only unlabeled calibration data without gradients or fine-tuning. CORP formulates structured pruning as a representation recovery problem. It models removed components as affine functions of retained components and derives closed-form ridge regression solutions that fold compensation into model weights. This minimizes a layer-local affine/logit reconstruction objective under the calibration distribution. Experiments on ImageNet with DeiT reveal strong redundancy in both MLP and attention representations. With CORP, models retain high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
