COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression
Denis Makhov, Dmitriy Shopkhoev, Magauiya Zhussip, Ammar Ali, Baher Mohammad, Stamatios Lefkimmiatis

TL;DR
COMPOT is a training-free, calibration-based framework for Transformer compression that uses orthogonal dictionaries and a dynamic layer-wise rate allocation to outperform traditional low-rank and sparse methods.
Contribution
It introduces COMPOT, a novel non-iterative, calibration-optimized method employing orthogonal dictionaries and adaptive compression for Transformers.
Findings
Outperforms low-rank and sparse baselines in quality-compression trade-off.
Compatible with post-training quantization for extreme compression.
Effective across diverse architectures and tasks.
Abstract
Post-training compression of Transformer models commonly relies on truncated singular value decomposition (SVD). However, enforcing a single shared subspace can degrade accuracy even at moderate compression. Sparse dictionary learning provides a more flexible union-of-subspaces representation, but existing approaches often suffer from iterative dictionary and coefficient updates. We propose COMPOT (Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers), a training-free compression framework that uses a small calibration dataset to estimate a sparse weight factorization. COMPOT employs orthogonal dictionaries that enable closed-form Procrustes updates for the dictionary and analytical single-step sparse coding for the coefficients, eliminating iterative optimization. To handle heterogeneous layer sensitivity under a global compression budget, COMPOT further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Generative Adversarial Networks and Image Synthesis · Sparse and Compressive Sensing Techniques
