Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces
Jingze Ge, Yun Liu, Xue Geng, Wanqi Dong, Wang Zhe Mark, Min Wu, Xulei Yang

TL;DR
JACTUS is a unified framework that combines compression and adaptation for large pretrained models, improving efficiency and performance across vision and language tasks by jointly optimizing subspaces.
Contribution
It introduces a task-aware union of subspaces approach that couples compression and adaptation, outperforming sequential methods in parameter efficiency and accuracy.
Findings
Achieves 89.2% accuracy on ViT-Base across eight datasets at 80% parameters.
Attains 80.9% average on Llama2-7B commonsense QA at 80% parameters.
Outperforms strong PEFT baselines and prior compress-then-finetune pipelines.
Abstract
Adapting large pretrained models to diverse tasks is now routine, yet the two dominant strategies of parameter-efficient fine-tuning (PEFT) and low-rank compression are typically composed in sequence. This decoupled practice first compresses and then fine-tunes adapters, potentially misaligning the compressed subspace with downstream objectives and squandering a global parameter budget. To overcome this limitation, we introduce JACTUS (Joint Adaptation and Compression with a Task-aware Union of Subspaces), a single framework that unifies compression and adaptation. From a small calibration set, JACTUS estimates input and pre-activation gradient covariances, forms their orthogonal union with the pretrained weight subspace, performs a projected low-rank approximation inside this union, allocates rank globally by marginal gain per parameter, and trains only a compact core matrix. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
