TL;DR
CRISP is a unified framework that efficiently combines parameter-efficient fine-tuning and model compression by factorizing pretrained weights into shared basis matrices and small mixing projections.
Contribution
It introduces a novel method that integrates multiple parameter recombination tasks within a single framework, outperforming prior dual-task methods.
Findings
CRISP outperforms prior dual-task methods by 4-5%.
CRISP surpasses state-of-the-art in PEFT by 1.5%.
CRISP improves PEFT+MC performance by 1%.
Abstract
Parameter Recombination (PR) methods aim to efficiently compose the weights of a neural network for applications like Parameter-Efficient FineTuning (PEFT) and Model Compression (MC), among others. Most methods typically focus on one application of PR, which can make composing them challenging. For example, when deploying a large model you may wish to compress the model and also quickly adapt to new settings. However, PEFT methods often can still contain millions of parameters. This may be small compared to the original model size, but can be problematic in resource constrained deployments like edge devices, where they take a larger portion of the compressed model's parameters. To address this, we present Coefficient-gated weight Recombination by Interpolated Shared basis Projections (CRISP), a general approach that seamlessly integrates multiple PR tasks within the same framework.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
