TL;DR
MaCP is a novel adaptation method that leverages hierarchical cosine projection to efficiently fine-tune large models with minimal parameters, achieving high accuracy across diverse tasks.
Contribution
It introduces a cosine projection-based adaptation technique that reduces parameters and memory while maintaining or improving performance on various tasks.
Findings
Outperforms existing methods in accuracy and efficiency.
Reduces computational complexity and memory usage.
Effective across single- and multi-modality tasks.
Abstract
We present a new adaptation method MaCP, Minimal yet Mighty adaptive Cosine Projection, that achieves exceptional performance while requiring minimal parameters and memory for fine-tuning large foundation models. Its general idea is to exploit the superior energy compaction and decorrelation properties of cosine projection to improve both model efficiency and accuracy. Specifically, it projects the weight change from the low-rank adaptation into the discrete cosine space. Then, the weight change is partitioned over different levels of the discrete cosine spectrum, and each partition's most critical frequency components are selected. Extensive experiments demonstrate the effectiveness of MaCP across a wide range of single-modality tasks, including natural language understanding, natural language generation, text summarization, as well as multi-modality tasks such as image classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
