PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Yangyang Guo; Guangzhi Wang; Mohan Kankanhalli

arXiv:2310.10700·cs.CV·November 21, 2023·1 cites

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli

PDF

Open Access 1 Repo

TL;DR

This paper introduces PELA, a parameter-efficient method that compresses large pre-trained models using low-rank approximation and specialized modules, enabling effective downstream fine-tuning with reduced resources.

Contribution

The paper proposes a novel low-rank approximation approach with feature distillation and regularization modules, updating only the compressed model for efficiency.

Findings

01

Reduces parameter size by 1/3 to 2/3 with minimal performance loss

02

Maintains comparable results across multiple vision and vision-language models

03

Achieves efficiency in parameters and computation time

Abstract

Applying a pre-trained large model to downstream tasks is prohibitive under resource-constrained conditions. Recent dominant approaches for addressing efficiency issues involve adding a few learnable parameters to the fixed backbone model. This strategy, however, leads to more challenges in loading large models for downstream fine-tuning with limited resources. In this paper, we propose a novel method for increasing the parameter efficiency of pre-trained models by introducing an intermediate pre-training stage. To this end, we first employ low-rank approximation to compress the original large model and then devise a feature distillation module and a weight perturbation regularization module. These modules are specifically designed to enhance the low-rank model. In particular, we update only the low-rank model while freezing the backbone parameters during pre-training. This allows for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

guoyang9/pela
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Softmax · Residual Connection · Absolute Position Encodings · Layer Normalization · Adam · Byte Pair Encoding