Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Zeyu Han; Chao Gao; Jinyang Liu; Jeff Zhang; Sai Qian Zhang

arXiv:2403.14608·cs.LG·September 17, 2024·91 cites

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Zeyu Han, Chao Gao, Jinyang Liu, Jeff Zhang, Sai Qian Zhang

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper provides a comprehensive survey of Parameter-Efficient Fine-Tuning (PEFT) methods for large models, analyzing their algorithms, performance, computational costs, and system implementations to guide future research and practical applications.

Contribution

It offers an extensive overview of PEFT algorithms, their performance, and system design considerations, filling a gap in understanding how to efficiently adapt large models.

Findings

01

PEFT algorithms significantly reduce computational costs.

02

Different PEFT techniques vary in performance and resource efficiency.

03

System design impacts the practical deployment of PEFT methods.

Abstract

Large models represent a groundbreaking advancement in multiple application fields, enabling remarkable achievements across various tasks. However, their unprecedented scale comes with significant computational costs. These models, often consisting of billions of parameters, require vast amounts of computational resources for execution. Especially, the expansive scale and computational demands pose considerable challenges when customizing them for particular downstream tasks, particularly over the hardware platforms constrained by computational capabilities. Parameter Efficient Fine-Tuning (PEFT) provides a practical solution by efficiently adjusting the large models over the various downstream tasks. In particular, PEFT refers to the process of adjusting the parameters of a pre-trained large model to adapt it to a specific task or domain while minimizing the number of additional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
ethicalabs/xLSTM-7b-Instruct
model· 108 dl· ♡ 3
108 dl♡ 3

Datasets

BAAI/SurveyScope
dataset· 6 dl
6 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Model Reduction and Neural Networks · Neural Networks and Applications