Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han, Chao Gao, Jinyang Liu, Jeff Zhang, Sai Qian Zhang

TL;DR
This paper provides a comprehensive survey of Parameter-Efficient Fine-Tuning (PEFT) methods for large models, analyzing their algorithms, performance, computational costs, and system implementations to guide future research and practical applications.
Contribution
It offers an extensive overview of PEFT algorithms, their performance, and system design considerations, filling a gap in understanding how to efficiently adapt large models.
Findings
PEFT algorithms significantly reduce computational costs.
Different PEFT techniques vary in performance and resource efficiency.
System design impacts the practical deployment of PEFT methods.
Abstract
Large models represent a groundbreaking advancement in multiple application fields, enabling remarkable achievements across various tasks. However, their unprecedented scale comes with significant computational costs. These models, often consisting of billions of parameters, require vast amounts of computational resources for execution. Especially, the expansive scale and computational demands pose considerable challenges when customizing them for particular downstream tasks, particularly over the hardware platforms constrained by computational capabilities. Parameter Efficient Fine-Tuning (PEFT) provides a practical solution by efficiently adjusting the large models over the various downstream tasks. In particular, PEFT refers to the process of adjusting the parameters of a pre-trained large model to adapt it to a specific task or domain while minimizing the number of additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Model Reduction and Neural Networks · Neural Networks and Applications
