GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction
Jiacheng Ruan, Jingsheng Gao, Mingye Xie, Suncheng Xiang, Zefang Yu,, Ting Liu, Yuzhuo Fu

TL;DR
GIST enhances parameter-efficient fine-tuning by explicitly linking task-specific knowledge with trainable tokens and enabling knowledge interaction, leading to improved performance with minimal additional parameters.
Contribution
Introducing GIST, a novel PEFT framework that incorporates a trainable Gist token and bidirectional knowledge interaction to better leverage pre-trained model knowledge.
Findings
Achieved 2.25% performance improvement on VTAB-1K benchmark.
Added only 0.8K parameters to existing PEFT methods.
Demonstrated universality and scalability across tasks.
Abstract
The Parameter-Efficient Fine-Tuning (PEFT) method, which adjusts or introduces fewer trainable parameters to calibrate pre-trained models on downstream tasks, has become a recent research interest. However, existing PEFT methods within the traditional fine-tiuning framework have two main shortcomings: 1) They overlook the explicit association between trainable parameters and downstream task knowledge. 2) They neglect the interaction between the intrinsic task-agnostic knowledge of pre-trained models and the task-specific knowledge in downstream tasks. To address this gap, we propose a novel fine-tuning framework, named GIST, in a plug-and-play manner. Specifically, our framework first introduces a trainable token, called the Gist token, when applying PEFT methods on downstream tasks. This token serves as an aggregator of the task-specific knowledge learned by the PEFT methods and forms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition
MethodsAdapter
