Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?
Cheng Han, Qifan Wang, Yiming Cui, Wenguan Wang, Lifu Huang, Siyuan, Qi, Dongfang Liu

TL;DR
This paper analyzes when and why Visual Prompt Tuning (VPT) outperforms full fine-tuning in vision models, revealing it is preferable under certain task and data conditions and due to its feature-preserving mechanism.
Contribution
It offers a comprehensive analysis across multiple datasets to clarify the conditions favoring VPT and investigates the underlying reasons for its effectiveness.
Findings
VPT is preferable when task objectives differ significantly or data distributions are similar.
VPT's success is linked to its ability to preserve original features and add parameters.
Overfitting and optimization are not the sole reasons for VPT's effectiveness.
Abstract
As the scale of vision models continues to grow, the emergence of Visual Prompt Tuning (VPT) as a parameter-efficient transfer learning technique has gained attention due to its superior performance compared to traditional full-finetuning. However, the conditions favoring VPT (the ``when") and the underlying rationale (the ``why") remain unclear. In this paper, we conduct a comprehensive analysis across 19 distinct datasets and tasks. To understand the ``when" aspect, we identify the scenarios where VPT proves favorable by two dimensions: task objectives and data distributions. We find that VPT is preferrable when there is 1) a substantial disparity between the original and the downstream task objectives (e.g., transitioning from classification to counting), or 2) a similarity in data distributions between the two tasks (e.g., both involve natural images). In exploring the ``why"…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Image Processing Techniques and Applications
