Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning

Chunqing Ruan; Hongjian Wang

arXiv:2309.06123·cs.CV·September 13, 2023

Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning

Chunqing Ruan, Hongjian Wang

PDF

Open Access

TL;DR

This paper introduces DVPT, a dynamic visual prompt tuning method that generates instance-specific prompts for each image, improving transfer learning efficiency and performance on visual tasks.

Contribution

The paper proposes a novel dynamic prompt generation framework using a Meta-Net, enabling instance-wise adaptation in parameter-efficient transfer learning.

Findings

01

DVPT outperforms existing PETL methods on various tasks.

02

DVPT surpasses full fine-tuning on 17 out of 19 tasks.

03

The approach maintains high parameter efficiency.

Abstract

Parameter efficient transfer learning (PETL) is an emerging research spot that aims to adapt large-scale pre-trained models to downstream tasks. Recent advances have achieved great success in saving storage and computation costs. However, these methods do not take into account instance-specific visual clues for visual tasks. In this paper, we propose a Dynamic Visual Prompt Tuning framework (DVPT), which can generate a dynamic instance-wise token for each image. In this way, it can capture the unique visual feature of each image, which can be more suitable for downstream visual tasks. We designed a Meta-Net module that can generate learnable prompts based on each image, thereby capturing dynamic instance-wise visual features. Extensive experiments on a wide range of downstream recognition tasks show that DVPT achieves superior performance than other PETL methods. More importantly, DVPT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM