Embedded Visual Prompt Tuning

Wenqiang Zu; Shenghao Xie; Qing Zhao; Guoqi Li; Lei Ma

arXiv:2407.01003·cs.CV·March 24, 2025

TL;DR

This paper introduces Embedded Prompt Tuning (EPT), a parameter-efficient method that embeds prompt tokens into expanded channels. It improves few-shot medical image classification and mitigates feature-space anomalies that arise during pre-training.

Contribution

The paper proposes EPT, a novel prompt tuning approach that embeds prompts into expanded channels, enhancing performance and efficiency in cross-domain medical image classification.
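
To make "embedding prompts into expanded channels" concrete, the sketch below contrasts two places a learnable prompt can enter a token matrix: as extra tokens (more rows) or as extra channels (wider rows). This is a shape-level illustration only, not the authors' implementation. The function names, the sizes, and the choice of one prompt row per token are assumptions made for the example; this page does not specify how EPT shares prompts across tokens or how later layers consume the widened input.

```python
import random


def random_matrix(rows, cols, rng):
    """Return a rows x cols list of lists filled with small Gaussian values."""
    return [[rng.gauss(0.0, 0.02) for _ in range(cols)] for _ in range(rows)]


def shape(matrix):
    """Return (rows, cols) of a non-empty list-of-lists matrix."""
    return (len(matrix), len(matrix[0]))


def prepend_prompt_tokens(tokens, prompt_tokens):
    """Token-axis prompting: add extra rows, keep the channel width."""
    return prompt_tokens + tokens


def embed_prompts_in_channels(tokens, channel_prompts):
    """Channel-axis prompting: widen every token, keep the token count.

    Assumption for this sketch: one prompt row per token.
    """
    if len(tokens) != len(channel_prompts):
        raise ValueError("need one prompt row per token")
    return [token + prompt for token, prompt in zip(tokens, channel_prompts)]


rng = random.Random(0)
num_tokens, dim = 4, 8              # illustrative sizes, not from the paper
num_prompts, extra_channels = 2, 3  # illustrative sizes, not from the paper

tokens = random_matrix(num_tokens, dim, rng)             # stands in for frozen patch embeddings
prompt_tokens = random_matrix(num_prompts, dim, rng)     # would be trained in token-axis prompting
channel_prompts = random_matrix(num_tokens, extra_channels, rng)  # would be trained in channel-axis prompting

token_axis = prepend_prompt_tokens(tokens, prompt_tokens)
channel_axis = embed_prompts_in_channels(tokens, channel_prompts)

print(shape(token_axis))    # (6, 8): more tokens, same width
print(shape(channel_axis))  # (4, 11): same tokens, wider channels
```

In both variants the original token values are left untouched and only the prompt entries would receive gradient updates, which is what keeps the approach parameter-efficient.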

Findings

1. EPT outperforms state-of-the-art fine-tuning methods.
2. EPT achieves significant accuracy improvements in few-shot medical tasks.
3. EPT is computationally efficient, completing training rapidly.

Abstract

Foundation models pre-trained on large-scale data have achieved widespread success on downstream natural-image tasks. Parameter-efficient fine-tuning (PEFT) methods adapt foundation models to new domains by updating only a small fraction of parameters, which reduces computational overhead. However, the effectiveness of these PEFT methods in cross-domain few-shot scenarios, such as medical image analysis, has not been fully explored. In this work, we study the performance of PEFT when adapting foundation models to medical image classification tasks. Furthermore, mainstream prompt tuning methods are limited both in how they introduce prompts and in their approximation capabilities on Transformer architectures. To alleviate these limitations, we propose the Embedded Prompt Tuning (EPT) method, which embeds prompt tokens into the expanded channels. We also…

Code & Models

Repositories

zuwenqiang/ept (PyTorch, official)

Taxonomy

Topics: Medical Image Segmentation Techniques · Image and Signal Denoising Methods · Medical Imaging Techniques and Applications

Methods: Attention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Adam · Dense Connections