Embedded Visual Prompt Tuning
Wenqiang Zu, Shenghao Xie, Qing Zhao, Guoqi Li, Lei Ma

TL;DR
This paper introduces Embedded Prompt Tuning (EPT), a parameter-efficient method embedding prompts into model channels, improving few-shot medical image classification and mitigating feature space anomalies during pre-training.
Contribution
The paper proposes EPT, a novel prompt tuning approach that embeds prompts into expanded channels, enhancing performance and efficiency in cross-domain medical image classification.
Findings
EPT outperforms state-of-the-art fine-tuning methods.
EPT achieves significant accuracy improvements in few-shot medical tasks.
EPT is computationally efficient, completing training rapidly.
Abstract
Foundation models pre-trained on large-scale data have been widely witnessed to achieve success in various natural imaging downstream tasks. Parameter-efficient fine-tuning (PEFT) methods aim to adapt foundation models to new domains by updating only a small portion of parameters in order to reduce computational overhead. However, the effectiveness of these PEFT methods, especially in cross-domain few-shot scenarios, e.g., medical image analysis, has not been fully explored. In this work, we facilitate the study of the performance of PEFT when adapting foundation models to medical image classification tasks. Furthermore, to alleviate the limitations of prompt introducing ways and approximation capabilities on Transformer architectures of mainstream prompt tuning methods, we propose the Embedded Prompt Tuning (EPT) method by embedding prompt tokens into the expanded channels. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Image and Signal Denoising Methods · Medical Imaging Techniques and Applications
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Adam · Dense Connections
