Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models
Wei Peng, Kang Liu, Jianchen Hu, Meng Zhang

TL;DR
Biomed-DPT introduces a dual modality prompt tuning approach that enhances biomedical image classification by incorporating domain knowledge and attention re-weighting, significantly improving accuracy across diverse datasets.
Contribution
The paper presents a novel knowledge-enhanced dual modality prompt tuning method for biomedical vision-language models, integrating clinical prompts and attention-focused vision prompts.
Findings
Achieves 66.14% average accuracy across 11 datasets.
Surpasses existing methods like CoOp by up to 8.04%.
Effective in both base and novel class recognition.
Abstract
Prompt learning is one of the most effective paradigms for adapting pre-trained vision-language models (VLMs) to the biomedical image classification tasks in few shot scenarios. However, most of the current prompt learning methods only used the text prompts and ignored the particular structures (such as the complex anatomical structures and subtle pathological features) in the biomedical images. In this work, we propose Biomed-DPT, a knowledge-enhanced dual modality prompt tuning technique. In designing the text prompt, Biomed-DPT constructs a dual prompt including the template-driven clinical prompts and the large language model (LLM)-driven domain-adapted prompts, then extracts the clinical knowledge from the domain-adapted prompts through the knowledge distillation technique. In designing the vision prompt, Biomed-DPT introduces the zero vector as a soft prompt to leverage attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsSoftmax · Attention Is All You Need · Balanced Selection · Knowledge Distillation · Focus
