Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training
Aisha Urooj Khan, John Garrett, Tyler Bradshaw, Lonie Salkowski,, Jiwoong Jason Jeong, Amara Tariq, Imon Banerjee

TL;DR
This paper introduces a domain adaptation framework for vision-language models in medical imaging, specifically for screening mammograms, using selective sampling and hard-negative mining to improve retrieval performance in various training scenarios.
Contribution
The study presents a novel adaptation strategy for VLMs to better handle medical domain data, addressing domain shift and data scarcity issues.
Findings
Enhanced Recall@K in image-text retrieval tasks.
Effective adaptation of both in-domain and out-of-domain VLMs.
Improved performance across zero-shot, few-shot, and supervised settings.
Abstract
A visual-language model (VLM) pre-trained on natural images and text pairs poses a significant barrier when applied to medical contexts due to domain shift. Yet, adapting or fine-tuning these VLMs for medical use presents considerable hurdles, including domain misalignment, limited access to extensive datasets, and high-class imbalances. Hence, there is a pressing need for strategies to effectively adapt these VLMs to the medical domain, as such adaptations would prove immensely valuable in healthcare applications. In this study, we propose a framework designed to adeptly tailor VLMs to the medical domain, employing selective sampling and hard-negative mining techniques for enhanced performance in retrieval tasks. We validate the efficacy of our proposed approach by implementing it across two distinct VLMs: the in-domain VLM (MedCLIP) and out-of-domain VLMs (ALBEF). We assess the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies
