Can language-guided unsupervised adaptation improve medical image   classification using unpaired images and texts?

Umaima Rahman; Raza Imam; Mohammad Yaqub; Boulbaba Ben Amor,; Dwarikanath Mahapatra

arXiv:2409.02729·cs.CV·April 1, 2025

Can language-guided unsupervised adaptation improve medical image classification using unpaired images and texts?

Umaima Rahman, Raza Imam, Mohammad Yaqub, Boulbaba Ben Amor,, Dwarikanath Mahapatra

PDF

Open Access

TL;DR

This paper introduces MedUnA, a novel unsupervised adaptation method for vision-language models that leverages unpaired images and texts to improve medical image classification, especially when labeled data is scarce.

Contribution

MedUnA is the first approach to enable unsupervised learning of medical image classifiers using unpaired images and texts with vision-language models.

Findings

01

Significant accuracy improvements over zero-shot baselines.

02

Effective in scenarios with limited labeled medical images.

03

Applicable across multiple medical imaging datasets.

Abstract

In medical image classification, supervised learning is challenging due to the scarcity of labeled medical images. To address this, we leverage the visual-textual alignment within Vision-Language Models (VLMs) to enable unsupervised learning of a medical image classifier. In this work, we propose \underline{Med}ical \underline{Un}supervised \underline{A}daptation (\texttt{MedUnA}) of VLMs, where the LLM-generated descriptions for each class are encoded into text embeddings and matched with class labels via a cross-modal adapter. This adapter attaches to a visual encoder of \texttt{MedCLIP} and aligns the visual embeddings through unsupervised learning, driven by a contrastive entropy-based loss and prompt tuning. Thereby, improving performance in scenarios where textual information is more abundant than labeled images, particularly in the healthcare domain. Unlike traditional VLMs,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · AI in cancer detection

MethodsAdapter · ALIGN