MedCLIP: Contrastive Learning from Unpaired Medical Images and Text

Zifeng Wang; Zhenbang Wu; Dinesh Agarwal; Jimeng Sun

arXiv:2210.10163·cs.CV·October 20, 2022·29 cites

MedCLIP: Contrastive Learning from Unpaired Medical Images and Text

Zifeng Wang, Zhenbang Wu, Dinesh Agarwal, Jimeng Sun

PDF

Open Access 1 Repo 1 Models 1 Datasets

TL;DR

MedCLIP introduces a novel contrastive learning framework for medical images and texts that scales data efficiently and reduces false negatives, significantly improving zero-shot and supervised medical image classification and retrieval.

Contribution

It decouples images and texts for scalable training and replaces the InfoNCE loss with a semantic matching loss based on medical knowledge, addressing false negatives.

Findings

01

Outperforms state-of-the-art methods on multiple tasks

02

Achieves superior results with only 20K pre-training data

03

Effectively reduces false negatives in contrastive learning

Abstract

Existing vision-text contrastive learning like CLIP aims to match the paired image and caption embeddings while pushing others apart, which improves representation transferability and supports zero-shot prediction. However, medical image-text datasets are orders of magnitude below the general images and captions from the internet. Moreover, previous methods encounter many false negatives, i.e., images and reports from separate patients probably carry the same semantics but are wrongly treated as negatives. In this paper, we decouple images and texts for multimodal contrastive learning thus scaling the usable training data in a combinatorial magnitude with low cost. We also propose to replace the InfoNCE loss with semantic matching loss based on medical knowledge to eliminate false negatives in contrastive learning. We prove that MedCLIP is a simple yet effective framework: it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ryanwangzf/medclip
pytorchOfficial

Models

🤗
razaimam45/RobustMedCLIP
model

Datasets

razaimam45/MediMeta-C
dataset· 575 dl
575 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning · Contrastive Language-Image Pre-training · InfoNCE