Unified Medical Image-Text-Label Contrastive Learning With Continuous Prompt
Yuhao Wang

TL;DR
This paper introduces a unified contrastive learning framework for medical image-text-label data using continuous prompts, improving data utilization and reducing false negatives to enhance downstream task performance.
Contribution
The paper proposes a novel unified framework that combines images, text, and labels with continuous prompts to improve medical image-text pre-training.
Findings
Enhanced performance on multiple downstream tasks
Effective reduction of false-negative sample issues
Expanded training data through data unification
Abstract
Contrastive language-image Pre-training (CLIP) [13] can leverage large datasets of unlabeled Image-Text pairs, which have demonstrated impressive performance in various downstream tasks. Given that annotating medical data is time-consuming and laborious, Image-Text Pre-training has promising applications in exploiting large-scale medical image and radiology report datasets. However, medical Image-Text Pre-training faces several challenges, as follows: (1) Due to privacy concerns, the amount of available medical data is relatively small compared to natural data, leading to weaker generalization ability of the model. (2) Medical images are highly similar with only fine-grained differences in subtleties, resulting in a large number of false-negative sample pairs in comparison learning. (3) The hand-crafted Prompt usually differs from the natural medical image report, Subtle changes in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications
MethodsContrastive Learning
