CLEFT: Language-Image Contrastive Learning with Efficient Large Language   Model and Prompt Fine-Tuning

Yuexi Du; Brian Chang; Nicha C. Dvornek

arXiv:2407.21011·cs.CV·July 31, 2024

CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning

Yuexi Du, Brian Chang, Nicha C. Dvornek

PDF

Open Access 1 Repo

TL;DR

CLEFT introduces an efficient contrastive learning approach combining large language models and prompt fine-tuning, achieving state-of-the-art results in medical imaging with reduced computational resources.

Contribution

It presents a novel, resource-efficient contrastive learning framework that leverages large pre-trained language models and context-based prompt learning for medical image analysis.

Findings

01

State-of-the-art performance on chest X-ray and mammography datasets

02

Reduces trainable model size by 39% and language model parameters to 4%

03

Effective prompt learning strategy bridges clinical data and labels

Abstract

Recent advancements in Contrastive Language-Image Pre-training (CLIP) have demonstrated notable success in self-supervised representation learning across various tasks. However, the existing CLIP-like approaches often demand extensive GPU resources and prolonged training times due to the considerable size of the model and dataset, making them poor for medical applications, in which large datasets are not always common. Meanwhile, the language model prompts are mainly manually derived from labels tied to images, potentially overlooking the richness of information within training samples. We introduce a novel language-image Contrastive Learning method with an Efficient large language model and prompt Fine-Tuning (CLEFT) that harnesses the strengths of the extensive pre-trained language and visual models. Furthermore, we present an efficient strategy for learning context-based prompts that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xypb/cleft
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI · Interpreting and Communication in Healthcare

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dropout · WordPiece · Weight Decay · Attention Dropout · Residual Connection · Adam · Linear Layer · Layer Normalization