Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography

Shantanu Ghosh; Clare B. Poynton; Shyam Visweswaran; Kayhan Batmanghelich

arXiv:2405.12255 · eess.IV · May 24, 2024

TL;DR

Mammo-CLIP is a vision-language model trained on mammogram-report pairs, improving data efficiency and robustness in breast cancer detection tasks, with an added interpretability method for report-based spatial explanations.

Contribution

This work introduces Mammo-CLIP, the first vision-language model (VLM) trained on mammogram-report data, improving computer-aided diagnosis (CAD) performance and interpretability in mammography analysis.

Findings

01 Strong classification and localization performance on two public datasets

02 Robustness and data efficiency similar to what CLIP shows in computer vision

03 Mammo-FActOR, a novel feature attribution method for spatial interpretation using report sentences

Abstract

The lack of large and diverse training data on Computer-Aided Diagnosis (CAD) in breast cancer detection has been one of the concerns that impedes the adoption of the system. Recently, pre-training with large-scale image-text datasets via Vision-Language models (VLM) (e.g., CLIP) partially addresses the issue of robustness and data efficiency in computer vision (CV). This paper proposes Mammo-CLIP, the first VLM pre-trained on a substantial amount of screening mammogram-report pairs, addressing the challenges of dataset diversity and size. Our experiments on two public datasets demonstrate strong performance in classifying and localizing various mammographic attributes crucial for breast cancer detection, showcasing data efficiency and robustness similar to CLIP in CV. We also propose Mammo-FActOR, a novel feature attribution method, to provide spatial interpretation of representation…

Code & Models

Repositories

batmanlab/mammo-clip
PyTorch · Official

Models

🤗 shawn24/Mammo-CLIP
model · 21 downloads · ♡ 3

Taxonomy

Topics: AI in cancer detection · Biomedical Text Mining and Ontologies

Methods: Contrastive Language-Image Pre-training
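
For readers unfamiliar with the method tag, contrastive language-image pre-training scores every image against every text in a batch and rewards the matched pairs. The sketch below is a generic, pure-Python illustration of the symmetric contrastive loss used by CLIP-style models; it is not taken from the Mammo-CLIP code. The function names, the toy two-dimensional embeddings, and the temperature of 0.07 are all assumptions chosen for illustration, and the embeddings are assumed to be non-zero vectors.

```python
import math


def l2_normalize(v):
    """Scale a non-zero vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cross_entropy_diag(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def clip_style_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric contrastive loss over a batch of paired embeddings.

    image_embs[i] and text_embs[i] are assumed to come from the same
    image-report pair; all other combinations act as negatives.
    """
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    # Similarity matrix: rows index images, columns index texts.
    logits = [
        [sum(a * b for a, b in zip(i, t)) / temperature for t in txts]
        for i in imgs
    ]
    logits_t = [list(col) for col in zip(*logits)]  # text-to-image direction
    return 0.5 * (cross_entropy_diag(logits) + cross_entropy_diag(logits_t))


# Toy batch: matched pairs point the same way, so the loss is near zero.
aligned = clip_style_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
# Swapping the texts mismatches every pair, so the loss is large.
shuffled = clip_style_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

In a real model the embeddings would come from an image encoder and a text encoder trained jointly, and the temperature is often a learned parameter rather than a fixed constant.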