When are radiology reports useful for training medical image classifiers?

Herman Bergstr\"om; Zhongqi Yue; Fredrik D. Johansson

arXiv:2510.24385·cs.CV·October 30, 2025

When are radiology reports useful for training medical image classifiers?

Herman Bergstr\"om, Zhongqi Yue, Fredrik D. Johansson

PDF

TL;DR

This paper systematically studies how radiology reports can be used during pre-training and fine-tuning to improve medical image classification, revealing when report-based information is most beneficial.

Contribution

It provides a comprehensive analysis of report utilization during training, highlighting conditions where report-based pre-training and fine-tuning improve classification performance.

Findings

01

Report-based pre-training helps when labels are well-represented in text.

02

Explicit image-text alignment can be harmful if labels are weakly associated.

03

Fine-tuning with reports can outperform pre-training in certain scenarios.

Abstract

Medical images used to train machine learning models are often accompanied by radiology reports containing rich expert annotations. However, relying on these reports as inputs for clinical prediction requires the timely manual work of a trained radiologist. This raises a natural question: when can radiology reports be leveraged during training to improve image-only classification? Prior works are limited to evaluating pre-trained image representations by fine-tuning them to predict diagnostic labels, often extracted from reports, ignoring tasks with labels that are weakly associated with the text. To address this gap, we conduct a systematic study of how radiology reports can be used during both pre-training and fine-tuning, across diagnostic and prognostic tasks (e.g., 12-month readmission), and under varying training set sizes. Our findings reveal that: (1) Leveraging reports during…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.