Detecting Shortcuts in Medical Images -- A Case Study in Chest X-rays

Amelia Jim\'enez-S\'anchez; Dovile Juodelyte; Bethany Chamberlain; Veronika Cheplygina

arXiv:2211.04279·cs.CV·September 26, 2025

Detecting Shortcuts in Medical Images -- A Case Study in Chest X-rays

Amelia Jim\'enez-S\'anchez, Dovile Juodelyte, Bethany Chamberlain, Veronika Cheplygina

PDF

Open Access 2 Repos 1 Models

TL;DR

This paper highlights the issue of shortcuts and artifacts in medical image datasets, demonstrating their impact on model performance and emphasizing the need for careful data validation and subgroup testing in chest X-ray classification.

Contribution

The study validates previous concerns about shortcuts in medical imaging datasets and provides a detailed case study with annotations and recommendations for improving model robustness.

Findings

01

Models may exploit shortcuts, leading to overestimated performance.

02

Annotated subset of pneumothorax images with drains to improve data quality.

03

Recommendations for better dataset validation and subgroup testing.

Abstract

The availability of large public datasets and the increased amount of computing power have shifted the interest of the medical community to high-performance algorithms. However, little attention is paid to the quality of the data and their annotations. High performance on benchmark datasets may be reported without considering possible shortcuts or artifacts in the data, besides, models are not tested on subpopulation groups. With this work, we aim to raise awareness about shortcuts problems. We validate previous findings, and present a case study on chest X-rays using two publicly available datasets. We share annotations for a subset of pneumothorax images with drains. We conclude with general recommendations for medical image classification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
catd1860/DrainDetection
model· 2 dl
2 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging · AI in cancer detection