On the limits of cross-domain generalization in automated X-ray prediction
Joseph Paul Cohen, Mohammad Hashir, Rupert Brooks, Hadrien, Bertrand

TL;DR
This study investigates the limits of cross-domain generalization in automated X-ray diagnosis, revealing that label shifts, not image shifts, primarily hinder model transferability across datasets.
Contribution
It provides a large-scale analysis of cross-domain generalization in X-ray prediction, highlighting the role of label shifts and discrepancies between model performance and agreement.
Findings
Label shifts, not image shifts, dominate generalization issues.
Models with similar performance can disagree significantly.
Regularization reveals variation in task similarity across datasets.
Abstract
This large scale study focuses on quantifying what X-rays diagnostic prediction tasks generalize well across multiple different datasets. We present evidence that the issue of generalization is not due to a shift in the images but instead a shift in the labels. We study the cross-domain performance, agreement between models, and model representations. We find interesting discrepancies between performance and agreement where models which both achieve good performance disagree in their predictions as well as models which agree yet achieve poor performance. We also test for concept similarity by regularizing a network to group tasks across multiple datasets together and observe variation across the tasks. All code is made available online and data is publicly available: https://github.com/mlmed/torchxrayvision
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗torchxrayvision/densenet121-res224-allmodel· 16 dl· ♡ 316 dl♡ 3
- 🤗torchxrayvision/densenet121-res224-nihmodel· 8 dl· ♡ 18 dl♡ 1
- 🤗torchxrayvision/densenet121-res224-pcmodel· 6 dl6 dl
- 🤗torchxrayvision/densenet121-res224-chexmodel· 17 dl· ♡ 217 dl♡ 2
- 🤗torchxrayvision/densenet121-res224-rsnamodel· 5 dl5 dl
- 🤗torchxrayvision/densenet121-res224-mimic_nbmodel· 7 dl7 dl
- 🤗torchxrayvision/densenet121-res224-mimic_chmodel· 4 dl4 dl
- 🤗torchxrayvision/resnet50-res512-allmodel· 22 dl22 dl
- 🤗lucazhou2000/my-densenet121-chexmodel· 5 dl5 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Radiology practices and education · AI in cancer detection
MethodsTest
