Deep Learning Applied to Chest X-Rays: Exploiting and Preventing Shortcuts
Sarah Jabbour, David Fouhey, Ella Kazerooni, Michael W. Sjoding, Jenna, Wiens

TL;DR
This paper investigates how deep learning models for chest X-ray diagnosis can exploit dataset biases, and demonstrates that transfer learning can mitigate shortcut reliance, improving generalization across different patient populations.
Contribution
It identifies the tendency of deep networks to rely on spurious correlations in chest X-ray datasets and proposes a transfer learning method to reduce this shortcut behavior.
Findings
Deep nets can accurately identify patient attributes like sex and age.
Models tend to exploit attribute-outcome correlations, reducing test performance when these correlations change.
Transfer learning significantly improves model generalization on skewed chest X-ray datasets.
Abstract
While deep learning has shown promise in improving the automated diagnosis of disease based on chest X-rays, deep networks may exhibit undesirable behavior related to shortcuts. This paper studies the case of spurious class skew in which patients with a particular attribute are spuriously more likely to have the outcome of interest. For instance, clinical protocols might lead to a dataset in which patients with pacemakers are disproportionately likely to have congestive heart failure. This skew can lead to models that take shortcuts by heavily relying on the biased attribute. We explore this problem across a number of attributes in the context of diagnosing the cause of acute hypoxemic respiratory failure. Applied to chest X-rays, we show that i) deep nets can accurately identify many patient attributes including sex (AUROC = 0.96) and age (AUROC >= 0.90), ii) they tend to exploit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Machine Learning in Healthcare · Phonocardiography and Auscultation Techniques
