Robustness Stress Testing in Medical Image Classification
Mobarakol Islam, Zeju Li, Ben Glocker

TL;DR
This paper introduces progressive stress testing to evaluate the robustness and fairness of medical image classification models, revealing variability in model performance and the influence of pretraining on robustness.
Contribution
It proposes a systematic stress testing framework for assessing robustness and subgroup disparities in medical image models, emphasizing its importance for clinical validation.
Findings
Some models are more robust and equitable than others.
Pretraining significantly affects model robustness.
Stress testing reveals vulnerabilities not seen in standard evaluation.
Abstract
Deep neural networks have shown impressive performance for image-based disease detection. Performance is commonly evaluated through clinical validation on independent test sets to demonstrate clinically acceptable accuracy. Reporting good performance metrics on test sets, however, is not always a sufficient indication of the generalizability and robustness of an algorithm. In particular, when the test data is drawn from the same distribution as the training data, the iid test set performance can be an unreliable estimate of the accuracy on new data. In this paper, we employ stress testing to assess model robustness and subgroup performance disparities in disease detection models. We design progressive stress testing using five different bidirectional and unidirectional image perturbations with six different severity levels. As a use case, we apply stress tests to measure the robustness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI · Artificial Intelligence in Healthcare and Education
