Robustness of an Artificial Intelligence Solution for Diagnosis of Normal Chest X-Rays
Tom Dyer, Jordan Smith, Gaetan Dissez, Nicole Tay, Qaiser Malik, Tom, Naunton Morgan, Paul Williams, Liliana Garcia-Mondragon, George Pearse, and, Simon Rasalingham

TL;DR
This study evaluates the robustness of an AI system for diagnosing normal chest X-rays, demonstrating high negative predictive value and consistent performance across diverse patient groups and settings, with potential workload reduction benefits.
Contribution
The paper provides a comprehensive assessment of an AI diagnostic tool's robustness across multiple subgroups and compares its errors with human radiologists, highlighting its reliability and efficiency.
Findings
AI classified 18.5% of scans as high confidence normal
AI achieved a negative predictive value of 96.0%
AI errors matched radiologist errors in false negatives
Abstract
Purpose: Artificial intelligence (AI) solutions for medical diagnosis require thorough evaluation to demonstrate that performance is maintained for all patient sub-groups and to ensure that proposed improvements in care will be delivered equitably. This study evaluates the robustness of an AI solution for the diagnosis of normal chest X-rays (CXRs) by comparing performance across multiple patient and environmental subgroups, as well as comparing AI errors with those made by human experts. Methods: A total of 4,060 CXRs were sampled to represent a diverse dataset of NHS patients and care settings. Ground-truth labels were assigned by a 3-radiologist panel. AI performance was evaluated against assigned labels and sub-groups analysis was conducted against patient age and sex, as well as CXR view, modality, device manufacturer and hospital site. Results: The AI solution was able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Artificial Intelligence in Healthcare and Education · COVID-19 diagnosis using AI
