Objective Evaluation of Deep Uncertainty Predictions for COVID-19 Detection
Hamzeh Asgharnezhad, Afshar Shamsi, Roohallah Alizadehsani, Abbas, Khosravi, Saeid Nahavandi, Zahra Alizadeh Sani, and Dipti Srinivasan

TL;DR
This paper evaluates uncertainty quantification techniques for COVID-19 detection in chest X-ray images, introducing new metrics and a confusion matrix to assess the reliability of neural network predictions in medical diagnosis.
Contribution
It proposes the uncertainty confusion matrix and new evaluation metrics, and compares the effectiveness of different uncertainty quantification methods for COVID-19 detection.
Findings
Networks trained on CXR images outperform those pretrained on ImageNet.
Uncertainty estimates are higher for incorrect predictions, enabling risk flagging.
Ensemble methods more reliably capture predictive uncertainties.
Abstract
Deep neural networks (DNNs) have been widely applied for detecting COVID-19 in medical images. Existing studies mainly apply transfer learning and other data representation strategies to generate accurate point estimates. The generalization power of these networks is always questionable due to being developed using small datasets and failing to report their predictive confidence. Quantifying uncertainties associated with DNN predictions is a prerequisite for their trusted deployment in medical settings. Here we apply and evaluate three uncertainty quantification techniques for COVID-19 detection using chest X-Ray (CXR) images. The novel concept of uncertainty confusion matrix is proposed and new performance metrics for the objective evaluation of uncertainty estimates are introduced. Through comprehensive experiments, it is shown that networks pertained on CXR images outperform networks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Machine Learning in Healthcare · Anomaly Detection Techniques and Applications
