Evaluating Reliability in Medical DNNs: A Critical Analysis of Feature and Confidence-Based OOD Detection
Harry Anthony, Konstantinos Kamnitsas

TL;DR
This paper critically evaluates confidence-based and feature-based out-of-distribution detection methods in medical DNNs, revealing artefact influence on confidence scores and advocating for combined approaches to improve reliability.
Contribution
It introduces new OOD benchmarks for medical imaging, analyzes artefact effects on confidence scores, and compares the effectiveness of different OOD detection methods.
Findings
Artefacts can increase model confidence, contradicting assumptions.
Feature-based methods outperform confidence-based methods in OOD detection.
Combining methods can mitigate individual weaknesses.
Abstract
Reliable use of deep neural networks (DNNs) for medical image analysis requires methods to identify inputs that differ significantly from the training data, called out-of-distribution (OOD), to prevent erroneous predictions. OOD detection methods can be categorised as either confidence-based (using the model's output layer for OOD detection) or feature-based (not using the output layer). We created two new OOD benchmarks by dividing the D7P (dermatology) and BreastMNIST (ultrasound) datasets into subsets which either contain or don't contain an artefact (rulers or annotations respectively). Models were trained with artefact-free images, and images with the artefacts were used as OOD test sets. For each OOD image, we created a counterfactual by manually removing the artefact via image processing, to assess the artefact's impact on the model's predictions. We show that OOD artefacts can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuality and Safety in Healthcare · Anomaly Detection Techniques and Applications · Artificial Intelligence in Healthcare
MethodsSoftmax
