Deep Unsupervised Anomaly Detection in Brain Imaging: Large-Scale Benchmarking and Bias Analysis
Alexander Frotscher, Christian F. Baumgartner, Thomas Wolfers

TL;DR
This paper presents a comprehensive large-scale benchmark for deep unsupervised anomaly detection in brain MRI, revealing significant variability in performance, biases related to scanner and demographics, and highlighting areas for future improvement.
Contribution
It provides the first large-scale, multi-center benchmark of unsupervised brain MRI anomaly detection, systematically evaluating robustness, biases, and limitations of current algorithms.
Findings
Reconstruction-based methods outperform feature-based methods in lesion segmentation.
Algorithm performance varies widely, with Dice scores from 0.03 to 0.65.
Systematic biases related to scanner, age, and sex were identified.
Abstract
Deep unsupervised anomaly detection in brain magnetic resonance imaging offers a promising route to identify pathological deviations without requiring lesion-specific annotations. Yet, fragmented evaluations, heterogeneous datasets, and inconsistent metrics have hindered progress toward clinical translation. Here, we present a large-scale, multi-center benchmark of deep unsupervised anomaly detection for brain imaging. The training cohort comprised 2,976 T1 and 2,972 T2-weighted scans from healthy individuals across six scanners, with ages ranging from 6 to 89 years. Validation used 92 scans to tune hyperparameters and estimate unbiased thresholds. Testing encompassed 2,221 T1w and 1,262 T2w scans spanning healthy datasets and diverse clinical cohorts. Across all algorithms, the Dice-based segmentation performance varied between 0.03 and 0.65, indicating substantial variability. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Functional Brain Connectivity Studies
