Towards Robust Foundation Models for Digital Pathology
Jonah K\"omen, Edwin D. de Jong, Julius Hense, Hannah Marienwald, Jonas Dippel, Philip Naumann, Eric Marcus, Lukas Ruff, Maximilian Alber, Jonas Teuwen, Frederick Klauschen, Klaus-Robert M\"uller

TL;DR
This paper systematically investigates the robustness of biomedical foundation models in digital pathology, revealing significant vulnerabilities to non-biological features and proposing a framework for their robustification to ensure safe clinical deployment.
Contribution
It introduces PathoROB, a new robustness benchmark with metrics and datasets, and demonstrates the importance of robustness evaluation and improvement for clinical safety.
Findings
All evaluated FMs show robustness deficits.
Robustness differences significantly impact diagnostic accuracy.
Post-hoc robustification reduces, but does not eliminate, errors.
Abstract
Biomedical Foundation Models (FMs) are rapidly transforming AI-enabled healthcare research and entering clinical validation. However, their susceptibility to learning non-biological technical features -- including variations in surgical/endoscopic techniques, laboratory procedures, and scanner hardware -- poses risks for clinical deployment. We present the first systematic investigation of pathology FM robustness to non-biological features. Our work (i) introduces measures to quantify FM robustness, (ii) demonstrates the consequences of limited robustness, and (iii) proposes a framework for FM robustification to mitigate these issues. Specifically, we developed PathoROB, a robustness benchmark with three novel metrics, including the robustness index, and four datasets covering 28 biological classes from 34 medical centers. Our experiments reveal robustness deficits across all 20…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection
