Detecting Domain Shift in Multiple Instance Learning for Digital Pathology Using Fr\'echet Domain Distance
Milda Pocevi\v{c}i\=ut\.e, Gabriel Eilertsen, Stina Garvin, Claes, Lundstr\"om

TL;DR
This paper investigates the sensitivity of multiple-instance learning (MIL) in digital pathology to real-world domain shifts and introduces Fréchet Domain Distance (FDD), an unsupervised metric to detect such shifts effectively.
Contribution
The study demonstrates MIL's vulnerability to clinical domain shifts, evaluates feature suitability for shift detection, and proposes FDD as a novel unsupervised metric for quantifying domain shifts.
Findings
FDD achieved a 0.70 correlation with performance changes.
Compared to baselines, FDD showed superior correlation.
MIL performance is affected by realistic domain differences.
Abstract
Multiple-instance learning (MIL) is an attractive approach for digital pathology applications as it reduces the costs related to data collection and labelling. However, it is not clear how sensitive MIL is to clinically realistic domain shifts, i.e., differences in data distribution that could negatively affect performance, and if already existing metrics for detecting domain shifts work well with these algorithms. We trained an attention-based MIL algorithm to classify whether a whole-slide image of a lymph node contains breast tumour metastases. The algorithm was evaluated on data from a hospital in a different country and various subsets of this data that correspond to different levels of domain shift. Our contributions include showing that MIL for digital pathology is affected by clinically realistic differences in data, evaluating which features from a MIL model are most suitable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
