Detecting Domain Shift in Multiple Instance Learning for Digital Pathology Using Fréchet Domain Distance

Milda Pocevičiūtė; Gabriel Eilertsen; Stina Garvin; Claes Lundström

arXiv:2405.09934 · cs.CV · May 17, 2024

TL;DR

This paper investigates the sensitivity of multiple-instance learning (MIL) in digital pathology to real-world domain shifts and introduces Fréchet Domain Distance (FDD), an unsupervised metric to detect such shifts effectively.

Contribution

The study demonstrates MIL's vulnerability to clinical domain shifts, evaluates feature suitability for shift detection, and proposes FDD as a novel unsupervised metric for quantifying domain shifts.
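The summary does not spell out how FDD is computed. As a rough illustration of the underlying idea, the sketch below computes the standard Fréchet distance between Gaussians fitted to two sets of feature vectors. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `frechet_distance`, the choice of features, and the synthetic inputs are all hypothetical, and the paper's actual feature selection from the MIL model is not reproduced here.

```python
import numpy as np


def frechet_distance(feats_a, feats_b):
    """Fréchet distance between Gaussians fitted to two feature sets.

    feats_a, feats_b: arrays of shape (n_samples, n_features).
    Hypothetical helper for illustration; not taken from the paper.
    """
    feats_a = np.asarray(feats_a, dtype=np.float64)
    feats_b = np.asarray(feats_b, dtype=np.float64)

    # Fit a Gaussian (mean and covariance) to each feature set.
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.atleast_2d(np.cov(feats_a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(feats_b, rowvar=False))

    # tr((cov_a cov_b)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of cov_a @ cov_b, which are real and non-negative for
    # positive semi-definite inputs (clipping guards against round-off).
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)


# Synthetic stand-ins for in-domain and shifted features (assumption:
# real inputs would be feature vectors extracted from the MIL model).
rng = np.random.default_rng(0)
reference_feats = rng.normal(size=(500, 4))
# A mean shift of 3 in each of 4 dimensions with identical covariance
# gives a distance of about 4 * 3**2 = 36.
shifted_feats = reference_feats + 3.0

print(frechet_distance(reference_feats, reference_feats))
print(frechet_distance(reference_feats, shifted_feats))
```

The metric needs no labels from the target data, which is what makes this style of distance usable as an unsupervised shift indicator.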

Findings

01

FDD achieved a 0.70 correlation with performance changes.

02

FDD correlated more strongly with performance changes than the baseline shift-detection metrics.

03

MIL performance is affected by realistic domain differences.
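To make the correlation finding concrete, here is a minimal sketch of how a shift metric could be compared against performance changes across data subsets. It assumes a Pearson correlation, which this summary does not specify, and the numbers are synthetic placeholders, not results from the paper. The names `pearson_correlation`, `shift_scores`, and `performance_drop` are hypothetical.

```python
import numpy as np


def pearson_correlation(x, y):
    """Pearson correlation coefficient between two 1-D sequences."""
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    xc, yc = x - x.mean(), y - y.mean()
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))


# Synthetic placeholders (NOT values from the paper): one shift score and
# one drop in classification performance per evaluated data subset.
shift_scores = [1.0, 2.0, 3.0, 4.0]
performance_drop = [2.0, 4.0, 6.0, 8.0]

# A perfectly linear relationship gives a correlation of 1.0 (up to
# floating-point error).
print(pearson_correlation(shift_scores, performance_drop))
```

A useful shift metric is one whose value rises as the model's performance on the new data falls, so a higher correlation indicates a more informative metric.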

Abstract

Multiple-instance learning (MIL) is an attractive approach for digital pathology applications as it reduces the costs related to data collection and labelling. However, it is not clear how sensitive MIL is to clinically realistic domain shifts, i.e., differences in data distribution that could negatively affect performance, and if already existing metrics for detecting domain shifts work well with these algorithms. We trained an attention-based MIL algorithm to classify whether a whole-slide image of a lymph node contains breast tumour metastases. The algorithm was evaluated on data from a hospital in a different country and various subsets of this data that correspond to different levels of domain shift. Our contributions include showing that MIL for digital pathology is affected by clinically realistic differences in data, evaluating which features from a MIL model are most suitable…
