From Correlation to Causation: Max-Pooling-Based Multi-Instance Learning Leads to More Robust Whole Slide Image Classification
Xin Liu, Weijia Zhang, Wei Tang, Thuc Duy Le, Jiuyong Li, Lin Liu, Min-Ling Zhang

TL;DR
This paper introduces FocusMIL, a novel MIL method that combines max-pooling with a variational information bottleneck to focus on causal features, improving robustness and localization in whole slide image classification.
Contribution
It provides a causal perspective on max-pooling in MIL, proposes FocusMIL with VIB and multi-bag training, and demonstrates improved out-of-distribution performance.
Findings
FocusMIL outperforms existing methods in OOD scenarios.
It enhances tumor region localization accuracy.
Theoretical analysis links max-pooling to causal feature focus.
Abstract
In whole slide images (WSIs) analysis, attention-based multi-instance learning (MIL) models are susceptible to spurious correlations and degrade under domain shift. These methods may assign high attention weights to non-tumor regions, such as staining biases or artifacts, leading to unreliable tumor region localization. In this paper, we revisit max-pooling-based MIL methods from a causal perspective. Under mild assumptions, our theoretical results demonstrate that max-pooling encourages the model to focus on causal factors while ignoring bias-related factors. Furthermore, we discover that existing max-pooling-based methods may overfit the training set through rote memorization of instance features and fail to learn meaningful patterns. To address these issues, we propose FocusMIL, which couples max-pooling with an instance-level variational information bottleneck (VIB) to learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Image Retrieval and Classification Techniques · Digital Imaging for Blood Diseases
MethodsVariational Inference · Focus
