SAM-MIL: A Spatial Contextual Aware Multiple Instance Learning Approach for Whole Slide Image Classification
Heng Fang, Sheng Huang, Wenhao Tang, Luwen Huangfu, Bo Liu

TL;DR
SAM-MIL introduces a novel spatially aware multiple instance learning framework for whole slide image classification, leveraging the Segment Anything Model to incorporate global spatial context and improve accuracy over existing methods.
Contribution
The paper presents SAM-MIL, a new MIL approach that explicitly integrates spatial context using SAM, addressing limitations of local patch-based features in WSI classification.
Findings
Outperforms existing methods on CAMELYON-16 and TCGA datasets
Effectively incorporates spatial context through SAM-guided features
Enhances training with pseudo-bags and context consistency strategies
Abstract
Multiple Instance Learning (MIL) represents the predominant framework in Whole Slide Image (WSI) classification, covering aspects such as sub-typing, diagnosis, and beyond. Current MIL models predominantly rely on instance-level features derived from pretrained models such as ResNet. These models segment each WSI into independent patches and extract features from these local patches, leading to a significant loss of global spatial context and restricting the model's focus to merely local features. To address this issue, we propose a novel MIL framework, named SAM-MIL, that emphasizes spatial contextual awareness and explicitly incorporates spatial context by extracting comprehensive, image-level information. The Segment Anything Model (SAM) represents a pioneering visual segmentation foundational model that can capture segmentation features without the need for additional fine-tuning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Machine Learning and Data Classification
MethodsAverage Pooling · Max Pooling · Global Average Pooling · Focus · Convolution · Kaiming Initialization
