CAPRMIL: Context-Aware Patch Representations for Multiple Instance Learning
Andreas Lolos, Theofilos Christodoulou, Aris L. Moustakas, Stergios Christodoulidis, Maria Vakalopoulou

TL;DR
CAPRMIL introduces a simple, efficient framework for multiple instance learning in pathology that produces context-aware patch embeddings, achieving state-of-the-art performance with fewer parameters and lower computational costs.
Contribution
It proposes a novel, aggregator-agnostic approach that injects global context into patch embeddings using self-attention, simplifying MIL models while maintaining high accuracy.
Findings
Matches state-of-the-art performance on pathology benchmarks
Reduces trainable parameters by up to 92.8%
Lowers FLOPs during inference by up to 99%
Abstract
In computational pathology, weak supervision has become the standard for deep learning due to the gigapixel scale of WSIs and the scarcity of pixel-level annotations, with Multiple Instance Learning (MIL) established as the principal framework for slide-level model training. In this paper, we introduce a novel setting for MIL methods, inspired by proceedings in Neural Partial Differential Equation (PDE) Solvers. Instead of relying on complex attention-based aggregation, we propose an efficient, aggregator-agnostic framework that removes the complexity of correlation learning from the MIL aggregator. CAPRMIL produces rich context-aware patch embeddings that promote effective correlation learning on downstream tasks. By projecting patch features -- extracted using a frozen patch encoder -- into a small set of global context/morphology-aware tokens and utilizing multi-head self-attention,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Generative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques
