Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology
Wenhao Tang, Rong Qin, Heng Fang, Fengtao Zhou, Hao Chen, Xiang Li, Ming-Ming Cheng

TL;DR
This paper revisits end-to-end learning in computational pathology, addressing optimization challenges with a novel MIL called ABMILX, leading to state-of-the-art results with efficient training.
Contribution
It introduces ABMILX, a new MIL method that mitigates E2E optimization issues, enabling efficient training and improved performance in computational pathology.
Findings
ABMILX outperforms state-of-the-art models on multiple benchmarks.
E2E training with ABMILX is computationally efficient (<10 RTX3090 hours).
The study highlights the potential of E2E learning in computational pathology.
Abstract
Pre-trained encoders for offline feature extraction followed by multiple instance learning (MIL) aggregators have become the dominant paradigm in computational pathology (CPath), benefiting cancer diagnosis and prognosis. However, performance limitations arise from the absence of encoder fine-tuning for downstream tasks and disjoint optimization with MIL. While slide-level supervised end-to-end (E2E) learning is an intuitive solution to this issue, it faces challenges such as high computational demands and suboptimal results. These limitations motivate us to revisit E2E learning. We argue that prior work neglects inherent E2E optimization challenges, leading to performance disparities compared to traditional two-stage methods. In this paper, we pioneer the elucidation of optimization challenge caused by sparse-attention MIL and propose a novel MIL called ABMILX. It mitigates this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAI in cancer detection
MethodsSoftmax · Attention Is All You Need · Focus
