AHDMIL: Asymmetric Hierarchical Distillation Multi-Instance Learning for Fast and Accurate Whole-Slide Image Classification
Jiuyang Dong, Jiahan Li, Junjun Jiang, Kui Jiang, Yongbing Zhang

TL;DR
AHDMIL introduces an efficient hierarchical distillation framework for whole-slide image classification, significantly reducing inference time while improving accuracy by focusing on relevant patches through a two-step training process.
Contribution
The paper presents a novel asymmetric hierarchical distillation approach with a dual-branch network and a Chebyshev-polynomial-based classifier for improved speed and accuracy in pathology image analysis.
Findings
Achieves 5.3% accuracy improvement on Camelyon16
Speeds up inference by 1.2 to 2.1 times across datasets
Outperforms previous state-of-the-art methods in accuracy and efficiency
Abstract
Although multi-instance learning (MIL) has succeeded in pathological image classification, it faces the challenge of high inference costs due to the need to process thousands of patches from each gigapixel whole slide image (WSI). To address this, we propose AHDMIL, an Asymmetric Hierarchical Distillation Multi-Instance Learning framework that enables fast and accurate classification by eliminating irrelevant patches through a two-step training process. AHDMIL comprises two key components: the Dynamic Multi-Instance Network (DMIN), which operates on high-resolution WSIs, and the Dual-Branch Lightweight Instance Pre-screening Network (DB-LIPN), which analyzes corresponding low-resolution counterparts. In the first step, self-distillation (SD), DMIN is trained for WSI classification while generating per-instance attention scores to identify irrelevant patches. These scores guide the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
