AHDMIL: Asymmetric Hierarchical Distillation Multi-Instance Learning for Fast and Accurate Whole-Slide Image Classification

Jiuyang Dong; Jiahan Li; Junjun Jiang; Kui Jiang; Yongbing Zhang

arXiv:2508.05114·cs.CV·August 8, 2025

AHDMIL: Asymmetric Hierarchical Distillation Multi-Instance Learning for Fast and Accurate Whole-Slide Image Classification

Jiuyang Dong, Jiahan Li, Junjun Jiang, Kui Jiang, Yongbing Zhang

PDF

TL;DR

AHDMIL introduces an efficient hierarchical distillation framework for whole-slide image classification, significantly reducing inference time while improving accuracy by focusing on relevant patches through a two-step training process.

Contribution

The paper presents a novel asymmetric hierarchical distillation approach with a dual-branch network and a Chebyshev-polynomial-based classifier for improved speed and accuracy in pathology image analysis.

Findings

01

Achieves 5.3% accuracy improvement on Camelyon16

02

Speeds up inference by 1.2 to 2.1 times across datasets

03

Outperforms previous state-of-the-art methods in accuracy and efficiency

Abstract

Although multi-instance learning (MIL) has succeeded in pathological image classification, it faces the challenge of high inference costs due to the need to process thousands of patches from each gigapixel whole slide image (WSI). To address this, we propose AHDMIL, an Asymmetric Hierarchical Distillation Multi-Instance Learning framework that enables fast and accurate classification by eliminating irrelevant patches through a two-step training process. AHDMIL comprises two key components: the Dynamic Multi-Instance Network (DMIN), which operates on high-resolution WSIs, and the Dual-Branch Lightweight Instance Pre-screening Network (DB-LIPN), which analyzes corresponding low-resolution counterparts. In the first step, self-distillation (SD), DMIN is trained for WSI classification while generating per-instance attention scores to identify irrelevant patches. These scores guide the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.