TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification
Zhuchen Shao, Hao Bian, Yang Chen, Yifeng Wang, Jian Zhang, Xiangyang, Ji, Yongbing Zhang

TL;DR
TransMIL introduces a transformer-based correlated multiple instance learning framework that leverages morphological and spatial information, significantly improving whole slide image classification accuracy and interpretability in pathology diagnostics.
Contribution
It proposes a novel correlated MIL framework and TransMIL model that address instance correlation neglect in traditional MIL, with proven convergence and enhanced performance.
Findings
Achieved up to 93.09% AUC on CAMELYON16
Achieved up to 98.82% AUC on TCGA-RCC
Faster convergence and better visualization compared to state-of-the-art methods
Abstract
Multiple instance learning (MIL) is a powerful tool to solve the weakly supervised classification in whole slide image (WSI) based pathology diagnosis. However, the current MIL methods are usually based on independent and identical distribution hypothesis, thus neglect the correlation among different instances. To address this problem, we proposed a new framework, called correlated MIL, and provided a proof for convergence. Based on this framework, we devised a Transformer based MIL (TransMIL), which explored both morphological and spatial information. The proposed TransMIL can effectively deal with unbalanced/balanced and binary/multiple classification with great visualization and interpretability. We conducted various experiments for three different computational pathology problems and achieved better performance and faster convergence compared with state-of-the-art methods. The test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAI in cancer detection · Image Retrieval and Classification Techniques · Digital Imaging for Blood Diseases
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Layer Normalization · Residual Connection
