Multi-scale Efficient Graph-Transformer for Whole Slide Image Classification
Saisai Ding, Juncheng Li, Jun Wang, Shihui Ying, Jun Shi

TL;DR
This paper introduces MEGT, a multi-scale graph-transformer framework that efficiently processes gigapixel whole slide images by combining low- and high-resolution features for improved cancer diagnosis accuracy.
Contribution
The paper proposes a novel multi-scale graph-transformer architecture with a feature fusion module and token pruning for efficient WSI classification.
Findings
MEGT outperforms existing methods on TCGA-RCC and CAMELYON16 datasets.
The token pruning module accelerates training without sacrificing accuracy.
Multi-scale fusion improves the capture of spatial and semantic information.
Abstract
The multi-scale information among the whole slide images (WSIs) is essential for cancer diagnosis. Although the existing multi-scale vision Transformer has shown its effectiveness for learning multi-scale image representation, it still cannot work well on the gigapixel WSIs due to their extremely large image sizes. To this end, we propose a novel Multi-scale Efficient Graph-Transformer (MEGT) framework for WSI classification. The key idea of MEGT is to adopt two independent Efficient Graph-based Transformer (EGT) branches to process the low-resolution and high-resolution patch embeddings (i.e., tokens in a Transformer) of WSIs, respectively, and then fuse these tokens via a multi-scale feature fusion module (MFFM). Specifically, we design an EGT to efficiently learn the local-global information of patch tokens, which integrates the graph representation into Transformer to capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Pruning · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Adam
