Multi-scale Efficient Graph-Transformer for Whole Slide Image   Classification

Saisai Ding; Juncheng Li; Jun Wang; Shihui Ying; Jun Shi

arXiv:2305.15773·cs.CV·May 26, 2023·2 cites

Multi-scale Efficient Graph-Transformer for Whole Slide Image Classification

Saisai Ding, Juncheng Li, Jun Wang, Shihui Ying, Jun Shi

PDF

Open Access

TL;DR

This paper introduces MEGT, a multi-scale graph-transformer framework that efficiently processes gigapixel whole slide images by combining low- and high-resolution features for improved cancer diagnosis accuracy.

Contribution

The paper proposes a novel multi-scale graph-transformer architecture with a feature fusion module and token pruning for efficient WSI classification.

Findings

01

MEGT outperforms existing methods on TCGA-RCC and CAMELYON16 datasets.

02

The token pruning module accelerates training without sacrificing accuracy.

03

Multi-scale fusion improves the capture of spatial and semantic information.

Abstract

The multi-scale information among the whole slide images (WSIs) is essential for cancer diagnosis. Although the existing multi-scale vision Transformer has shown its effectiveness for learning multi-scale image representation, it still cannot work well on the gigapixel WSIs due to their extremely large image sizes. To this end, we propose a novel Multi-scale Efficient Graph-Transformer (MEGT) framework for WSI classification. The key idea of MEGT is to adopt two independent Efficient Graph-based Transformer (EGT) branches to process the low-resolution and high-resolution patch embeddings (i.e., tokens in a Transformer) of WSIs, respectively, and then fuse these tokens via a multi-scale feature fusion module (MFFM). Specifically, we design an EGT to efficiently learn the local-global information of patch tokens, which integrates the graph representation into Transformer to capture…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification

MethodsMulti-Head Attention · Attention Is All You Need · Pruning · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Label Smoothing · Adam