Local Attention Graph-based Transformer for Multi-target Genetic Alteration Prediction
Daniel Reisenb\"uchler, Sophia J. Wagner, Melanie Boxberg, Tingying, Peng

TL;DR
This paper introduces a local attention graph-based Transformer for multiple instance learning that effectively models local dependencies in large-scale images, achieving state-of-the-art mutation prediction in gastrointestinal cancer.
Contribution
The paper proposes LA-MIL, a local attention Transformer that restricts self-attention to local regimes, improving efficiency and performance in multi-target mutation prediction.
Findings
LA-MIL outperforms existing models on colorectal cancer biomarkers.
Local self-attention suffices to model dependencies comparable to global attention.
The approach achieves state-of-the-art results in mutation prediction for gastrointestinal cancer.
Abstract
Classical multiple instance learning (MIL) methods are often based on the identical and independent distributed assumption between instances, hence neglecting the potentially rich contextual information beyond individual entities. On the other hand, Transformers with global self-attention modules have been proposed to model the interdependencies among all instances. However, in this paper we question: Is global relation modeling using self-attention necessary, or can we appropriately restrict self-attention calculations to local regimes in large-scale whole slide images (WSIs)? We propose a general-purpose local attention graph-based Transformer for MIL (LA-MIL), introducing an inductive bias by explicitly contextualizing instances in adaptive local regimes of arbitrary size. Additionally, an efficiently adapted loss function enables our approach to learn expressive WSI embeddings for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Video Analysis and Summarization · Colorectal Cancer Screening and Detection
MethodsAttention Is All You Need · Linear Layer · Adam · Byte Pair Encoding · Absolute Position Encodings · Residual Connection · Label Smoothing · Position-Wise Feed-Forward Layer · Dense Connections · Dropout
