Fully transformer-based biomarker prediction from colorectal cancer histology: a large-scale multicentric study
Sophia J. Wagner, Daniel Reisenb\"uchler, Nicholas P. West, Jan Moritz, Niehues, Gregory Patrick Veldhuizen, Philip Quirke, Heike I. Grabsch, Piet A., van den Brandt, Gordon G. A. Hutchins, Susan D. Richman, Tanwei Yuan, Rupert, Langer, Josien Christina Anna Jenniskens

TL;DR
This study introduces a fully transformer-based deep learning pipeline for accurate biomarker prediction in colorectal cancer histology, trained on large multicenter datasets, outperforming existing methods in sensitivity, generalizability, and interpretability.
Contribution
The paper presents the first large-scale application of a fully transformer-based model for biomarker prediction in colorectal cancer histology, demonstrating superior performance and clinical utility.
Findings
Achieved 0.97 sensitivity and 0.99 NPV for MSI prediction.
Reaching clinical-grade performance on biopsy tissue.
Outperforming CNN-based approaches in accuracy and generalizability.
Abstract
Background: Deep learning (DL) can extract predictive and prognostic biomarkers from routine pathology slides in colorectal cancer. For example, a DL test for the diagnosis of microsatellite instability (MSI) in CRC has been approved in 2022. Current approaches rely on convolutional neural networks (CNNs). Transformer networks are outperforming CNNs and are replacing them in many applications, but have not been used for biomarker prediction in cancer at a large scale. In addition, most DL approaches have been trained on small patient cohorts, which limits their clinical utility. Methods: In this study, we developed a new fully transformer-based pipeline for end-to-end biomarker prediction from pathology slides. We combine a pre-trained transformer encoder and a transformer network for patch aggregation, capable of yielding single and multi-target prediction at patient level. We train…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Colorectal Cancer Screening and Detection · AI in cancer detection
MethodsMulti-Head Attention · Attention Is All You Need · Test · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Absolute Position Encodings · Dropout
