A comparative study between vision transformers and CNNs in digital pathology
Luca Deininger, Bernhard Stimpel, Anil Yuce, Samaneh, Abbasi-Sureshjani, Simon Sch\"onenberger, Paolo Ocampo, Konstanty Korski,, Fabien Gaire

TL;DR
This study compares vision transformers and CNNs in digital pathology, finding that vision transformers perform similarly to CNNs in tumor detection tasks, with potential advantages in feature detection but requiring more training effort.
Contribution
It provides a comparative analysis of vision transformers and CNNs in digital pathology, highlighting their performance and training considerations in tumor detection and tissue classification.
Findings
Vision transformers perform on par with CNNs in tumor detection.
Both models capture similar imaging features at slide level.
Vision transformers require more training effort to outperform CNNs.
Abstract
Recently, vision transformers were shown to be capable of outperforming convolutional neural networks when pretrained on sufficient amounts of data. In comparison to convolutional neural networks, vision transformers have a weaker inductive bias and therefore allow a more flexible feature detection. Due to their promising feature detection, this work explores vision transformers for tumor detection in digital pathology whole slide images in four tissue types, and for tissue type identification. We compared the patch-wise classification performance of the vision transformer DeiT-Tiny to the state-of-the-art convolutional neural network ResNet18. Due to the sparse availability of annotated whole slide images, we further compared both models pretrained on large amounts of unlabeled whole-slide images using state-of-the-art self-supervised approaches. The results show that the vision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Digital Imaging for Blood Diseases · Radiomics and Machine Learning in Medical Imaging
MethodsAttention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Residual Connection · Layer Normalization · Dense Connections · Vision Transformer
