Handcrafted Histological Transformer (H2T): Unsupervised Representation of Whole Slide Images
Quoc Dang Vu, Kashif Rajpoot, Shan E Ahmed Raza, Nasir Rajpoot

TL;DR
This paper introduces H2T, a handcrafted, transparent deep learning framework for representing whole-slide histological images, achieving competitive accuracy and significantly faster processing compared to existing Transformer-based methods.
Contribution
The paper presents a novel handcrafted approach that interprets Transformer processes for histological image analysis, enhancing transparency and speed while maintaining competitive performance.
Findings
H2T achieves comparable accuracy to state-of-the-art methods.
H2T is up to 14 times faster than traditional Transformer models.
The framework is effective across multiple datasets with over 5,300 WSIs.
Abstract
Diagnostic, prognostic and therapeutic decision-making of cancer in pathology clinics can now be carried out based on analysis of multi-gigapixel tissue images, also known as whole-slide images (WSIs). Recently, deep convolutional neural networks (CNNs) have been proposed to derive unsupervised WSI representations; these are attractive as they rely less on expert annotation which is cumbersome. However, a major trade-off is that higher predictive power generally comes at the cost of interpretability, posing a challenge to their clinical use where transparency in decision-making is generally expected. To address this challenge, we present a handcrafted framework based on deep CNN for constructing holistic WSI-level representations. Building on recent findings about the internal working of the Transformer in the domain of natural language processing, we break down its processes and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Colorectal Cancer Screening and Detection
MethodsAttention Is All You Need · Linear Layer · Softmax · Layer Normalization · Multi-Head Attention · Byte Pair Encoding · Dense Connections · Absolute Position Encodings · Dropout · Label Smoothing
