HVTSurv: Hierarchical Vision Transformer for Patient-Level Survival Prediction from Whole Slide Image
Zhuchen Shao, Yang Chen, Hao Bian, Jian Zhang, Guojun Liu, Yongbing, Zhang

TL;DR
HVTSurv introduces a hierarchical vision Transformer model that effectively captures spatial, contextual, and hierarchical interactions in whole slide images, significantly improving patient survival prediction accuracy across multiple cancer types.
Contribution
The paper proposes a novel hierarchical vision Transformer framework, HVTSurv, which encodes multi-level interactions in WSIs for better survival prediction, outperforming previous weakly supervised methods.
Findings
Average C-Index improved by 2.50-11.30% over prior methods
Validated on 6 TCGA datasets with 3,104 patients
Attention visualization confirms model's interpretability
Abstract
Survival prediction based on whole slide images (WSIs) is a challenging task for patient-level multiple instance learning (MIL). Due to the vast amount of data for a patient (one or multiple gigapixels WSIs) and the irregularly shaped property of WSI, it is difficult to fully explore spatial, contextual, and hierarchical interaction in the patient-level bag. Many studies adopt random sampling pre-processing strategy and WSI-level aggregation models, which inevitably lose critical prognostic information in the patient-level bag. In this work, we propose a hierarchical vision Transformer framework named HVTSurv, which can encode the local-level relative spatial information, strengthen WSI-level context-aware communication, and establish patient-level hierarchical interaction. Firstly, we design a feature pre-processing strategy, including feature rearrangement and random window masking.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Colorectal Cancer Screening and Detection · Image Retrieval and Classification Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Label Smoothing · Adam · Position-Wise Feed-Forward Layer · Residual Connection · Softmax
