DSCA: A Dual-Stream Network with Cross-Attention on Whole-Slide Image Pyramids for Cancer Prognosis
Pei Liu, Bo Fu, Feng Ye, Rui Yang, Bin Xu, and Luping Ji

TL;DR
This paper introduces DSCA, a dual-stream network with cross-attention for efficient and improved cancer prognosis using whole-slide images, addressing computational costs and semantic gaps in multi-resolution feature fusion.
Contribution
The paper proposes a novel dual-stream network with cross-attention that effectively fuses multi-resolution features in WSIs, reducing computational costs while improving prognosis accuracy.
Findings
Outperforms state-of-the-art methods with 4.6% higher C-Index.
More efficient with fewer computational costs despite more parameters.
Key components significantly improve model performance.
Abstract
The cancer prognosis on gigapixel Whole-Slide Images (WSIs) has always been a challenging task. To further enhance WSI visual representations, existing methods have explored image pyramids, instead of single-resolution images, in WSIs. In spite of this, they still face two major problems: high computational cost and the unnoticed semantical gap in multi-resolution feature fusion. To tackle these problems, this paper proposes to efficiently exploit WSI pyramids from a new perspective, the dual-stream network with cross-attention (DSCA). Our key idea is to utilize two sub-streams to process the WSI patches with two resolutions, where a square pooling is devised in a high-resolution stream to significantly reduce computational costs, and a cross-attention-based method is proposed to properly handle the fusion of dual-stream features. We validate our DSCA on three publicly-available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Advanced Image Processing Techniques
MethodsALIGN
