Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis

Pei Liu; Xiangxiang Zeng; Tengfei Ma; Yucheng Xing; Xuanbai Ren; Yiping Liu

arXiv:2603.10526·cs.CV·March 12, 2026

Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis

Pei Liu, Xiangxiang Zeng, Tengfei Ma, Yucheng Xing, Xuanbai Ren, Yiping Liu

PDF

Open Access

TL;DR

This paper introduces STEPH, a hypernetwork-based method that efficiently transfers knowledge across different cancer types in whole-slide image prognosis, outperforming existing methods with less computational cost.

Contribution

It proposes a novel sparse task vector mixup approach with hypernetworks for efficient multi-cancer knowledge transfer in prognosis tasks.

Findings

01

Achieves 5.14% improvement over cancer-specific models

02

Outperforms existing knowledge transfer baselines by 2.01%

03

Requires less computational resources than large-scale joint training

Abstract

Whole-Slide Images (WSIs) are widely used for estimating the prognosis of cancer patients. Current studies generally follow a cancer-specific learning paradigm. However, the available training samples for one cancer type are usually scarce in pathology. Consequently, the model often struggles to learn generalizable knowledge, thus performing worse on the tumor samples with inherent high heterogeneity. Although multi-cancer joint learning and knowledge transfer approaches have been explored recently to address it, they either rely on large-scale joint training or extensive inference across multiple models, posing new challenges in computational efficiency. To this end, this paper proposes a new scheme, Sparse Task Vector Mixup with Hypernetworks (STEPH). Unlike previous ones, it efficiently absorbs generalizable knowledge from other cancers for the target via model merging: i) applying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Domain Adaptation and Few-Shot Learning