Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation
Jiabo Ma, Zhengrui Guo, Fengtao Zhou, Yihui Wang, Yingxue Xu, Jinbang, Li, Fang Yan, Yu Cai, Zhengjie Zhu, Cheng Jin, Yi Lin, Xinrui Jiang,, Chenglong Zhao, Danyi Li, Anjia Han, Zhenhui Li, Ronald Cheong Kin Chan,, Jiguang Wang, Peng Fei, Kwang-Ting Cheng, Shaoting Zhang

TL;DR
This paper introduces a comprehensive benchmark for evaluating pathology foundation models across diverse clinical tasks and proposes a unified knowledge distillation framework to enhance their generalization, resulting in a new model that outperforms existing ones.
Contribution
The paper presents a large-scale benchmark for pathology models and develops a novel unified knowledge distillation framework to improve their generalization across multiple tasks.
Findings
Existing models excel in some tasks but lack generalization.
The proposed GPFM achieves top performance on 42 out of 72 tasks.
The benchmark reveals the limited generalization of current foundation models.
Abstract
Foundation models pretrained on large-scale datasets are revolutionizing the field of computational pathology (CPath). The generalization ability of foundation models is crucial for the success in various downstream clinical tasks. However, current foundation models have only been evaluated on a limited type and number of tasks, leaving their generalization ability and overall performance unclear. To address this gap, we established a most comprehensive benchmark to evaluate the performance of off-the-shelf foundation models across six distinct clinical task types, encompassing a total of 72 specific tasks, including slide-level classification, survival prediction, ROI-tissue classification, ROI retrieval, visual question answering, and report generation. Our findings reveal that existing foundation models excel at certain task types but struggle to effectively handle the full breadth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Biomedical Text Mining and Ontologies · Machine Learning in Healthcare
MethodsKnowledge Distillation
