Towards A Generalizable Pathology Foundation Model via Unified Knowledge   Distillation

Jiabo Ma; Zhengrui Guo; Fengtao Zhou; Yihui Wang; Yingxue Xu; Jinbang; Li; Fang Yan; Yu Cai; Zhengjie Zhu; Cheng Jin; Yi Lin; Xinrui Jiang,; Chenglong Zhao; Danyi Li; Anjia Han; Zhenhui Li; Ronald Cheong Kin Chan,; Jiguang Wang; Peng Fei; Kwang-Ting Cheng; Shaoting Zhang; Li Liang; Hao Chen

arXiv:2407.18449·eess.IV·April 15, 2025·5 cites

Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation

Jiabo Ma, Zhengrui Guo, Fengtao Zhou, Yihui Wang, Yingxue Xu, Jinbang, Li, Fang Yan, Yu Cai, Zhengjie Zhu, Cheng Jin, Yi Lin, Xinrui Jiang,, Chenglong Zhao, Danyi Li, Anjia Han, Zhenhui Li, Ronald Cheong Kin Chan,, Jiguang Wang, Peng Fei, Kwang-Ting Cheng, Shaoting Zhang

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces a comprehensive benchmark for evaluating pathology foundation models across diverse clinical tasks and proposes a unified knowledge distillation framework to enhance their generalization, resulting in a new model that outperforms existing ones.

Contribution

The paper presents a large-scale benchmark for pathology models and develops a novel unified knowledge distillation framework to improve their generalization across multiple tasks.

Findings

01

Existing models excel in some tasks but lack generalization.

02

The proposed GPFM achieves top performance on 42 out of 72 tasks.

03

The benchmark reveals the limited generalization of current foundation models.

Abstract

Foundation models pretrained on large-scale datasets are revolutionizing the field of computational pathology (CPath). The generalization ability of foundation models is crucial for the success in various downstream clinical tasks. However, current foundation models have only been evaluated on a limited type and number of tasks, leaving their generalization ability and overall performance unclear. To address this gap, we established a most comprehensive benchmark to evaluate the performance of off-the-shelf foundation models across six distinct clinical task types, encompassing a total of 72 specific tasks, including slide-level classification, survival prediction, ROI-tissue classification, ROI retrieval, visual question answering, and report generation. Our findings reveal that existing foundation models excel at certain task types but struggle to effectively handle the full breadth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

birkhoffkiki/gpfm
pytorch

Models

🤗
majiabo/GPFM
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Biomedical Text Mining and Ontologies · Machine Learning in Healthcare

MethodsKnowledge Distillation