Knowledge-enhanced Visual-Language Pretraining for Computational   Pathology

Xiao Zhou; Xiaoman Zhang; Chaoyi Wu; Ya Zhang; Weidi Xie; Yanfeng Wang

arXiv:2404.09942·cs.CV·September 17, 2024·1 cites

Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Xiao Zhou, Xiaoman Zhang, Chaoyi Wu, Ya Zhang, Weidi Xie, Yanfeng Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel knowledge-enhanced visual-language pretraining method for computational pathology, leveraging a large structured pathology knowledge base to improve visual representation learning and downstream task performance.

Contribution

It creates the first comprehensive pathology knowledge base and integrates it into a pretraining framework to enhance visual representations in pathology analysis.

Findings

01

Significant improvements in cross-modal retrieval accuracy.

02

Effective zero-shot classification on pathology patches.

03

Enhanced zero-shot tumor subtyping on whole slide images.

Abstract

In this paper, we consider the problem of visual representation learning for computational pathology, by exploiting large-scale image-text pairs gathered from public resources, along with the domain-specific knowledge in pathology. Specifically, we make the following contributions: (i) We curate a pathology knowledge tree that consists of 50,470 informative attributes for 4,718 diseases requiring pathology diagnosis from 32 human tissues. To our knowledge, this is the first comprehensive structured pathology knowledge base; (ii) We develop a knowledge-enhanced visual-language pretraining approach, where we first project pathology-specific knowledge into latent embedding space via a language model, and use it to guide the visual representation learning; (iii) We conduct thorough experiments to validate the effectiveness of our proposed components, demonstrating significant performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

magic-ai4med/kep
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Cancer-related molecular mechanisms research · Domain Adaptation and Few-Shot Learning