MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning

Minghao Han; Linhao Qu; Dingkang Yang; Xukun Zhang; Xiaoying Wang; Lihua Zhang

arXiv:2408.11505·cs.CV·September 10, 2025

MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning

Minghao Han, Linhao Qu, Dingkang Yang, Xukun Zhang, Xiaoying Wang, Lihua Zhang

PDF

Open Access 1 Repo

TL;DR

MSCPT introduces a novel multi-scale, context-aware prompt tuning approach leveraging vision-language models for effective few-shot whole slide image classification, addressing data scarcity and rare disease challenges.

Contribution

The paper proposes MSCPT, a multi-scale, context-focused prompt tuning method that fully utilizes VLMs' prior knowledge and instance aggregation for WSI classification in few-shot settings.

Findings

01

MSCPT outperforms existing methods on five datasets.

02

It effectively leverages multi-scale and contextual information.

03

The approach demonstrates strong interpretability and generalization.

Abstract

Multiple instance learning (MIL) has become a standard paradigm for the weakly supervised classification of whole slide images (WSIs). However, this paradigm relies on using a large number of labeled WSIs for training. The lack of training data and the presence of rare diseases pose significant challenges for these methods. Prompt tuning combined with pre-trained Vision-Language models (VLMs) is an effective solution to the Few-shot Weakly Supervised WSI Classification (FSWC) task. Nevertheless, applying prompt tuning methods designed for natural images to WSIs presents three significant challenges: 1) These methods fail to fully leverage the prior knowledge from the VLM's text modality; 2) They overlook the essential multi-scale and contextual information in WSIs, leading to suboptimal results; and 3) They lack exploration of instance aggregation methods. To address these problems, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hanminghao/mscpt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition