Initialization matters in few-shot adaptation of vision-language models for histopathological image classification

Pablo Meseguer; Roc\'io del Amor; Valery Naranjo

arXiv:2602.18766·cs.CV·February 24, 2026

Initialization matters in few-shot adaptation of vision-language models for histopathological image classification

Pablo Meseguer, Roc\'io del Amor, Valery Naranjo

PDF

Open Access

TL;DR

This paper introduces ZS-MIL, a novel initialization method for MIL classifiers in histopathological image classification that leverages VLM text embeddings, improving few-shot learning performance and robustness.

Contribution

The paper proposes ZS-MIL, a new initialization approach using class-level text embeddings to enhance few-shot MIL classification in histopathology.

Findings

01

ZS-MIL outperforms traditional weight initialization methods.

02

ZS-MIL demonstrates increased robustness and consistency.

03

Significant improvement in subtyping prediction accuracy.

Abstract

Vision language models (VLM) pre-trained on datasets of histopathological image-caption pairs enabled zero-shot slide-level classification. The ability of VLM image encoders to extract discriminative features also opens the door for supervised fine-tuning for whole-slide image (WSI) classification, ideally using few labeled samples. Slide-level prediction frameworks require the incorporation of multiple instance learning (MIL) due to the gigapixel size of the WSI. Following patch-level feature extraction and aggregation, MIL frameworks rely on linear classifiers trained on top of the slide-level aggregated features. Classifier weight initialization has a large influence on Linear Probing performance in efficient transfer learning (ETL) approaches based on few-shot learning. In this work, we propose Zero-Shot Multiple-Instance Learning (ZS-MIL) to address the limitations of random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Domain Adaptation and Few-Shot Learning · Digital Imaging for Blood Diseases