Initialization matters in few-shot adaptation of vision-language models for histopathological image classification
Pablo Meseguer, Roc\'io del Amor, Valery Naranjo

TL;DR
This paper introduces ZS-MIL, a novel initialization method for MIL classifiers in histopathological image classification that leverages VLM text embeddings, improving few-shot learning performance and robustness.
Contribution
The paper proposes ZS-MIL, a new initialization approach using class-level text embeddings to enhance few-shot MIL classification in histopathology.
Findings
ZS-MIL outperforms traditional weight initialization methods.
ZS-MIL demonstrates increased robustness and consistency.
Significant improvement in subtyping prediction accuracy.
Abstract
Vision language models (VLM) pre-trained on datasets of histopathological image-caption pairs enabled zero-shot slide-level classification. The ability of VLM image encoders to extract discriminative features also opens the door for supervised fine-tuning for whole-slide image (WSI) classification, ideally using few labeled samples. Slide-level prediction frameworks require the incorporation of multiple instance learning (MIL) due to the gigapixel size of the WSI. Following patch-level feature extraction and aggregation, MIL frameworks rely on linear classifiers trained on top of the slide-level aggregated features. Classifier weight initialization has a large influence on Linear Probing performance in efficient transfer learning (ETL) approaches based on few-shot learning. In this work, we propose Zero-Shot Multiple-Instance Learning (ZS-MIL) to address the limitations of random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Domain Adaptation and Few-Shot Learning · Digital Imaging for Blood Diseases
