Effortless Vision-Language Model Specialization in Histopathology without Annotation

Jingna Qiu; Nishanth Jain; Jonas Ammeling; Marc Aubreville; Katharina Breininger

arXiv:2508.07835·cs.CV·August 12, 2025

Effortless Vision-Language Model Specialization in Histopathology without Annotation

Jingna Qiu, Nishanth Jain, Jonas Ammeling, Marc Aubreville, Katharina Breininger

PDF

Open Access

TL;DR

This paper presents an annotation-free method for adapting vision-language models to histopathology tasks by continued pretraining on domain-specific image-caption pairs, improving performance without manual labels.

Contribution

It introduces a novel annotation-free adaptation approach via continued pretraining on existing image-caption data, enhancing VLMs for histopathology applications.

Findings

01

Continued pretraining improves zero-shot and few-shot performance.

02

Larger training sizes match few-shot results without manual labels.

03

The method is task-agnostic and effective across multiple tasks.

Abstract

Recent advances in Vision-Language Models (VLMs) in histopathology, such as CONCH and QuiltNet, have demonstrated impressive zero-shot classification capabilities across various tasks. However, their general-purpose design may lead to suboptimal performance in specific downstream applications. While supervised fine-tuning methods address this issue, they require manually labeled samples for adaptation. This paper investigates annotation-free adaptation of VLMs through continued pretraining on domain- and task-relevant image-caption pairs extracted from existing databases. Our experiments on two VLMs, CONCH and QuiltNet, across three downstream tasks reveal that these pairs substantially enhance both zero-shot and few-shot performance. Notably, with larger training sizes, continued pretraining matches the performance of few-shot methods while eliminating manual labeling. Its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Domain Adaptation and Few-Shot Learning · Digital Imaging for Blood Diseases