TL;DR
This paper introduces AttriPrompter, an auto-prompting method leveraging visual-language models for zero-shot nuclei detection in histopathology images, addressing the domain gap and improving detection accuracy without manual prompts.
Contribution
The paper presents a novel auto-prompting pipeline and a self-trained knowledge distillation framework to enhance zero-shot nuclei detection using VLPMs, specifically adapting GLIP for medical images.
Findings
Outperforms existing unsupervised methods in nuclei detection
Demonstrates strong generality across different datasets
Highlights potential of natural image-text pre-trained VLPMs for medical tasks
Abstract
Large-scale visual-language pre-trained models (VLPMs) have demonstrated exceptional performance in downstream object detection through text prompts for natural scenes. However, their application to zero-shot nuclei detection on histopathology images remains relatively unexplored, mainly due to the significant gap between the characteristics of medical images and the web-originated text-image pairs used for pre-training. This paper aims to investigate the potential of the object-level VLPM, Grounded Language-Image Pre-training (GLIP), for zero-shot nuclei detection. Specifically, we propose an innovative auto-prompting pipeline, named AttriPrompter, comprising attribute generation, attribute augmentation, and relevance sorting, to avoid subjective manual prompt design. AttriPrompter utilizes VLPMs' text-to-image alignment to create semantically rich text prompts, which are then fed into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
