AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot   Nuclei Detection via Visual-Language Pre-trained Models

Yongjian Wu; Yang Zhou; Jiya Saiyin; Bingzheng Wei; Maode Lai,; Jianzhong Shou; Yan Xu

arXiv:2410.16820·cs.CV·October 23, 2024

AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai,, Jianzhong Shou, Yan Xu

PDF

1 Repo

TL;DR

This paper introduces AttriPrompter, an auto-prompting method leveraging visual-language models for zero-shot nuclei detection in histopathology images, addressing the domain gap and improving detection accuracy without manual prompts.

Contribution

The paper presents a novel auto-prompting pipeline and a self-trained knowledge distillation framework to enhance zero-shot nuclei detection using VLPMs, specifically adapting GLIP for medical images.

Findings

01

Outperforms existing unsupervised methods in nuclei detection

02

Demonstrates strong generality across different datasets

03

Highlights potential of natural image-text pre-trained VLPMs for medical tasks

Abstract

Large-scale visual-language pre-trained models (VLPMs) have demonstrated exceptional performance in downstream object detection through text prompts for natural scenes. However, their application to zero-shot nuclei detection on histopathology images remains relatively unexplored, mainly due to the significant gap between the characteristics of medical images and the web-originated text-image pairs used for pre-training. This paper aims to investigate the potential of the object-level VLPM, Grounded Language-Image Pre-training (GLIP), for zero-shot nuclei detection. Specifically, we propose an innovative auto-prompting pipeline, named AttriPrompter, comprising attribute generation, attribute augmentation, and relevance sorting, to avoid subjective manual prompt design. AttriPrompter utilizes VLPMs' text-to-image alignment to create semantically rich text prompts, which are then fed into…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wuyongjiancode/attriprompter
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation