PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models
Can Cui, Ruining Deng, Junlin Guo, Quan Liu, Tianyuan Yao, Haichun, Yang, Yuankai Huo

TL;DR
This paper introduces a flexible, prompt-guided segmentation approach for pathological images using large vision and language models, enabling adaptable multi-class segmentation guided by physician input.
Contribution
It presents a novel pipeline combining finetuned language prompts with vision models for flexible, multi-class pathological segmentation, including a new kidney pathology dataset.
Findings
Flexible segmentation performance improves with prompt guidance
Free-text prompts outperform fixed prompts in accuracy
Model generalizes well to new cases during inference
Abstract
The Vision Foundation Model has recently gained attention in medical image analysis. Its zero-shot learning capabilities accelerate AI deployment and enhance the generalizability of clinical applications. However, segmenting pathological images presents a special focus on the flexibility of segmentation targets. For instance, a single click on a Whole Slide Image (WSI) could signify a cell, a functional unit, or layers, adding layers of complexity to the segmentation tasks. Current models primarily predict potential outcomes but lack the flexibility needed for physician input. In this paper, we explore the potential of enhancing segmentation model flexibility by introducing various task prompts through a Large Language Model (LLM) alongside traditional task tokens. Our contribution is in four-fold: (1) we construct a computational-efficient pipeline that uses finetuned language prompts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Artificial Intelligence in Healthcare and Education
MethodsSoftmax · Attention Is All You Need · Focus
