PromptSeg: An End-to-End Universal Medical Image Segmentation Method via Visual Prompts

Minfan Zhao; Bingxun Wang; Jun Shi; Hong An

PMC · DOI:10.3390/e28030342·March 18, 2026

PromptSeg: An End-to-End Universal Medical Image Segmentation Method via Visual Prompts

Minfan Zhao, Bingxun Wang, Jun Shi, Hong An

PDF

Open Access

TL;DR

PromptSeg is a new AI method for medical image segmentation that improves generalization across different tasks using visual prompts.

Contribution

PromptSeg introduces a universal framework for medical image segmentation using visual prompts and information-theoretic principles.

Findings

01

PromptSeg outperforms state-of-the-art methods on CT and MRI datasets.

02

The method shows strong generalization across multi-modality and unseen datasets.

03

Only a few annotated visual prompts are needed for new tasks without retraining.

Abstract

Deep learning has achieved remarkable advancements in medical image segmentation, yet its generalization capability across unseen tasks remains a significant challenge. The variety of task objectives, disease-dependent labeling variations, and multi-center data contribute to the high uncertainty of task-specific models on unseen distributions. In this study, we propose PromptSeg, an innovative Transformer-based unified framework for universal 2D medical image segmentation. From an information-theoretic perspective, PromptSeg formulates the segmentation process as a conditional entropy minimization problem, utilizing visual prompts as side information to reduce the uncertainty of the target task. Guided by the information bottleneck principle, PromptSeg aims to utilize the provided visual prompts to filter out redundant noise and learn contextual representations, thereby breaking the…

Figures9

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications