PromptSeg: An End-to-End Universal Medical Image Segmentation Method via Visual Prompts
Minfan Zhao, Bingxun Wang, Jun Shi, Hong An

TL;DR
PromptSeg is a Transformer-based method for universal 2D medical image segmentation that uses visual prompts to improve generalization to unseen tasks.
Contribution
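PromptSeg introduces a universal framework for medical image segmentation that formulates segmentation as conditional entropy minimization, using visual prompts as side information and the information bottleneck principle as a guide.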
Findings
PromptSeg outperforms state-of-the-art methods on CT and MRI datasets.
The method shows strong generalization across multi-modality and unseen datasets.
Only a few annotated visual prompts are needed for new tasks without retraining.
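To make "a few annotated visual prompts, no retraining" concrete, here is a minimal sketch of prompt-conditioned segmentation. It is not PromptSeg's Transformer architecture: it swaps in a simple nearest-prototype rule, and the function names, the array shapes, and the use of raw intensity as the feature are all assumptions made for illustration.

```python
import numpy as np

def prototypes_from_prompts(prompt_feats, prompt_masks):
    """Average the prompt features over foreground and background pixels.

    prompt_feats: (K, H, W, C) features of K annotated prompt images (hypothetical layout).
    prompt_masks: (K, H, W) binary masks, 1 = target structure.
    """
    feats = prompt_feats.reshape(-1, prompt_feats.shape[-1])
    masks = prompt_masks.reshape(-1).astype(bool)
    fg = feats[masks].mean(axis=0)   # foreground prototype
    bg = feats[~masks].mean(axis=0)  # background prototype
    return fg, bg

def segment_with_prompts(query_feats, fg, bg):
    """Label each query pixel by its nearer prototype; no weights are updated."""
    d_fg = np.linalg.norm(query_feats - fg, axis=-1)
    d_bg = np.linalg.norm(query_feats - bg, axis=-1)
    return (d_fg < d_bg).astype(np.uint8)

# Toy data: one prompt whose bright left half is annotated as the target.
prompt_feats = np.zeros((1, 4, 4, 1))
prompt_feats[0, :, :2, 0] = 1.0
prompt_masks = np.zeros((1, 4, 4))
prompt_masks[0, :, :2] = 1

# Query image with a bright 2x2 square in the centre.
query_feats = np.zeros((4, 4, 1))
query_feats[1:3, 1:3, 0] = 0.9

fg, bg = prototypes_from_prompts(prompt_feats, prompt_masks)
pred = segment_with_prompts(query_feats, fg, bg)
print(pred)
```

Changing the task here means changing only the prompt images and masks; the segmentation function itself stays fixed, which is the property the finding above describes.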
Abstract
Deep learning has achieved remarkable advancements in medical image segmentation, yet its generalization capability across unseen tasks remains a significant challenge. The variety of task objectives, disease-dependent labeling variations, and multi-center data contribute to the high uncertainty of task-specific models on unseen distributions. In this study, we propose PromptSeg, an innovative Transformer-based unified framework for universal 2D medical image segmentation. From an information-theoretic perspective, PromptSeg formulates the segmentation process as a conditional entropy minimization problem, utilizing visual prompts as side information to reduce the uncertainty of the target task. Guided by the information bottleneck principle, PromptSeg aims to utilize the provided visual prompts to filter out redundant noise and learn contextual representations, thereby breaking the…
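The conditional-entropy view in the abstract can be illustrated with a toy calculation. The sketch below assumes a made-up joint distribution over an image pattern X, a prompt P, and a label Y (none of these numbers come from the paper) and compares H(Y | X) with H(Y | X, P). Conditioning on extra side information never increases entropy, which is the general fact behind using prompts to reduce task uncertainty.

```python
import numpy as np

def entropy(p):
    """Shannon entropy in bits of a probability vector (zero entries are skipped)."""
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def conditional_entropy(joint):
    """H(Y | C) for a table joint[c, y] = P(C = c, Y = y)."""
    joint = np.asarray(joint, dtype=float)
    p_c = joint.sum(axis=1)
    h = 0.0
    for c in range(joint.shape[0]):
        if p_c[c] > 0:
            h += p_c[c] * entropy(joint[c] / p_c[c])
    return h

# Hypothetical P(Y = 1 | X = x, P = p): the prompt decides which structure counts as target.
q = np.array([[0.9, 0.1],
              [0.2, 0.8]])

# Joint table joint[x, p, y] with X and P uniform and independent (an assumption).
joint = np.zeros((2, 2, 2))
joint[:, :, 1] = 0.25 * q
joint[:, :, 0] = 0.25 * (1.0 - q)

h_y_given_x = conditional_entropy(joint.sum(axis=1))       # prompt marginalized out
h_y_given_xp = conditional_entropy(joint.reshape(4, 2))    # prompt observed

print(f"H(Y|X)   = {h_y_given_x:.3f} bits")
print(f"H(Y|X,P) = {h_y_given_xp:.3f} bits")
```

In this toy setup the label is maximally ambiguous from the image alone, because the same pattern is labeled differently under different tasks, and observing the prompt removes much of that ambiguity.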
Taxonomy
Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
