Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models
Hao Li, Han Liu, Dewei Hu, Jiacheng Wang, Ipek Oguz

TL;DR
ProMISe leverages pretrained 2D vision transformers and minimal prompts to effectively perform 3D medical image segmentation, overcoming domain discrepancies and anatomical variability with high accuracy.
Contribution
The paper introduces a prompt-driven 3D segmentation method using pretrained 2D models and lightweight adapters, enabling effective transfer learning without retraining the entire model.
Findings
Outperforms state-of-the-art segmentation methods on public datasets.
Uses only a single point prompt for 3D segmentation.
Achieves precise boundary delineation with boundary-aware loss.
Abstract
To address prevalent issues in medical imaging, such as data acquisition challenges and label availability, transfer learning from natural to medical image domains serves as a viable strategy to produce reliable segmentation results. However, several existing barriers between domains need to be broken down, including addressing contrast discrepancies, managing anatomical variability, and adapting 2D pretrained models for 3D segmentation tasks. In this paper, we propose ProMISe,a prompt-driven 3D medical image segmentation model using only a single point prompt to leverage knowledge from a pretrained 2D image foundation model. In particular, we use the pretrained vision transformer from the Segment Anything Model (SAM) and integrate lightweight adapters to extract depth-related (3D) spatial context without updating the pretrained weights. For robust results, a hybrid network with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · AI in cancer detection · Medical Image Segmentation Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Residual Connection · Dense Connections · Layer Normalization · Vision Transformer
