Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained   Image Foundation Models

Hao Li; Han Liu; Dewei Hu; Jiacheng Wang; Ipek Oguz

arXiv:2310.19721·eess.IV·November 15, 2023·1 cites

Promise:Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models

Hao Li, Han Liu, Dewei Hu, Jiacheng Wang, Ipek Oguz

PDF

Open Access 1 Repo

TL;DR

ProMISe leverages pretrained 2D vision transformers and minimal prompts to effectively perform 3D medical image segmentation, overcoming domain discrepancies and anatomical variability with high accuracy.

Contribution

The paper introduces a prompt-driven 3D segmentation method using pretrained 2D models and lightweight adapters, enabling effective transfer learning without retraining the entire model.

Findings

01

Outperforms state-of-the-art segmentation methods on public datasets.

02

Uses only a single point prompt for 3D segmentation.

03

Achieves precise boundary delineation with boundary-aware loss.

Abstract

To address prevalent issues in medical imaging, such as data acquisition challenges and label availability, transfer learning from natural to medical image domains serves as a viable strategy to produce reliable segmentation results. However, several existing barriers between domains need to be broken down, including addressing contrast discrepancies, managing anatomical variability, and adapting 2D pretrained models for 3D segmentation tasks. In this paper, we propose ProMISe,a prompt-driven 3D medical image segmentation model using only a single point prompt to leverage knowledge from a pretrained 2D image foundation model. In particular, we use the pretrained vision transformer from the Segment Anything Model (SAM) and integrate lightweight adapters to extract depth-related (3D) spatial context without updating the pretrained weights. For robust results, a hybrid network with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

medicl-vu/promise
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · AI in cancer detection · Medical Image Segmentation Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Residual Connection · Dense Connections · Layer Normalization · Vision Transformer