TextSAM-EUS: Text Prompt Learning for SAM to Accurately Segment Pancreatic Tumor in Endoscopic Ultrasound

Pascal Spiegler; Taha Koleilat; Arash Harirpoush; Corey S. Miller; Hassan Rivaz; Marta Kersten-Oertel; Yiming Xiao

arXiv:2507.18082·cs.CV·July 31, 2025

TextSAM-EUS: Text Prompt Learning for SAM to Accurately Segment Pancreatic Tumor in Endoscopic Ultrasound

Pascal Spiegler, Taha Koleilat, Arash Harirpoush, Corey S. Miller, Hassan Rivaz, Marta Kersten-Oertel, Yiming Xiao

PDF

TL;DR

TextSAM-EUS introduces a lightweight, text-driven adaptation of the SAM model for automatic pancreatic tumor segmentation in endoscopic ultrasound images, achieving state-of-the-art accuracy without manual prompts.

Contribution

It is the first to incorporate prompt learning into SAM-based medical image segmentation, enabling automatic, accurate tumor segmentation with minimal parameter tuning.

Findings

01

Achieves 82.69% Dice score with automatic prompts

02

Outperforms existing supervised and foundation models

03

Requires tuning only 0.86% of parameters

Abstract

Pancreatic cancer carries a poor prognosis and relies on endoscopic ultrasound (EUS) for targeted biopsy and radiotherapy. However, the speckle noise, low contrast, and unintuitive appearance of EUS make segmentation of pancreatic tumors with fully supervised deep learning (DL) models both error-prone and dependent on large, expert-curated annotation datasets. To address these challenges, we present TextSAM-EUS, a novel, lightweight, text-driven adaptation of the Segment Anything Model (SAM) that requires no manual geometric prompts at inference. Our approach leverages text prompt learning (context optimization) through the BiomedCLIP text encoder in conjunction with a LoRA-based adaptation of SAM's architecture to enable automatic pancreatic tumor segmentation in EUS, tuning only 0.86% of the total parameters. On the public Endoscopic Ultrasound Database of the Pancreas, TextSAM-EUS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.