Prompt learning with bounding box constraints for medical image segmentation

M\'elanie Gaillochet; Mehrdad Noori; Sahar Dastani; Christian Desrosiers; Herv\'e Lombaert

arXiv:2507.02743·cs.CV·July 4, 2025

Prompt learning with bounding box constraints for medical image segmentation

M\'elanie Gaillochet, Mehrdad Noori, Sahar Dastani, Christian Desrosiers, Herv\'e Lombaert

PDF

TL;DR

This paper introduces a novel weakly supervised medical image segmentation method that uses bounding box annotations to automate prompt generation for foundation models, achieving high accuracy with limited data.

Contribution

It presents a new framework that combines foundation models with bounding box annotations to automate prompt generation, reducing annotation effort in medical image segmentation.

Findings

01

Achieves an average Dice score of 84.90% with limited data.

02

Outperforms existing fully- and weakly-supervised methods.

03

Demonstrates effectiveness across multimodal datasets.

Abstract

Pixel-wise annotations are notoriously labourious and costly to obtain in the medical domain. To mitigate this burden, weakly supervised approaches based on bounding box annotations-much easier to acquire-offer a practical alternative. Vision foundation models have recently shown noteworthy segmentation performance when provided with prompts such as points or bounding boxes. Prompt learning exploits these models by adapting them to downstream tasks and automating segmentation, thereby reducing user intervention. However, existing prompt learning approaches depend on fully annotated segmentation masks. This paper proposes a novel framework that combines the representational power of foundation models with the annotation efficiency of weakly supervised segmentation. More specifically, our approach automates prompt generation for foundation models using only bounding box annotations. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.