MedSeg-R: Medical Image Segmentation with Clinical Reasoning

Hao Shao; Qibin Hou

arXiv:2506.18669·cs.CV·June 24, 2025

MedSeg-R: Medical Image Segmentation with Clinical Reasoning

Hao Shao, Qibin Hou

PDF

TL;DR

MedSeg-R introduces a dual-stage framework inspired by clinical reasoning that enhances medical image segmentation by integrating semantic priors and dynamic feature modulation, significantly improving detection of small and ambiguous lesions.

Contribution

The paper presents MedSeg-R, a novel lightweight framework that incorporates structured semantic priors into the segmentation process, improving generalization and sensitivity over existing methods.

Findings

01

Achieves large Dice score improvements on challenging benchmarks.

02

Effectively detects small and overlapping lesions.

03

Demonstrates compatibility with SAM-based systems.

Abstract

Medical image segmentation is challenging due to overlapping anatomies with ambiguous boundaries and a severe imbalance between the foreground and background classes, which particularly affects the delineation of small lesions. Existing methods, including encoder-decoder networks and prompt-driven variants of the Segment Anything Model (SAM), rely heavily on local cues or user prompts and lack integrated semantic priors, thus failing to generalize well to low-contrast or overlapping targets. To address these issues, we propose MedSeg-R, a lightweight, dual-stage framework inspired by inspired by clinical reasoning. Its cognitive stage interprets medical report into structured semantic priors (location, texture, shape), which are fused via transformer block. In the perceptual stage, these priors modulate the SAM backbone: spatial attention highlights likely lesion regions, dynamic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution · Segment Anything Model