CORE-Seg: Reasoning-Driven Segmentation for Complex Lesions via Reinforcement Learning

Yuxin Xie; Yuming Chen; Yishan Yang; Yi Zhou; Tao Zhou; Zhen Zhao; Jiacheng Liu; Huazhu Fu

arXiv:2603.05911·cs.CV·March 9, 2026

CORE-Seg: Reasoning-Driven Segmentation for Complex Lesions via Reinforcement Learning

Yuxin Xie, Yuming Chen, Yishan Yang, Yi Zhou, Tao Zhou, Zhen Zhao, Jiacheng Liu, Huazhu Fu

PDF

Open Access

TL;DR

CORE-Seg introduces a reasoning-driven segmentation framework that combines cognitive reasoning with pixel-level analysis, significantly improving accuracy and interpretability in complex lesion segmentation tasks.

Contribution

The paper presents CORE-Seg, a novel framework integrating reasoning with segmentation via a Semantic-Guided Prompt Adapter and a new reasoning benchmark for complex lesions.

Findings

01

Achieved a mean Dice score of 37.06%, surpassing baselines by 14.89%.

02

Reduced failure rate to 18.42%.

03

Introduced ComLesion-14K, a new reasoning-driven segmentation benchmark.

Abstract

Medical image segmentation is undergoing a paradigm shift from conventional visual pattern matching to cognitive reasoning analysis. Although Multimodal Large Language Models (MLLMs) have shown promise in integrating linguistic and visual knowledge, significant gaps remain: existing general MLLMs possess broad common sense but lack the specialized visual reasoning required for complex lesions, whereas traditional segmentation models excel at pixel-level segmentation but lack logical interpretability. In this paper, we introduce ComLesion-14K, the first diverse Chain-of-Thought (CoT) benchmark for reasoning-driven complex lesion segmentation. To accomplish this task, we propose CORE-Seg, an end-to-end framework integrating reasoning with segmentation through a Semantic-Guided Prompt Adapter. We design a progressive training strategy from SFT to GRPO, equipped with an adaptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning