CORE-Seg: Reasoning-Driven Segmentation for Complex Lesions via Reinforcement Learning
Yuxin Xie, Yuming Chen, Yishan Yang, Yi Zhou, Tao Zhou, Zhen Zhao, Jiacheng Liu, Huazhu Fu

TL;DR
CORE-Seg introduces a reasoning-driven segmentation framework that combines cognitive reasoning with pixel-level analysis, significantly improving accuracy and interpretability in complex lesion segmentation tasks.
Contribution
The paper presents CORE-Seg, a novel framework integrating reasoning with segmentation via a Semantic-Guided Prompt Adapter and a new reasoning benchmark for complex lesions.
Findings
Achieved a mean Dice score of 37.06%, surpassing baselines by 14.89%.
Reduced failure rate to 18.42%.
Introduced ComLesion-14K, a new reasoning-driven segmentation benchmark.
Abstract
Medical image segmentation is undergoing a paradigm shift from conventional visual pattern matching to cognitive reasoning analysis. Although Multimodal Large Language Models (MLLMs) have shown promise in integrating linguistic and visual knowledge, significant gaps remain: existing general MLLMs possess broad common sense but lack the specialized visual reasoning required for complex lesions, whereas traditional segmentation models excel at pixel-level segmentation but lack logical interpretability. In this paper, we introduce ComLesion-14K, the first diverse Chain-of-Thought (CoT) benchmark for reasoning-driven complex lesion segmentation. To accomplish this task, we propose CORE-Seg, an end-to-end framework integrating reasoning with segmentation through a Semantic-Guided Prompt Adapter. We design a progressive training strategy from SFT to GRPO, equipped with an adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
