MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

Zhonghao Yan; Muxi Diao; Yuxuan Yang; Ruoyan Jing; Jiayuan Xu; Kaizhou Zhang; Lele Yang; Yanxi Liu; Kongming Liang; Zhanyu Ma

arXiv:2508.08177·cs.CV·February 19, 2026

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

Zhonghao Yan, Muxi Diao, Yuxuan Yang, Ruoyan Jing, Jiayuan Xu, Kaizhou Zhang, Lele Yang, Yanxi Liu, Kongming Liang, Zhanyu Ma

PDF

Open Access 1 Video

TL;DR

MedReasoner leverages reinforcement learning to improve clinical reasoning and pixel-level grounding in medical imaging, addressing implicit queries and enhancing interpretability in medical diagnosis.

Contribution

The paper introduces UMRG, a new vision-language task, releases a comprehensive dataset, and proposes MedReasoner, a modular RL-based framework for medical grounding.

Findings

01

State-of-the-art performance on U-MRG-14K dataset

02

Strong generalization to unseen clinical queries

03

Reinforcement learning enhances interpretability and accuracy

Abstract

Accurately grounding regions of interest (ROIs) is critical for diagnosis and treatment planning in medical imaging. While multimodal large language models (MLLMs) combine visual perception with natural language, current medical-grounding pipelines still rely on supervised fine-tuning with explicit spatial hints, making them ill-equipped to handle the implicit queries common in clinical practice. This work makes three core contributions. We first define Unified Medical Reasoning Grounding (UMRG), a novel vision-language task that demands clinical reasoning and pixel-level grounding. Second, we release U-MRG-14K, a dataset of 14K samples featuring pixel-level masks alongside implicit clinical queries and reasoning traces, spanning 10 modalities, 15 super-categories, and 108 specific categories. Finally, we introduce MedReasoner, a modular framework that distinctly separates reasoning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Machine Learning in Healthcare · Topic Modeling