MMedExpert-R1: Strengthening Multimodal Medical Reasoning via Domain-Specific Adaptation and Clinical Guideline Reinforcement
Meidan Ding, Jipeng Zhang, Wenxuan Wang, Haiqin Zhong, Xiaoling Luo, Wenting Chen, Linlin Shen

TL;DR
This paper introduces MMedExpert-R1, a multimodal medical reasoning model that leverages domain-specific adaptation and clinical guideline reinforcement to improve complex clinical reasoning across multiple specialties.
Contribution
It proposes a novel framework combining specialty-specific modules and clinical guidelines, along with a new high-quality dataset for training and evaluation.
Findings
Achieved state-of-the-art results on MedXpert-MM and OmniMedVQA datasets.
Developed a high-quality dataset of 10K samples with reasoning traces.
Demonstrated robust multi-specialty clinical reasoning capabilities.
Abstract
Medical Vision-Language Models (MedVLMs) excel at perception tasks but struggle with complex clinical reasoning required in real-world scenarios. While reinforcement learning (RL) has been explored to enhance reasoning capabilities, existing approaches face critical mismatches: the scarcity of deep reasoning data, cold-start limits multi-specialty alignment, and standard RL algorithms fail to model clinical reasoning diversity. We propose MMedExpert-R1, a novel reasoning MedVLM that addresses these challenges through domain-specific adaptation and clinical guideline reinforcement. We construct MMedExpert, a high-quality dataset of 10K samples across four specialties with step-by-step reasoning traces. Our Domain-Specific Adaptation (DSA) creates specialty-specific LoRA modules to provide diverse initialization, while Guideline-Based Advantages (GBA) explicitly models different clinical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education
