MRG-R1: Reinforcement Learning for Clinically Aligned Medical Report Generation

Pengyu Wang; Shuchang Ye; Usman Naseem; Jinman Kim

arXiv:2512.16145·cs.CL·March 30, 2026

MRG-R1: Reinforcement Learning for Clinically Aligned Medical Report Generation

Pengyu Wang, Shuchang Ye, Usman Naseem, Jinman Kim

PDF

TL;DR

This paper introduces MRG-R1, a reinforcement learning framework that directly optimizes clinical correctness in medical report generation, improving accuracy over traditional token-level training methods.

Contribution

It proposes a semantic-driven reinforcement learning approach with a report-level reward function to enhance medical report accuracy and clinical relevance.

Findings

01

Improves accuracy and coverage of clinically relevant findings.

02

Achieves state-of-the-art clinical efficacy on IU X-Ray and MIMIC-CXR datasets.

03

Outperforms token-level likelihood training methods.

Abstract

Medical report generation aims to automatically produce radiology-style reports from medical images, supporting efficient and accurate clinical decision-making.However, existing approaches predominately rely on token-level likelihood training, which favors local lexical matching and leaves clinical correctness under-specified in the training objective. This behavior can be attributed to token-level likelihood optimization, which rewards surface-form agreement and therefore fails to directly encode constraints on medically accurate findings. To address this objective mismatch, we introduce a semantic-driven reinforcement learning (SRL) framework for medical report generation, named MRG-R1, which directly optimizes report-level clinical correctness rather than token-level likelihood. The key module is a clinically grounded report-level reward function, which reinforces semantic agreement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.