MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding

Guangjing Yang; Ziyuan Qin; Chaoran Zhang; Chenlin Du; Jinlin Wang; Wanran Sun; Zhenyu Zhang; Bing Ji; Qicheng Lao

arXiv:2603.28120·cs.CV·March 31, 2026

MedLoc-R1: Performance-Aware Curriculum Reward Scheduling for GRPO-Based Medical Visual Grounding

Guangjing Yang, Ziyuan Qin, Chaoran Zhang, Chenlin Du, Jinlin Wang, Wanran Sun, Zhenyu Zhang, Bing Ji, Qicheng Lao

PDF

1 Repo

TL;DR

MedLoc-R1 introduces a performance-aware reward scheduling method for reinforcement learning in medical visual grounding, progressively tightening reward criteria to improve localization accuracy and training stability.

Contribution

It proposes a novel reward scheduling framework that adapts reward strictness based on model readiness without extra networks, enhancing RL-based medical grounding.

Findings

01

Consistently improves localization accuracy over baselines.

02

Enhances training stability in RL medical grounding tasks.

03

Effective across multiple medical visual grounding benchmarks.

Abstract

Medical visual grounding serves as a crucial foundation for fine-grained multimodal reasoning and interpretable clinical decision support. Despite recent advances in reinforcement learning (RL) for grounding tasks, existing approaches such as Group Relative Policy Optimization~(GRPO) suffer from severe reward sparsity when directly applied to medical images, primarily due to the inherent difficulty of localizing small or ambiguous regions of interest, which is further exacerbated by the rigid and suboptimal nature of fixed IoU-based reward schemes in RL. This leads to vanishing policy gradients and stagnated optimization, particularly during early training. To address this challenge, we propose MedLoc-R1, a performance-aware reward scheduling framework that progressively tightens the reward criterion in accordance with model readiness. MedLoc-R1 introduces a sliding-window performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MembrAI/MedLoc-R1
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.