CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning

Eric Onyame; Akash Ghosh; Subhadip Baidya; Sriparna Saha; Xiuying Chen; Chirag Agarwal

arXiv:2601.13262·cs.AI·April 28, 2026

CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning

Eric Onyame, Akash Ghosh, Subhadip Baidya, Sriparna Saha, Xiuying Chen, Chirag Agarwal

PDF

2 Repos 5 Models 1 Datasets

TL;DR

This paper introduces CURE-MED, a reinforcement learning framework and multilingual dataset to improve medical reasoning in large language models across thirteen languages, emphasizing logical correctness and language stability.

Contribution

It presents a new multilingual medical reasoning dataset and a curriculum-informed reinforcement learning method that enhances reasoning accuracy and language consistency in LLMs.

Findings

01

Achieved 85.21% language consistency at 7B parameters.

02

Achieved 54.35% logical correctness at 7B parameters.

03

Code and dataset are publicly available at https://cure-med.github.io/.

Abstract

While large language models (LLMs) have shown to perform well on monolingual mathematical and commonsense reasoning, they remain unreliable for multilingual medical reasoning applications, hindering their deployment in multilingual healthcare settings. We address this by first introducing CUREMED-BENCH, a high-quality multilingual medical reasoning dataset with open-ended reasoning queries with a single verifiable answer, spanning thirteen languages, including underrepresented languages such as Amharic, Yoruba, and Swahili. Building on this dataset, we propose CURE-MED, a curriculum-informed reinforcement learning framework that integrates code-switching-aware supervised fine-tuning and Group Relative Policy Optimization to jointly improve logical correctness and language stability. Across thirteen languages, our approach consistently outperforms strong baselines and scales effectively,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Aikyam-Lab/CUREMED-BENCH
dataset· 125 dl
125 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.