Real-World Doctor Agent with Proactive Consultation through Multi-Agent Reinforcement Learning

Yichun Feng; Jiawei Wang; Lu Zhou; Yikai Zheng; Zhen Lei; Yixue Li

arXiv:2505.19630·cs.CL·May 1, 2026

Real-World Doctor Agent with Proactive Consultation through Multi-Agent Reinforcement Learning

Yichun Feng, Jiawei Wang, Lu Zhou, Yikai Zheng, Zhen Lei, Yixue Li

PDF

1 Repo 2 Models

TL;DR

This paper introduces DoctorAgent-RL, a reinforcement learning multi-agent system trained on a new medical dialogue dataset, which improves diagnostic accuracy and interaction quality in clinical consultations.

Contribution

It presents a novel RL-based framework for medical dialogue, emphasizing strategic questioning and dynamic decision-making, with a new dataset for training and evaluation.

Findings

01

DoctorAgent-RL achieved a 70% exact diagnostic match rate.

02

The system outperformed frontier models in clinical diagnosis tasks.

03

Rigorous evaluations confirmed its effectiveness in real-world scenarios.

Abstract

Large language models (LLMs) struggle in real-world clinical consultations. Single-turn consultation systems require patients to describe all symptoms at once, which often leads to unclear complaints and vague diagnoses. Traditional dialogue models, constrained by static supervised learning, are limited to superficially imitating existing dialogue patterns and lack the ability to actively construct understanding in dynamic interactions, thus failing to achieve genuine clinical reasoning.To address these challenges, we propose DoctorAgent-RL, a reinforcement learning (RL)-based multi-agent collaborative framework, and train a doctor agent on Qwen2.5-7B-Instruct using this framework. Within this framework, a medical consultation is modeled as a dynamic decision-making process under uncertainty. The core intelligence of the doctor agent is shifted from knowing the answer to learning and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jarvisustc/DoctorAgent-RL
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.