TL;DR
This paper introduces DoctorAgent-RL, a reinforcement learning multi-agent system trained on a new medical dialogue dataset, which improves diagnostic accuracy and interaction quality in clinical consultations.
Contribution
It presents a novel RL-based framework for medical dialogue, emphasizing strategic questioning and dynamic decision-making, with a new dataset for training and evaluation.
Findings
DoctorAgent-RL achieved a 70% exact diagnostic match rate.
The system outperformed frontier models in clinical diagnosis tasks.
Rigorous evaluations confirmed its effectiveness in real-world scenarios.
Abstract
Large language models (LLMs) struggle in real-world clinical consultations. Single-turn consultation systems require patients to describe all symptoms at once, which often leads to unclear complaints and vague diagnoses. Traditional dialogue models, constrained by static supervised learning, are limited to superficially imitating existing dialogue patterns and lack the ability to actively construct understanding in dynamic interactions, thus failing to achieve genuine clinical reasoning.To address these challenges, we propose DoctorAgent-RL, a reinforcement learning (RL)-based multi-agent collaborative framework, and train a doctor agent on Qwen2.5-7B-Instruct using this framework. Within this framework, a medical consultation is modeled as a dynamic decision-making process under uncertainty. The core intelligence of the doctor agent is shifted from knowing the answer to learning and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
