TL;DR
PatientSim is a customizable, open-source simulator generating diverse, realistic patient personas for clinical training and evaluation of doctor language models, grounded in real-world data and validated by clinicians.
Contribution
The paper introduces PatientSim, a novel, persona-driven patient simulator that enhances realism and diversity in medical dialogue systems using real-world data and expert validation.
Findings
PatientSim generates 37 unique patient personas.
Llama 3.3 70B performs best among tested LLMs.
Clinicians validate the robustness of PatientSim.
Abstract
Doctor-patient consultations require multi-turn, context-aware communication tailored to diverse patient personas. Training or evaluating doctor LLMs in such settings requires realistic patient interaction systems. However, existing simulators often fail to reflect the full range of personas seen in clinical practice. To address this, we introduce PatientSim, a patient simulator that generates realistic and diverse patient personas for clinical scenarios, grounded in medical expertise. PatientSim operates using: 1) clinical profiles, including symptoms and medical history, derived from real-world data in the MIMIC-ED and MIMIC-IV datasets, and 2) personas defined by four axes: personality, language proficiency, medical history recall level, and cognitive confusion level, resulting in 37 unique combinations. We evaluate eight LLMs for factual accuracy and persona consistency. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
