Towards Conversational Diagnostic AI
Tao Tu, Anil Palepu, Mike Schaekermann, Khaled Saab, Jan Freyberg,, Ryutaro Tanno, Amy Wang, Brenna Li, Mohamed Amin, Nenad Tomasev, Shekoofeh, Azizi, Karan Singhal, Yong Cheng, Le Hou, Albert Webson, Kavita Kulkarni, S, Sara Mahdavi, Christopher Semturs, Juraj Gottweis

TL;DR
This paper introduces AMIE, an AI system based on large language models designed for diagnostic dialogue, demonstrating superior performance to primary care physicians in a comprehensive evaluation, marking progress towards conversational diagnostic AI.
Contribution
The paper presents AMIE, a novel LLM-based diagnostic dialogue system with a self-play environment and a new evaluation framework, achieving higher accuracy than physicians in simulated clinical scenarios.
Findings
AMIE outperformed primary care physicians in diagnostic accuracy.
AMIE scored higher on most performance axes from specialists and patient actors.
The system demonstrated potential for scalable, conversational diagnostic support.
Abstract
At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introduce AMIE (Articulate Medical Intelligence Explorer), a Large Language Model (LLM) based AI system optimized for diagnostic dialogue. AMIE uses a novel self-play based simulated environment with automated feedback mechanisms for scaling learning across diverse disease conditions, specialties, and contexts. We designed a framework for evaluating clinically-meaningful axes of performance including history-taking, diagnostic accuracy, management reasoning, communication skills, and empathy. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsClinical Reasoning and Diagnostic Skills · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
