Complementary Human-AI Clinical Reasoning in Ophthalmology
Mertcan Sevgi, Fares Antaki, Abdullah Zafar Khan, Ariel Yuhan Ong, David Adrian Merle, Kuang Hu, Shafi Balal, Sophie-Christin Kornelia Ernst, Josef Huemer, Gabriel T. Kaufmann, Hagar Khalid, Faye Levina, Celeste Limoli, Ana Paula Ribeiro Reis, Samir Touma, Anil Palepu

TL;DR
This study evaluates AMIE, an AI-powered conversational system for ophthalmology, demonstrating its potential to support clinicians by improving diagnostic accuracy and decision-making through interactive reasoning and comparison with expert answers.
Contribution
The paper introduces AMIE, a fine-tuned AI system that enhances ophthalmic clinical reasoning and demonstrates its complementary role alongside human clinicians in real-world scenarios.
Findings
AMIE's diagnostic performance is comparable to clinicians at baseline.
Clinicians improved their diagnostic rankings and plans after reviewing AMIE's outputs.
AMIE's support led to increased agreement among clinicians and enriched clinical reasoning.
Abstract
Vision impairment and blindness are a major global health challenge where gaps in the ophthalmology workforce limit access to specialist care. We evaluate AMIE, a medically fine-tuned conversational system based on Gemini with integrated web search and self-critique reasoning, using real-world clinical vignettes that reflect scenarios a general ophthalmologist would be expected to manage. We conducted two complementary evaluations: (1) a human-AI interactive diagnostic reasoning study in which ophthalmologists recorded initial differentials and plans, then reviewed AMIE's structured output and revised their answers; and (2) a masked preference and quality study comparing AMIE's narrative outputs with case author reference answers using a predefined rubric. AMIE showed standalone diagnostic performance comparable to clinicians at baseline. Crucially, after reviewing AMIE's responses,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
