A comparative study of artificial intelligence and human doctors for the purpose of triage and diagnosis
Salman Razzaki, Adam Baker, Yura Perov, Katherine Middleton, Janie, Baxter, Daniel Mullarkey, Davinder Sangar, Michael Taliercio, Mobasher Butt,, Azeem Majeed, Arnold DoRosario, Megan Mahoney, Saurabh Johri

TL;DR
This study evaluates an AI triage and diagnostic system against human doctors, finding comparable diagnostic accuracy and safer triage recommendations, highlighting AI's potential to enhance patient care.
Contribution
It provides a prospective validation comparing AI and human doctors in triage and diagnosis, demonstrating AI's safety and accuracy in clinical decision-making.
Findings
AI system matches human doctors in diagnostic accuracy
AI triage recommendations are safer than those of humans
Study uses independent judges and benchmark vignettes for validation
Abstract
Online symptom checkers have significant potential to improve patient care, however their reliability and accuracy remain variable. We hypothesised that an artificial intelligence (AI) powered triage and diagnostic system would compare favourably with human doctors with respect to triage and diagnostic accuracy. We performed a prospective validation study of the accuracy and safety of an AI powered triage and diagnostic system. Identical cases were evaluated by both an AI system and human doctors. Differential diagnoses and triage outcomes were evaluated by an independent judge, who was blinded from knowing the source (AI system or human doctor) of the outcomes. Independently of these cases, vignettes from publicly available resources were also assessed to provide a benchmark to previous studies and the diagnostic component of the MRCGP exam. Overall we found that the Babylon AI powered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsClinical Reasoning and Diagnostic Skills · Radiology practices and education · Artificial Intelligence in Healthcare and Education
