Improving Clinical Documentation with AI: A Comparative Study of Sporo AI Scribe and GPT-4o mini
Chanseo Lee, Sonu Kumar, Kimon A. Vogt, Sam Meraj

TL;DR
This study compares Sporo AI Scribe and GPT-4o Mini in clinical documentation, showing Sporo's superior accuracy, clinician satisfaction, and reliability in real-world healthcare settings.
Contribution
It introduces a multi-agent system leveraging fine-tuned medical LLMs and provides a comparative evaluation against GPT-4o Mini in clinical documentation tasks.
Findings
Sporo AI outperforms GPT-4o Mini in recall, precision, and F1 scores.
Clinicians rated Sporo summaries higher for accuracy and relevance.
Sporo AI demonstrates effective, reliable documentation with fewer hallucinations.
Abstract
AI-powered medical scribes have emerged as a promising solution to alleviate the documentation burden in healthcare. Ambient AI scribes provide real-time transcription and automated data entry into Electronic Health Records (EHRs), with the potential to improve efficiency, reduce costs, and enhance scalability. Despite early success, the accuracy of AI scribes remains critical, as errors can lead to significant clinical consequences. Additionally, AI scribes face challenges in handling the complexity and variability of medical language and ensuring the privacy of sensitive patient data. This case study aims to evaluate Sporo Health's AI scribe, a multi-agent system leveraging fine-tuned medical LLMs, by comparing its performance with OpenAI's GPT-4o Mini on multiple performance metrics. Using a dataset of de-identified patient conversation transcripts, AI-generated summaries were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)
