Evaluating the Impact of a Specialized LLM on Physician Experience in Clinical Decision Support: A Comparison of Ask Avo and ChatGPT-4
Daniel Jung, Alex Butler, Joongheum Park, Yair Saperstein

TL;DR
This study compares Ask Avo, a specialized LLM for clinical decision support, with ChatGPT-4, showing Ask Avo's superior performance in trustworthiness, relevance, and usability in simulated clinical scenarios.
Contribution
The paper introduces Ask Avo, a tailored LLM with proprietary retrieval and citation features, demonstrating improved physician experience over general-purpose models.
Findings
Ask Avo significantly outperforms ChatGPT-4 in all evaluated criteria.
Specialized LLMs tailored for clinicians enhance trust and usability.
Evidence-based design improves clinical decision support tools.
Abstract
The use of Large language models (LLMs) to augment clinical decision support systems is a topic with rapidly growing interest, but current shortcomings such as hallucinations and lack of clear source citations make them unreliable for use in the clinical environment. This study evaluates Ask Avo, an LLM-derived software by AvoMD that incorporates a proprietary Language Model Augmented Retrieval (LMAR) system, in-built visual citation cues, and prompt engineering designed for interactions with physicians, against ChatGPT-4 in end-user experience for physicians in a simulated clinical scenario environment. Eight clinical questions derived from medical guideline documents in various specialties were prompted to both models by 62 study participants, with each response rated on trustworthiness, actionability, relevancy, comprehensiveness, and friendly format from 1 to 5. Ask Avo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Clinical Reasoning and Diagnostic Skills
