A Super-Learner with Large Language Models for Medical Emergency Advising
Sergey K. Aityan, Abdolreza Mosaddegh, Rolando Herrero, Haitham Tayyar, Jiang Han, Vikram Sawant, Qi Chen, Rishabh Jain, Aruna Senthamaraikannan, Stephen Wood, Manuel Mersini, Rita Lazzaro, Mario Balzanelli, Nicola Iacovazzo, and Ciro Gargiulo Isacco

TL;DR
This paper introduces MEDAS, a super-learner system combining multiple large language models to improve diagnostic accuracy in emergency medicine, surpassing individual models and human performance.
Contribution
The study develops a novel super-learner framework that integrates multiple LLMs with meta-learning to enhance diagnostic accuracy in medical emergencies.
Findings
Super-learner achieves 70% accuracy, higher than individual LLMs.
At least one LLM in the ensemble reaches 85% accuracy.
Meta-learning improves collective diagnostic performance.
Abstract
Medical decision-support and advising systems are critical for emergency physicians to quickly and accurately assess patients' conditions and make diagnosis. Artificial Intelligence (AI) has emerged as a transformative force in healthcare in recent years and Large Language Models (LLMs) have been employed in various fields of medical decision-support systems. We studied responses of a group of different LLMs to real cases in emergency medicine. The results of our study on five most renown LLMs showed significant differences in capabilities of Large Language Models for diagnostics acute diseases in medical emergencies with accuracy ranging between 58% and 65%. This accuracy significantly exceeds the reported accuracy of human doctors. We built a super-learner MEDAS (Medical Emergency Diagnostic Advising System) of five major LLMs - Gemini, Llama, Grok, GPT, and Claude). The super-learner…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · COVID-19 diagnosis using AI · Machine Learning in Healthcare
