Expert-Guided Prompting and Retrieval-Augmented Generation for Emergency Medical Service Question Answering
Xueren Ge, Sahil Murtaza, Anthony Cortez, Homa Alemzadeh

TL;DR
This paper introduces EMSQA, a specialized dataset and methods for improving medical question answering in emergency services by leveraging expert-guided prompting and retrieval-augmented generation, leading to higher accuracy and certification pass rates.
Contribution
The paper presents EMSQA, a new dataset with clinical expertise annotations, and introduces Expert-CoT and ExpertRAG methods that incorporate domain-specific knowledge for enhanced LLM performance.
Findings
Expert-CoT improves accuracy by up to 2.05% over vanilla CoT.
ExpertRAG combined with Expert-CoT yields up to 4.59% accuracy gain.
Expert-augmented LLMs pass EMS certification exams.
Abstract
Large language models (LLMs) have shown promise in medical question answering, yet they often overlook the domain-specific expertise that professionals depend on, such as the clinical subject areas (e.g., trauma, airway) and the certification level (e.g., EMT, Paramedic). Existing approaches typically apply general-purpose prompting or retrieval strategies without leveraging this structured context, limiting performance in high-stakes settings. We address this gap with EMSQA, an 24.3K-question multiple-choice dataset spanning 10 clinical subject areas and 4 certification levels, accompanied by curated, subject area-aligned knowledge bases (40K documents and 2M tokens). Building on EMSQA, we introduce (i) Expert-CoT, a prompting strategy that conditions chain-of-thought (CoT) reasoning on specific clinical subject area and certification level, and (ii) ExpertRAG, a retrieval-augmented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Expert finding and Q&A systems · Multimodal Machine Learning Applications
