MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Qinyue Zheng; Salman Abdullah; Sam Rawal; Cyril Zakka; Sophie Ostmeier; Maximilian Purk; Eduardo Reis; Eric J. Topol; Jure Leskovec; Michael Moor

arXiv:2506.06091·cs.CL·June 10, 2025

MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Qinyue Zheng, Salman Abdullah, Sam Rawal, Cyril Zakka, Sophie Ostmeier, Maximilian Purk, Eduardo Reis, Eric J. Topol, Jure Leskovec, Michael Moor

PDF

Open Access 3 Datasets

TL;DR

MIRIAD is a large, curated medical QA dataset that enhances LLMs' accuracy and reliability in healthcare by providing structured, high-quality question-answer pairs grounded in peer-reviewed literature.

Contribution

The paper introduces MIRIAD, a novel large-scale, curated medical QA corpus with a semi-automated pipeline, improving LLM performance and hallucination detection in medical applications.

Findings

01

Augmentation with MIRIAD improves LLM accuracy by up to 6.7%.

02

MIRIAD enhances LLMs' ability to detect medical hallucinations by 22.5-37%.

03

MIRIAD enables targeted retrieval with structured QA pairs.

Abstract

LLMs are bound to transform healthcare with advanced decision support and flexible chat assistants. However, LLMs are prone to generate inaccurate medical content. To ground LLMs in high-quality medical knowledge, LLMs have been equipped with external knowledge via RAG, where unstructured medical knowledge is split into small text chunks that can be selectively retrieved and integrated into the LLMs context. Yet, existing RAG pipelines rely on raw, unstructured medical text, which can be noisy, uncurated and difficult for LLMs to effectively leverage. Systematic approaches to organize medical knowledge to best surface it to LLMs are generally lacking. To address these challenges, we introduce MIRIAD, a large-scale, curated corpus of 5,821,948 medical QA pairs, each rephrased from and grounded in a passage from peer-reviewed medical literature using a semi-automated pipeline combining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Artificial Intelligence in Healthcare and Education