# A Dataset of Medical Questions Paired with Automatically Generated Answers and Evidence-supported References

**Authors:** Deepak Gupta, Davis Bartels, Dina Demner-Fushman

PMC · DOI: 10.1038/s41597-025-05233-z · 2025-06-19

## TL;DR

This paper introduces MedAESQA, a dataset for evaluating and improving medical question-answering systems by linking answers to evidence from scientific sources.

## Contribution

The novel contribution is a dataset with medical questions and answers linked to supporting scientific evidence for evaluating factual accuracy.

## Key findings

- The dataset includes 40 deidentified medical questions with 30 human and LLM-generated answers each.
- Each answer statement is linked to a scientific abstract, with manual judgments on accuracy and relevance.
- The dataset supports the development of models that can attribute facts to reliable sources.

## Abstract

New Large Language Models (LLM)-based approaches to medical Question Answering show unprecedented improvements in the fluency, grammaticality, and other qualities of the generated answers. However, the systems occasionally produce coherent, topically relevant, and plausible answers that are not based on facts and may be misleading and even harmful. New types of datasets are needed to evaluate the truthfulness of generated answers and develop reliable approaches for detecting answers that are not supported by evidence. The MedAESQA (Medical Attributable and Evidence Supported Question Answering) dataset presented in this work is designed for developing, fine-tuning, and evaluating language generation models for their ability to attribute or support the stated facts by linking the statements to the relevant passages of reliable sources. The dataset comprises 40 naturally occurring aggregated deidentified questions. Each question has 30 human and LLM-generated answers in which each statement is linked to a scientific abstract that supports it. The dataset provides manual judgments on the accuracy of the statements and the relevancy of the scientific papers.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12179289/full.md

---
Source: https://tomesphere.com/paper/PMC12179289