MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge   Letters

Amin Dada; Osman Alperen Koras; Marie Bauer; Amanda Butler; Kaleb E.; Smith; Jens Kleesiek; Julian Friedrich

arXiv:2502.03298·cs.CL·February 6, 2025

MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters

Amin Dada, Osman Alperen Koras, Marie Bauer, Amanda Butler, Kaleb E., Smith, Jens Kleesiek, Julian Friedrich

PDF

Open Access 1 Video

TL;DR

This paper introduces MeDiSumQA, a new dataset for evaluating large language models in generating patient-friendly answers from discharge summaries, aiming to improve medical communication and patient understanding.

Contribution

The creation of MeDiSumQA dataset from discharge summaries and its use to evaluate LLMs for patient-oriented question-answering is a novel contribution.

Findings

01

General-purpose LLMs outperform biomedical models in this task.

02

Automated metrics show good correlation with human judgment.

03

Releasing the dataset aims to foster further research in patient-centered medical AI.

Abstract

While increasing patients' access to medical documents improves medical care, this benefit is limited by varying health literacy levels and complex medical terminology. Large language models (LLMs) offer solutions by simplifying medical information. However, evaluating LLMs for safe and patient-friendly text generation is difficult due to the lack of standardized evaluation resources. To fill this gap, we developed MeDiSumQA. MeDiSumQA is a dataset created from MIMIC-IV discharge summaries through an automated pipeline combining LLM-based question-answer generation with manual quality checks. We use this dataset to evaluate various LLMs on patient-oriented question-answering. Our findings reveal that general-purpose LLMs frequently surpass biomedical-adapted models, while automated metrics correlate with human judgment. By releasing MeDiSumQA on PhysioNet, we aim to advance the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters· underline

Taxonomy

TopicsTopic Modeling · Text Readability and Simplification