Lessons from the TREC Plain Language Adaptation of Biomedical Abstracts (PLABA) track

Brian Ondov; William Xia; Kush Attal; Ishita Unde; Jerry He; Dina Demner-Fushman

arXiv:2507.14096·cs.CL·July 23, 2025

Lessons from the TREC Plain Language Adaptation of Biomedical Abstracts (PLABA) track

Brian Ondov, William Xia, Kush Attal, Ishita Unde, Jerry He, Dina Demner-Fushman

PDF

TL;DR

The PLABA track evaluated language models' ability to adapt biomedical abstracts into plain language, revealing promising results but also highlighting challenges in automatic evaluation and term simplification.

Contribution

This study introduces a rigorous evaluation framework for biomedical text simplification using large language models, with comprehensive manual and automatic assessments.

Findings

01

Top models matched human accuracy in factual correctness

02

Automatic metrics poorly correlated with manual judgments

03

LLMs showed strength in generating accurate and complete simplified text

Abstract

Objective: Recent advances in language models have shown potential to adapt professional-facing biomedical literature to plain language, making it accessible to patients and caregivers. However, their unpredictability, combined with the high potential for harm in this domain, means rigorous evaluation is necessary. Our goals with this track were to stimulate research and to provide high-quality evaluation of the most promising systems. Methods: We hosted the Plain Language Adaptation of Biomedical Abstracts (PLABA) track at the 2023 and 2024 Text Retrieval Conferences. Tasks included complete, sentence-level, rewriting of abstracts (Task 1) as well as identifying and replacing difficult terms (Task 2). For automatic evaluation of Task 1, we developed a four-fold set of professionally-written references. Submissions for both Tasks 1 and 2 were provided extensive manual evaluation from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.