PERCS: Persona-Guided Controllable Biomedical Summarization Dataset

Rohan Charudatt Salvi; Chirag Chawla; Dhruv Jain; Swapnil Panigrahi; Md Shad Akhtar; Shweta Yadav

arXiv:2512.03340·cs.CL·December 4, 2025

PERCS: Persona-Guided Controllable Biomedical Summarization Dataset

Rohan Charudatt Salvi, Chirag Chawla, Dhruv Jain, Swapnil Panigrahi, Md Shad Akhtar, Shweta Yadav

PDF

Open Access

TL;DR

PERCS is a new dataset of biomedical abstracts with summaries tailored to four distinct personas, enabling research on audience-specific biomedical text summarization and communication.

Contribution

This paper introduces PERCS, a novel dataset with persona-specific biomedical summaries, along with validation and benchmarking of large language models for controllable summarization.

Findings

01

Distinct readability and vocabulary across personas

02

Benchmark results for large language models on PERCS

03

Dataset and guidelines publicly available

Abstract

Automatic medical text simplification plays a key role in improving health literacy by making complex biomedical research accessible to diverse readers. However, most existing resources assume a single generic audience, overlooking the wide variation in medical literacy and information needs across user groups. To address this limitation, we introduce PERCS (Persona-guided Controllable Summarization), a dataset of biomedical abstracts paired with summaries tailored to four personas: Laypersons, Premedical Students, Non-medical Researchers, and Medical Experts. These personas represent different levels of medical literacy and information needs, emphasizing the need for targeted, audience-specific summarization. Each summary in PERCS was reviewed by physicians for factual accuracy and persona alignment using a detailed error taxonomy. Technical validation shows clear differences in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPersona Design and Applications · Text Readability and Simplification · Topic Modeling