Large Language Models for Biomedical Text Simplification: Promising But Not There Yet

Zihao Li; Samuel Belkadi; Nicolo Micheletti; Lifeng Han; Matthew Shardlow; Goran Nenadic

arXiv:2408.03871·cs.CL·October 23, 2024·2 cites

TL;DR

This paper evaluates several large language models for biomedical abstract simplification. The results are promising, but the task remains challenging and is not yet solved.

Contribution

The paper presents a system that combines fine-tuned models (T5-like models and BART-Large with control tokens) with ChatGPT prompting for biomedical text simplification, and it reports competitive results in the official PLABA2023 evaluations.

Findings

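01 BeeManc ranks 2nd among all teams in the official automatic evaluation (SARI scores)

02 LaySciFive ranks 3rd among all 13 evaluated systems in the automatic evaluation

03 BART-w-CTs ranks 2nd on Sentence-Simplicity (92.84) and 3rd on Term-Simplicity (82.33) among 7 systems in the human evaluation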

Abstract

In this system report, we describe the models and methods we used for our participation in the PLABA2023 task on biomedical abstract simplification, part of the TAC 2023 tracks. The system outputs we submitted come from the following three categories: 1) domain fine-tuned T5-like models, including Biomedical-T5 and Lay-SciFive; 2) a fine-tuned BART-Large model with controllable attributes (via tokens), BART-w-CTs; 3) ChatGPT prompting. We also present the work we carried out for this task on BioGPT fine-tuning. In the official automatic evaluation using SARI scores, BeeManc ranks 2nd among all teams and our model LaySciFive ranks 3rd among all 13 evaluated systems. In the official human evaluation, our model BART-w-CTs ranks 2nd on Sentence-Simplicity (score 92.84) and 3rd on Term-Simplicity (score 82.33) among all 7 evaluated systems; it also produced a high score of 91.57 on Fluency in comparison…
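The abstract mentions a BART-Large model steered by control tokens but does not show the format here. As a rough illustration only, the general idea of prepending attribute tokens to a source sentence before it reaches a sequence-to-sequence model might be sketched as follows. The helper name `add_control_tokens`, the attribute names `LENGTH_RATIO` and `WORD_RANK`, the `<NAME_value>` token format, and the example sentence are all hypothetical and are not taken from the paper or its repository.

```python
def add_control_tokens(source: str, controls: dict[str, float]) -> str:
    """Prepend one control token per attribute to the source sentence.

    The token format <NAME_value> is an illustrative assumption, not the
    format used by BART-w-CTs.
    """
    # Build tokens in the order the attributes were given.
    tokens = [f"<{name}_{value:.2f}>" for name, value in controls.items()]
    # The model input is the tokens followed by the unchanged source text.
    return " ".join(tokens + [source])


# Hypothetical attributes: a target length ratio and a target word-rank ratio.
example = add_control_tokens(
    "Myocardial infarction is a leading cause of mortality.",
    {"LENGTH_RATIO": 0.8, "WORD_RANK": 0.75},
)
print(example)
```

In a setup like this, the same tokens would be attached to training inputs so the model can learn to associate each value with the degree of simplification requested; whether the authors' system works this way would need to be checked against the paper.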

Code & Models

Repositories

hecta-uom/plaba-mu
Official

Taxonomy

Topics: Text Readability and Simplification