WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articles
Tabea M. G. Pakull, Hendrik Damm, Ahmad Idrissi-Yaghir, Henning, Sch\"afer, Peter A. Horn, Christoph M. Friedrich

TL;DR
This paper presents WisPerMed's approach to adapting large language models for biomedical lay summarization, using fine-tuning, few-shot learning, and a dynamic selection mechanism to improve readability and factuality of scientific article summaries.
Contribution
The paper introduces a novel combination of fine-tuning, prompt engineering, and dynamic output selection for biomedical lay summarization with LLMs, achieving competitive results.
Findings
Fine-tuning outperformed other methods across metrics.
Few-shot learning enhanced relevance and factuality.
Dynamic Expert Selection improved summary quality.
Abstract
This paper details the efforts of the WisPerMed team in the BioLaySumm2024 Shared Task on automatic lay summarization in the biomedical domain, aimed at making scientific publications accessible to non-specialists. Large language models (LLMs), specifically the BioMistral and Llama3 models, were fine-tuned and employed to create lay summaries from complex scientific texts. The summarization performance was enhanced through various approaches, including instruction tuning, few-shot learning, and prompt variations tailored to incorporate specific context information. The experiments demonstrated that fine-tuning generally led to the best performance across most evaluated metrics. Few-shot learning notably improved the models' ability to generate relevant and factually accurate texts, particularly when using a well-crafted prompt. Additionally, a Dynamic Expert Selection (DES) mechanism to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques
