Enhancing Biomedical Text Summarization and Question-Answering: On the   Utility of Domain-Specific Pre-Training

Dima Galat; Marian-Andrei Rizoiu

arXiv:2307.04412·cs.CL·July 11, 2023·1 cites

Enhancing Biomedical Text Summarization and Question-Answering: On the Utility of Domain-Specific Pre-Training

Dima Galat, Marian-Andrei Rizoiu

PDF

Open Access

TL;DR

This paper demonstrates that in biomedical text summarization, a three-step fine-tuning approach with general-domain pre-training can outperform domain-specific pre-training, especially with limited in-domain data.

Contribution

It introduces a novel three-step fine-tuning method that leverages general-domain pre-training for biomedical summarization, challenging the assumption that domain-specific pre-training is always superior.

Findings

01

General-domain pre-training followed by task-specific fine-tuning improves performance.

02

In-domain pre-training does not always provide advantages in biomedical summarization.

03

A large language model without domain-specific pre-training can excel in biomedical text generation.

Abstract

Biomedical summarization requires large datasets to train for text generation. We show that while transfer learning offers a viable option for addressing this challenge, an in-domain pre-training does not always offer advantages in a BioASQ summarization task. We identify a suitable model architecture and use it to show a benefit of a general-domain pre-training followed by a task-specific fine-tuning in the context of a BioASQ summarization task, leading to a novel three-step fine-tuning approach that works with only a thousand in-domain examples. Our results indicate that a Large Language Model without domain-specific pre-training can have a significant edge in some domain-specific biomedical text generation tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques