To Adapt or to Fine-tune: A Case Study on Abstractive Summarization

Zheng Zhao; Pinzhen Chen

arXiv:2208.14559·cs.CL·September 1, 2022

To Adapt or to Fine-tune: A Case Study on Abstractive Summarization

Zheng Zhao, Pinzhen Chen

PDF

Open Access 1 Repo

TL;DR

This paper compares fine-tuning and adapter-based methods for abstractive summarization, finding fine-tuning generally performs better with more data, but adapters excel in low-resource scenarios, providing insights into their practical use.

Contribution

It offers a comprehensive investigation into the effectiveness of fine-tuning versus adapters for summarization across various transfer scenarios, highlighting conditions favoring each approach.

Findings

01

Fine-tuning outperforms adapters with sufficient data.

02

Adapters are more effective in extremely low-resource settings.

03

Insights on multilinguality, convergence, and robustness are provided.

Abstract

Recent advances in the field of abstractive summarization leverage pre-trained language models rather than train a model from scratch. However, such models are sluggish to train and accompanied by a massive overhead. Researchers have proposed a few lightweight alternatives such as smaller adapters to mitigate the drawbacks. Nonetheless, it remains uncertain whether using adapters benefits the task of summarization, in terms of improved efficiency without an unpleasant sacrifice in performance. In this work, we carry out multifaceted investigations on fine-tuning and adapters for summarization tasks with varying complexity: language, domain, and task transfer. In our experiments, fine-tuning a pre-trained language model generally attains a better performance than using adapters; the performance gap positively correlates with the amount of training data used. Notably, adapters exceed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zsquaredz/adapt_vs_finetune
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification