Abstractive Text Summarization Using the BRIO Training Paradigm

Khang Nhut Lam; Thieu Gia Doan; Khang Thua Pham; Jugal Kalita

arXiv:2305.13696 · cs.CL · September 1, 2023 · 1 citation



TL;DR

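This paper applies the BRIO training paradigm to abstractive summarization: pre-trained language models are fine-tuned and then trained with BRIO, which assumes a non-deterministic distribution to reduce the model's dependence on reference summaries. The resulting models improve on prior results for CNNDM and the new VieSum dataset, especially for Vietnamese.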

Contribution

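It shows that fine-tuning pre-trained language models and then training them with the BRIO paradigm improves abstractive summarization by reducing dependence on reference summaries. It also builds VieSum, a Vietnamese text summarization dataset, and evaluates the approach on CNNDM (English) and VieSum (Vietnamese).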

Findings

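01

Models trained with the BRIO paradigm outperform all existing abstractive summarization models on the CNNDM and VieSum datasets.

02

The improvement is most pronounced for Vietnamese summarization.

03

The models were trained on basic hardware.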

Abstract

Summary sentences produced by abstractive summarization models may be coherent and comprehensive, but they lack control and rely heavily on reference summaries. The BRIO training paradigm assumes a non-deterministic distribution to reduce the model's dependence on reference summaries, and improve model performance during inference. This paper presents a straightforward but effective technique to improve abstractive summaries by fine-tuning pre-trained language models, and training them with the BRIO paradigm. We build a text summarization dataset for Vietnamese, called VieSum. We perform experiments with abstractive summarization models trained with the BRIO paradigm on the CNNDM and the VieSum datasets. The results show that the models, trained on basic hardware, outperform all existing abstractive summarization models, especially for Vietnamese.
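The abstract does not spell out the training objective. As a rough illustration only, the sketch below assumes the contrastive ranking formulation described in the original BRIO work (Liu et al., 2022): candidate summaries are ordered from best to worst by a quality metric such as ROUGE, and the model is penalized whenever it assigns a worse candidate a higher length-normalized log-probability than a better one. The function names and the default values of `alpha`, `margin`, and `gamma` are hypothetical and are not taken from this paper.

```python
def length_normalized_score(token_logprobs, alpha=1.0):
    """Score a candidate as the sum of its token log-probabilities
    divided by (length ** alpha). alpha is an assumed length penalty."""
    return sum(token_logprobs) / (len(token_logprobs) ** alpha)


def ranking_loss(scores, margin=0.001):
    """Pairwise margin ranking loss over candidate scores.

    `scores` must be ordered from the best candidate to the worst
    according to an external quality metric (for example ROUGE against
    the reference). The margin grows with the rank gap (j - i).
    """
    loss = 0.0
    n = len(scores)
    for i in range(n):
        for j in range(i + 1, n):
            # Penalize the pair when the worse candidate (j) is not
            # scored at least (j - i) * margin below the better one (i).
            loss += max(0.0, scores[j] - scores[i] + (j - i) * margin)
    return loss


def combined_loss(cross_entropy, scores, gamma=1.0, margin=0.001):
    """Weighted sum of the usual cross-entropy term and the ranking term.
    gamma is an assumed weighting hyperparameter."""
    return cross_entropy + gamma * ranking_loss(scores, margin)
```

When the model's scores already agree with the quality ordering by at least the margin, the ranking term is zero and only the cross-entropy term remains. Because the ranking term depends on the relative order of several candidates rather than on a single target, it is one way to loosen the model's dependence on the reference summary.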


Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques