Inverse Reinforcement Learning for Text Summarization

Yu Fu; Deyi Xiong; Yue Dong

arXiv:2212.09917·cs.CL·December 6, 2023

Inverse Reinforcement Learning for Text Summarization

Yu Fu, Deyi Xiong, Yue Dong

PDF

Open Access

TL;DR

This paper presents an inverse reinforcement learning approach for training abstractive text summarization models, which better imitates human summarization and improves multiple evaluation metrics across diverse datasets.

Contribution

It introduces IRL for summarization, estimating reward functions from human-like behaviors, and demonstrates its effectiveness over traditional methods like MLE and RL.

Findings

01

IRL model outperforms MLE and RL baselines

02

Summaries have higher ROUGE, coverage, and novelty scores

03

Model achieves better factuality and human evaluation results

Abstract

We introduce inverse reinforcement learning (IRL) as an effective paradigm for training abstractive summarization models, imitating human summarization behaviors. Our IRL model estimates the reward function using a suite of important sub-rewards for summarization and concurrently optimizes the policy network. Experimental results across datasets in different domains (CNN/DailyMail and WikiHow) and various model sizes (BART-base and BART-large) demonstrate the superiority of our proposed IRL model for summarization over MLE and RL baselines. The resulting summaries exhibit greater similarity to human-crafted gold references, outperforming MLE and RL baselines on metrics such as ROUGE, coverage, novelty, compression ratio, factuality, and human evaluations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques