Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness

Yun-Zhu Song; Yi-Syuan Chen; Hong-Han Shuai

arXiv:2205.01889 · cs.CL · May 5, 2022

TL;DR

This paper introduces a hierarchical extract-then-abstract Transformer framework for multi-document summarization, utilizing loss weighting and reinforcement learning to improve extraction quality and summary coherence, achieving state-of-the-art results.

Contribution

The paper proposes a novel loss weighting mechanism and reinforcement learning approach to enhance extract-then-abstract models for multi-document summarization.

Findings

1. Outperforms strong baselines on multiple datasets
2. Achieves the best results on Multi-News, Multi-XScience, and WikiCatSum
3. Effectively balances training and testing objectives

Abstract

A notable challenge in Multi-Document Summarization (MDS) is the extreme length of the input. In this paper, we present an extract-then-abstract Transformer framework to overcome this problem. Specifically, we leverage pre-trained language models to construct a hierarchical extractor for salient sentence selection across documents and an abstractor for rewriting the selected contents as summaries. However, learning such a framework is challenging since the optimal contents for the abstractor are generally unknown. Previous works typically create a pseudo extraction oracle to enable supervised learning for both the extractor and the abstractor. Nevertheless, we argue that the performance of such methods could be restricted due to insufficient information for prediction and inconsistent objectives between training and testing. To this end, we propose a loss weighting mechanism…
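
The pseudo extraction oracle mentioned in the abstract is commonly built by greedy selection: sentences are added one at a time as long as they raise an overlap score against the reference summary. The sketch below illustrates that general idea only; it is not taken from the paper or its repository. The function names (`unigram_f1`, `greedy_oracle`), the whitespace tokenization, and the use of unigram-overlap F1 as a stand-in for ROUGE are all assumptions made for illustration.

```python
from collections import Counter


def unigram_f1(candidate_tokens, reference_tokens):
    """Unigram-overlap F1, a simple stand-in for ROUGE-1 (assumption)."""
    if not candidate_tokens or not reference_tokens:
        return 0.0
    overlap = sum((Counter(candidate_tokens) & Counter(reference_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(candidate_tokens)
    recall = overlap / len(reference_tokens)
    return 2 * precision * recall / (precision + recall)


def greedy_oracle(documents, reference, max_sentences=3):
    """Greedily pick sentences across all documents that raise the score.

    documents: list of documents, each a list of sentence strings.
    Returns (doc_index, sentence_index) pairs in selection order.
    Hypothetical helper, not the paper's oracle construction.
    """
    ref_tokens = reference.lower().split()
    candidates = [
        (d, s, sent.lower().split())
        for d, doc in enumerate(documents)
        for s, sent in enumerate(doc)
    ]
    selected, selected_tokens, best_score = [], [], 0.0
    while len(selected) < max_sentences:
        best_gain, best_candidate = 0.0, None
        for d, s, tokens in candidates:
            if (d, s) in selected:
                continue
            score = unigram_f1(selected_tokens + tokens, ref_tokens)
            if score - best_score > best_gain:
                best_gain, best_candidate = score - best_score, (d, s, tokens)
        if best_candidate is None:
            break  # no remaining sentence improves the score
        d, s, tokens = best_candidate
        selected.append((d, s))
        selected_tokens += tokens
        best_score += best_gain
    return selected
```

Labels produced this way are only an approximation of the contents an abstractor would ideally receive, which is the limitation the abstract points to when it says the optimal contents are generally unknown.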

Code & Models

Repositories

yunzhusong/NAACL2022-REFLECT
PyTorch · Official

Taxonomy

Topics: Topic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques

Methods: Attention Is All You Need · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Residual Connection · Softmax · Label Smoothing · Adam · Position-Wise Feed-Forward Layer · Layer Normalization