Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness
Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai

TL;DR
This paper introduces a hierarchical extract-then-abstract Transformer framework for multi-document summarization, utilizing loss weighting and reinforcement learning to improve extraction quality and summary coherence, achieving state-of-the-art results.
Contribution
The paper proposes a novel loss weighting mechanism and reinforcement learning approach to enhance extract-then-abstract models for multi-document summarization.
Findings
Outperforms strong baselines on multiple datasets
Achieves the best results on Multi-News, Multi-XScience, and WikiCatSum
Effectively balances training and testing objectives
Abstract
A notable challenge in Multi-Document Summarization (MDS) is the extremely-long length of the input. In this paper, we present an extract-then-abstract Transformer framework to overcome the problem. Specifically, we leverage pre-trained language models to construct a hierarchical extractor for salient sentence selection across documents and an abstractor for rewriting the selected contents as summaries. However, learning such a framework is challenging since the optimal contents for the abstractor are generally unknown. Previous works typically create pseudo extraction oracle to enable the supervised learning for both the extractor and the abstractor. Nevertheless, we argue that the performance of such methods could be restricted due to the insufficient information for prediction and inconsistent objectives between training and testing. To this end, we propose a loss weighting mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Residual Connection · Softmax · Label Smoothing · Adam · Position-Wise Feed-Forward Layer · Layer Normalization
