Deep Communicating Agents for Abstractive Summarization

Asli Celikyilmaz; Antoine Bosselut; Xiaodong He; Yejin Choi

arXiv:1803.10357·cs.CL·August 17, 2018

TL;DR

This paper introduces deep communicating agents, an encoder-decoder architecture for abstractive summarization of long documents. Encoding is divided among multiple collaborating agents, each responsible for a subsection of the input, and the resulting summaries are of higher quality than those of several strong baselines.

Contribution

The paper proposes a deep communicating agents architecture in which multiple encoders collaborate to represent a long document and feed a single decoder. The whole model is trained end-to-end with reinforcement learning.

Findings

01

Multiple communicating encoders outperform non-communicating ones.

02

The approach yields higher quality summaries than strong baselines.

03

End-to-end training with reinforcement learning is used to produce focused and coherent summaries.

Abstract

We present deep communicating agents in an encoder-decoder architecture to address the challenges of representing a long document for abstractive summarization. With deep communicating agents, the task of encoding a long text is divided across multiple collaborating agents, each in charge of a subsection of the input text. These encoders are connected to a single decoder, trained end-to-end using reinforcement learning to generate a focused and coherent summary. Empirical results demonstrate that multiple communicating encoders lead to a higher quality summary compared to several strong baselines, including those based on a single encoder or multiple non-communicating encoders.
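The division of labor described in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the function names (`split_among_agents`, `communicate`, `attend_over_agents`), the mean-of-others message, and the fixed 0.5 mixing weight are all assumptions chosen for illustration, and plain lists of floats stand in for the learned encoder states a real model would use.

```python
import math

def split_among_agents(tokens, num_agents):
    """Divide a token list into contiguous chunks, one per agent (hypothetical helper)."""
    size = math.ceil(len(tokens) / num_agents)
    return [tokens[i * size:(i + 1) * size] for i in range(num_agents)]

def communicate(states):
    """One toy communication round.

    Each agent receives the mean of the OTHER agents' state vectors and
    mixes it with its own state. The 0.5 mixing weight is an assumption;
    a trained model would learn this update instead.
    """
    n = len(states)
    if n < 2:
        return [list(s) for s in states]  # nothing to communicate with
    dim = len(states[0])
    updated = []
    for i, own in enumerate(states):
        others = [s for j, s in enumerate(states) if j != i]
        message = [sum(s[d] for s in others) / len(others) for d in range(dim)]
        updated.append([0.5 * (own[d] + message[d]) for d in range(dim)])
    return updated

def attend_over_agents(decoder_state, states):
    """Single decoder step: softmax attention over the agents' states."""
    dim = len(states[0])
    scores = [sum(q * k for q, k in zip(decoder_state, s)) for s in states]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(x - top) for x in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    context = [sum(w * s[d] for w, s in zip(weights, states)) for d in range(dim)]
    return weights, context

# Usage: three agents, each holding a made-up 2-dimensional state.
chunks = split_among_agents(list(range(10)), 3)
states = communicate([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
weights, context = attend_over_agents([1.0, 0.0], states)
```

In this sketch the non-communicating baseline mentioned in the abstract would correspond to skipping the `communicate` call, so that each agent's state reflects only its own chunk.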
