Efficient Summarization with Read-Again and Copy Mechanism

Wenyuan Zeng; Wenjie Luo; Sanja Fidler; Raquel Urtasun

arXiv:1611.03382·cs.CL·November 11, 2016·82 cites

Efficient Summarization with Read-Again and Copy Mechanism

Wenyuan Zeng, Wenjie Luo, Sanja Fidler, Raquel Urtasun

PDF

Open Access

TL;DR

This paper introduces a novel encoder-decoder model with a read-again mechanism and a copy mechanism, improving sequence summarization by producing better representations and handling out-of-vocabulary words efficiently.

Contribution

It proposes a simple read-again encoder and a copy mechanism, addressing limitations of existing models in sequence summarization tasks.

Findings

01

Outperforms state-of-the-art on Gigaword dataset

02

Achieves faster decoding with small vocabularies

03

Handles out-of-vocabulary words effectively

Abstract

Encoder-decoder models have been widely used to solve sequence to sequence prediction tasks. However current approaches suffer from two shortcomings. First, the encoders compute a representation of each word taking into account only the history of the words it has read so far, yielding suboptimal representations. Second, current decoders utilize large vocabularies in order to minimize the problem of unknown words, resulting in slow decoding times. In this paper we address both shortcomings. Towards this goal, we first introduce a simple mechanism that first reads the input sequence before committing to a representation of each word. Furthermore, we propose a simple copy mechanism that is able to exploit very small vocabularies and handle out-of-vocabulary words. We demonstrate the effectiveness of our approach on the Gigaword dataset and DUC competition outperforming the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques