DeepChannel: Salience Estimation by Contrastive Learning for Extractive   Document Summarization

Jiaxin Shi; Chen Liang; Lei Hou; Juanzi Li; Zhiyuan Liu; Hanwang Zhang

arXiv:1811.02394·cs.CL·November 8, 2018·6 cites

DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization

Jiaxin Shi, Chen Liang, Lei Hou, Juanzi Li, Zhiyuan Liu, Hanwang Zhang

PDF

Open Access 1 Repo

TL;DR

DeepChannel is a neural model for extractive summarization that estimates sentence salience using contrastive learning, achieving state-of-the-art results with high data efficiency and robustness across datasets.

Contribution

It introduces a contrastive training strategy for salience estimation, improving extractive summarization performance and data efficiency.

Findings

01

Achieves state-of-the-art ROUGE scores on CNN/Daily Mail.

02

Demonstrates robustness on out-of-domain DUC2007 dataset.

03

Reaches high ROUGE-1 F-1 with only 1% of training data.

Abstract

We propose DeepChannel, a robust, data-efficient, and interpretable neural model for extractive document summarization. Given any document-summary pair, we estimate a salience score, which is modeled using an attention-based deep neural network, to represent the salience degree of the summary for yielding the document. We devise a contrastive training strategy to learn the salience estimation network, and then use the learned salience score as a guide and iteratively extract the most salient sentences from the document as our generated summary. In experiments, our model not only achieves state-of-the-art ROUGE scores on CNN/Daily Mail dataset, but also shows strong robustness in the out-of-domain test on DUC2007 test set. Moreover, our model reaches a ROUGE-1 F-1 score of 39.41 on CNN/Daily Mail test set with merely $1/100$ training set, demonstrating a tremendous data efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lliangchenc/DeepChannel
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques