Self-Supervised Learning for Contextualized Extractive Summarization

Hong Wang; Xin Wang; Wenhan Xiong; Mo Yu; Xiaoxiao Guo; Shiyu Chang,; William Yang Wang

arXiv:1906.04466·cs.CL·June 12, 2019·6 cites

Self-Supervised Learning for Contextualized Extractive Summarization

Hong Wang, Xin Wang, Wenhan Xiong, Mo Yu, Xiaoxiao Guo, Shiyu Chang,, William Yang Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces three self-supervised auxiliary tasks for extractive summarization that enhance document-level context understanding, leading to improved performance over existing models on the CNN/DM dataset.

Contribution

The paper proposes novel self-supervised auxiliary pre-training tasks that explicitly capture document-level context for extractive summarization.

Findings

01

Pre-training with auxiliary tasks improves summarization performance.

02

The approach outperforms previous state-of-the-art models.

03

Simple models after pre-training achieve superior results.

Abstract

Existing models for extractive summarization are usually trained from scratch with a cross-entropy loss, which does not explicitly capture the global context at the document level. In this paper, we aim to improve this task by introducing three auxiliary pre-training tasks that learn to capture the document-level context in a self-supervised fashion. Experiments on the widely-used CNN/DM dataset validate the effectiveness of the proposed auxiliary tasks. Furthermore, we show that after pre-training, a clean model with simple building blocks is able to outperform previous state-of-the-art that are carefully designed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques