Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive   Text Summarization

Chujie Zheng; Kunpeng Zhang; Harry Jiannan Wang; Ling Fan; Zhe Wang

arXiv:2108.11992·cs.CL·August 30, 2021

Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization

Chujie Zheng, Kunpeng Zhang, Harry Jiannan Wang, Ling Fan, Zhe Wang

PDF

Open Access 2 Repos

TL;DR

This paper introduces a contrastive learning-enhanced seq2seq autoencoder that improves abstractive text summarization performance, achieving competitive results with less complex models and training resources.

Contribution

It proposes a novel combination of denoising autoencoder and contrastive learning with sentence-level augmentation for better summarization.

Findings

01

Outperforms many existing benchmarks in ROUGE scores

02

Achieves comparable results to state-of-the-art models

03

Enhances denoising ability through contrastive learning

Abstract

In this paper, we present a denoising sequence-to-sequence (seq2seq) autoencoder via contrastive learning for abstractive text summarization. Our model adopts a standard Transformer-based architecture with a multi-layer bi-directional encoder and an auto-regressive decoder. To enhance its denoising ability, we incorporate self-supervised contrastive learning along with various sentence-level document augmentation. These two components, seq2seq autoencoder and contrastive learning, are jointly trained through fine-tuning, which improves the performance of text summarization with regard to ROUGE scores and human evaluation. We conduct experiments on two datasets and demonstrate that our model outperforms many existing benchmarks and even achieves comparable performance to the state-of-the-art abstractive systems trained with more complex architecture and extensive computation resources.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsContrastive Learning · Enhanced Seq2Seq Autoencoder via Contrastive Learning · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence