CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in   Abstractive Summarization

Shuyang Cao; Lu Wang

arXiv:2109.09209·cs.CL·September 21, 2021

CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization

Shuyang Cao, Lu Wang

PDF

Open Access 3 Repos 1 Models

TL;DR

This paper introduces a contrastive learning approach that improves the factual accuracy of abstractive summaries by training models to distinguish correct summaries from common error types, leading to more faithful outputs.

Contribution

The paper presents a novel contrastive learning framework utilizing reference and error-inducing summaries, with strategies tailored to common model errors, enhancing factuality in summarization.

Findings

01

Consistently produces more factual summaries across datasets.

02

Outperforms error correction and reranking methods.

03

Human evaluations confirm reduced errors in summaries.

Abstract

We study generating abstractive summaries that are faithful and factually consistent with the given articles. A novel contrastive learning formulation is presented, which leverages both reference summaries, as positive training data, and automatically generated erroneous summaries, as negative training data, to train summarization systems that are better at distinguishing between them. We further design four types of strategies for creating negative samples, to resemble errors made commonly by two state-of-the-art models, BART and PEGASUS, found in our new human annotations of summary errors. Experiments on XSum and CNN/Daily Mail show that our contrastive learning framework is robust across datasets and models. It consistently produces more factual summaries than strong comparisons with post error correction, entailment-based reranking, and unlikelihood training, according to QA-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
aiautomationlab/german-news-title-gen-mt5
model· 38 dl· ♡ 4
38 dl♡ 4

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · PEGASUS · Linear Layer · Contrastive Learning · Dense Connections · Multi-Head Attention · Byte Pair Encoding · Softmax · Dropout