Mutual Information Alleviates Hallucinations in Abstractive   Summarization

Liam van der Poel; Ryan Cotterell; Clara Meister

arXiv:2210.13210·cs.CL·November 1, 2022

Mutual Information Alleviates Hallucinations in Abstractive Summarization

Liam van der Poel, Ryan Cotterell, Clara Meister

PDF

3 Repos

TL;DR

This paper introduces a mutual information-based decoding strategy to reduce hallucinations in abstractive summarization by addressing model uncertainty, improving factual accuracy without sacrificing summary quality.

Contribution

It proposes a novel decoding method that switches to optimizing mutual information during uncertain model predictions to mitigate hallucinations in summarization.

Findings

01

Reduces hallucinated content in summaries

02

Maintains high Rouge and BertS scores

03

Addresses model uncertainty as a key factor

Abstract

Despite significant progress in the quality of language generated from abstractive summarization models, these models still exhibit the tendency to hallucinate, i.e., output content not supported by the source document. A number of works have tried to fix--or at least uncover the source of--the problem with limited success. In this paper, we identify a simple criterion under which models are significantly more likely to assign more probability to hallucinated content during generation: high model uncertainty. This finding offers a potential explanation for hallucinations: models default to favoring text with high marginal probability, i.e., high-frequency occurrences in the training set, when uncertain about a continuation. It also motivates possible routes for real-time intervention during decoding to prevent such hallucinations. We propose a decoding strategy that switches to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.