Should We Trust This Summary? Bayesian Abstractive Summarization to The   Rescue

Alexios Gidiotis; Grigorios Tsoumakas

arXiv:2105.10155·cs.CL·May 4, 2022

Should We Trust This Summary? Bayesian Abstractive Summarization to The Rescue

Alexios Gidiotis, Grigorios Tsoumakas

PDF

Open Access

TL;DR

This paper introduces a Bayesian approach to abstractive summarization using Monte Carlo dropout, enabling uncertainty quantification, improved summary quality, and better robustness compared to traditional models.

Contribution

It extends state-of-the-art summarization models with Bayesian inference, allowing for uncertainty estimation and enhanced performance through Variational Bayesian methods.

Findings

01

Bayesian models outperform deterministic counterparts on benchmarks.

02

Uncertainty filtering improves summary quality.

03

Bayesian summaries are more robust to uncertainty.

Abstract

We explore the notion of uncertainty in the context of modern abstractive summarization models, using the tools of Bayesian Deep Learning. Our approach approximates Bayesian inference by first extending state-of-the-art summarization models with Monte Carlo dropout and then using them to perform multiple stochastic forward passes. Based on Bayesian inference we are able to effectively quantify uncertainty at prediction time. Having a reliable uncertainty measure, we can improve the experience of the end user by filtering out generated summaries of high uncertainty. Furthermore, uncertainty estimation could be used as a criterion for selecting samples for annotation, and can be paired nicely with active learning and human-in-the-loop approaches. Finally, Bayesian inference enables us to find a Bayesian summary which performs better than a deterministic one and is more robust to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Algorithms and Data Compression

MethodsAttention Is All You Need · PEGASUS · Linear Layer · Residual Connection · Multi-Head Attention · Dense Connections · Adam · Softmax · Layer Normalization · Byte Pair Encoding