On Improving Summarization Factual Consistency from Natural Language Feedback

Yixin Liu; Budhaditya Deb; Milagro Teruel; Aaron Halfaker; Dragomir Radev; Ahmed H. Awadallah

arXiv:2212.09968 · cs.CL · October 17, 2023

TL;DR

This paper introduces DeFacto, a high-quality dataset of natural language feedback for summarization, and demonstrates how fine-tuned models can improve factual consistency in summaries by leveraging this feedback.

Contribution

The work presents a new dataset, DeFacto, and explores three tasks involving natural language feedback to enhance summarization factual accuracy.

Findings

01. Fine-tuned models improve factual consistency using DeFacto.

02. Large language models lack zero-shot ability for feedback-based summarization.

03. DeFacto enables generation of factually consistent summaries and feedback insights.

Abstract

Despite the recent progress in language generation models, their outputs may not always meet user expectations. In this work, we study whether informational feedback in natural language can be leveraged to improve generation quality and user preference alignment. To this end, we consider factual consistency in summarization, the quality that the summary should only contain information supported by the input documents, as the user-expected preference. We collect a high-quality dataset, DeFacto, containing human demonstrations and informational natural language feedback consisting of corrective instructions, edited summaries, and explanations with respect to the factual consistency of the summary. Using our dataset, we study three natural language generation tasks: (1) editing a summary by following the human feedback, (2) generating human feedback for editing the original summary, and…
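The feedback record described in the abstract, and the first two tasks it names, can be sketched roughly as follows. This is only an illustration: the class name, field names, prompt layout, and the toy example are assumptions made here, not the actual schema or input format of the DeFacto dataset.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class FeedbackRecord:
    """Hypothetical record; field names are illustrative, not DeFacto's schema."""
    document: str         # input document being summarized
    initial_summary: str  # system summary that may contain factual errors
    instruction: str      # corrective instruction written by the annotator
    explanation: str      # explanation about the summary's factual consistency
    edited_summary: str   # human-edited summary


def editing_example(rec: FeedbackRecord) -> Tuple[str, str]:
    """Task (1): edit a summary by following the human feedback."""
    source = (
        f"Document: {rec.document}\n"
        f"Summary: {rec.initial_summary}\n"
        f"Feedback: {rec.instruction}"
    )
    return source, rec.edited_summary


def feedback_example(rec: FeedbackRecord) -> Tuple[str, str]:
    """Task (2): generate human feedback for editing the original summary."""
    source = f"Document: {rec.document}\nSummary: {rec.initial_summary}"
    return source, rec.instruction


# Made-up toy record, for illustration only.
toy = FeedbackRecord(
    document="The council approved the budget on Tuesday.",
    initial_summary="The council rejected the budget on Tuesday.",
    instruction="Replace 'rejected' with 'approved'.",
    explanation="The document says the budget was approved.",
    edited_summary="The council approved the budget on Tuesday.",
)
```

Each function returns a (source, target) text pair of the kind a sequence-to-sequence model could be fine-tuned on; how the paper actually formats its inputs is not stated in this listing.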

Code & Models

Repositories

microsoft/defacto (Official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications