On Improving Summarization Factual Consistency from Natural Language Feedback
Yixin Liu, Budhaditya Deb, Milagro Teruel, Aaron Halfaker, Dragomir, Radev, Ahmed H. Awadallah

TL;DR
This paper introduces DeFacto, a high-quality dataset of natural language feedback for summarization, and demonstrates how fine-tuned models can improve factual consistency in summaries by leveraging this feedback.
Contribution
The work presents a new dataset, DeFacto, and explores three tasks involving natural language feedback to enhance summarization factual accuracy.
Findings
Fine-tuned models improve factual consistency using DeFacto.
Large language models lack zero-shot ability for feedback-based summarization.
DeFacto enables generation of factually consistent summaries and feedback insights.
Abstract
Despite the recent progress in language generation models, their outputs may not always meet user expectations. In this work, we study whether informational feedback in natural language can be leveraged to improve generation quality and user preference alignment. To this end, we consider factual consistency in summarization, the quality that the summary should only contain information supported by the input documents, as the user-expected preference. We collect a high-quality dataset, DeFacto, containing human demonstrations and informational natural language feedback consisting of corrective instructions, edited summaries, and explanations with respect to the factual consistency of the summary. Using our dataset, we study three natural language generation tasks: (1) editing a summary by following the human feedback, (2) generating human feedback for editing the original summary, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
