Improving Factual Consistency of News Summarization by Contrastive   Preference Optimization

Huawen Feng; Yan Fan; Xiong Liu; Ting-En Lin; Zekun Yao; Yuchuan Wu,; Fei Huang; Yongbin Li; Qianli Ma

arXiv:2310.19347·cs.CL·February 17, 2025·1 cites

Improving Factual Consistency of News Summarization by Contrastive Preference Optimization

Huawen Feng, Yan Fan, Xiong Liu, Ting-En Lin, Zekun Yao, Yuchuan Wu,, Fei Huang, Yongbin Li, Qianli Ma

PDF

Open Access

TL;DR

This paper introduces Contrastive Preference Optimization (CPO), a novel training method that enhances large language models' ability to generate factually consistent news summaries by better distinguishing faithful content from hallucinations.

Contribution

The paper proposes CPO, a new contrastive learning approach, and a probing-based training method to improve LLMs' factual accuracy in news summarization.

Findings

01

CPO significantly improves factual consistency in LLM-generated summaries.

02

Enhanced perception of hallucinations leads to more reliable summarization.

03

Experimental results demonstrate the effectiveness of the proposed methods.

Abstract

Despite the recent progress in news summarization made by large language models (LLMs), they often generate summaries that are factually inconsistent with original articles, known as "hallucinations" in text generation. Unlike previous small models (e.g., BART, T5), current LLMs make fewer silly mistakes but more sophisticated ones, such as imposing cause and effect, adding false details, overgeneralizing, etc. These hallucinations are challenging to detect through traditional methods, which poses great challenges for improving the factual consistency of text summarization. In this paper, we propose Contrastive Preference Optimization (CPO) to disentangle the LLMs' propensities to generate faithful and fake content. Furthermore, we adopt a probing-based specific training method to improve their capacity of distinguishing two types of propensities. In this way, LLMs can execute the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Layer Normalization · Byte Pair Encoding · Dense Connections · Residual Connection · Softmax