Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning

Sindhuja Chaduvula; Ahmed Y. Radwan; Azib Farooq; Yani Ioannou; Shaina Raza

arXiv:2601.03027·cs.CL·April 16, 2026

Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning

Sindhuja Chaduvula, Ahmed Y. Radwan, Azib Farooq, Yani Ioannou, Shaina Raza

PDF

1 Models 1 Datasets

TL;DR

This paper introduces F-DPO, a simple method that uses factuality labels to improve the factual accuracy of large language models and reduce hallucinations without extra training stages.

Contribution

F-DPO is a novel extension of DPO that incorporates factuality labels and a label-flipping transformation to enhance model factuality and reduce hallucinations.

Findings

01

F-DPO reduces hallucination rates by 5x on Qwen3-8B.

02

F-DPO improves factuality scores by 50% on evaluated models.

03

F-DPO enhances out-of-distribution factuality benchmarks.

Abstract

Preference alignment methods such as RLHF and Direct Preference Optimization (DPO) improve instruction following, but they can also reinforce hallucinations when preference judgments reward fluency and confidence over factual correctness. We introduce F-DPO (Factuality-aware Direct Preference Optimization), a simple extension of DPO that uses only binary factuality labels. F-DPO (i) applies a label-flipping transformation that corrects misordered preference pairs so the chosen response is never less factual than the rejected one, and (ii) adds a factuality-aware margin that emphasizes pairs with clear correctness differences, while reducing to standard DPO when both responses share the same factuality. We construct factuality-aware preference data by augmenting DPO pairs with binary factuality indicators and synthetic hallucinated variants. Across seven open-weight LLMs (1B-14B), F-DPO…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
vector-institute/Factuality-Alignment-Qwen2.5-14B
model· 1 dl
1 dl

Datasets

vector-institute/Factuality_Alignment
dataset· 10 dl
10 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.