PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning

Langming Liu; Kangtao Lv; Haibin Chen; Weidong Zhang; Yejing Wang; Shilei Liu; Xin Tong; Yujin Yuan; Yongwei Wang; Wenbo Su; Bo Zheng

arXiv:2602.01875·cs.CL·February 3, 2026

PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning

Langming Liu, Kangtao Lv, Haibin Chen, Weidong Zhang, Yejing Wang, Shilei Liu, Xin Tong, Yujin Yuan, Yongwei Wang, Wenbo Su, Bo Zheng

PDF

Open Access

TL;DR

PretrainRL is a novel reinforcement learning framework integrated into the pretraining phase of large language models, aimed at reducing factual hallucinations by reshaping probability distributions and emphasizing truthful knowledge.

Contribution

It introduces a new pretraining approach that actively debiases falsehoods and enhances factual accuracy in large language models, addressing a core root of hallucinations.

Findings

01

PretrainRL significantly reduces factual hallucinations in LLMs.

02

The method outperforms existing approaches on multiple benchmarks.

03

Efficient negative sampling and novel metrics improve factual knowledge learning.

Abstract

Large language models (LLMs), despite their powerful capabilities, suffer from factual hallucinations where they generate verifiable falsehoods. We identify a root of this issue: the imbalanced data distribution in the pretraining corpus, which leads to a state of "low-probability truth" and "high-probability falsehood". Recent approaches, such as teaching models to say "I don't know" or post-hoc knowledge editing, either evade the problem or face catastrophic forgetting. To address this issue from its root, we propose \textbf{PretrainRL}, a novel framework that integrates reinforcement learning into the pretraining phase to consolidate factual knowledge. The core principle of PretrainRL is "\textbf{debiasing then learning}." It actively reshapes the model's probability distribution by down-weighting high-probability falsehoods, thereby making "room" for low-probability truths to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Multimodal Machine Learning Applications