Are Reasoning Models More Prone to Hallucination?

Zijun Yao; Yantao Liu; Yanxu Chen; Jianhui Chen; Junfeng Fang; Lei Hou; Juanzi Li; Tat-Seng Chua

arXiv:2505.23646·cs.CL·May 30, 2025·2 cites

Are Reasoning Models More Prone to Hallucination?

Zijun Yao, Yantao Liu, Yanxu Chen, Jianhui Chen, Junfeng Fang, Lei Hou, Juanzi Li, Tat-Seng Chua

PDF

Open Access

TL;DR

This paper investigates whether large reasoning models are more prone to hallucination, analyzing how different training pipelines and behaviors influence factual accuracy and model uncertainty.

Contribution

It provides a comprehensive evaluation of hallucination in LRMs, revealing how post-training methods and behaviors affect factuality and uncertainty alignment.

Findings

01

Cold start supervised fine-tuning reduces hallucination

02

RL training without cold start increases hallucination

03

Hallucination correlates with misalignment of uncertainty and factuality

Abstract

Recently evolved large reasoning models (LRMs) show powerful performance in solving complex tasks with long chain-of-thought (CoT) reasoning capability. As these LRMs are mostly developed by post-training on formal reasoning tasks, whether they generalize the reasoning capability to help reduce hallucination in fact-seeking tasks remains unclear and debated. For instance, DeepSeek-R1 reports increased performance on SimpleQA, a fact-seeking benchmark, while OpenAI-o3 observes even severer hallucination. This discrepancy naturally raises the following research question: Are reasoning models more prone to hallucination? This paper addresses the question from three perspectives. (1) We first conduct a holistic evaluation for the hallucination in LRMs. Our analysis reveals that LRMs undergo a full post-training pipeline with cold start supervised fine-tuning (SFT) and verifiable reward RL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDecision-Making and Behavioral Economics · Logic, Reasoning, and Knowledge · Explainable Artificial Intelligence (XAI)