The Reasoning Trap: How Enhancing LLM Reasoning Amplifies Tool Hallucination
Chenlong Yin, Zeyang Sha, Shiwen Cui, Changhua Meng, Zechao Li

TL;DR
Enhancing reasoning in Large Language Models increases tool hallucination, and this effect is consistent across methods and training data, revealing a fundamental reliability-capability trade-off.
Contribution
This work systematically demonstrates that reasoning enhancement methods causally amplify tool hallucination in LLMs, highlighting a key reliability challenge.
Findings
Strengthening reasoning through RL increases tool hallucination proportionally.
Training on non-tool tasks still amplifies hallucination when reasoning is enhanced.
The hallucination effect occurs across different reasoning methods and training approaches.
Abstract
Enhancing the reasoning capabilities of Large Language Models (LLMs) is a key strategy for building Agents that "think then act." However, recent observations, like OpenAI's o3, suggest a paradox: stronger reasoning often coincides with increased hallucination, yet no prior work has systematically examined whether reasoning enhancement itself causes tool hallucination. To address this gap, we pose the central question: Does strengthening reasoning increase tool hallucination? To answer this, we introduce SimpleToolHalluBench, a diagnostic benchmark measuring tool hallucination in two failure modes: (i) no tool available, and (ii) only distractor tools available. Through controlled experiments, we establish three key findings. First, we demonstrate a causal relationship: progressively enhancing reasoning through RL increases tool hallucination proportionally with task performance gains.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
