Verify Before You Commit: Towards Faithful Reasoning in LLM Agents via Self-Auditing

Wenhao Yuan; Chenchen Lin; Jian Chen; Jinfeng Xu; Xuehe Wang; Edith Cheuk Han Ngai

arXiv:2604.08401·cs.AI·April 10, 2026

Verify Before You Commit: Towards Faithful Reasoning in LLM Agents via Self-Auditing

Wenhao Yuan, Chenchen Lin, Jian Chen, Jinfeng Xu, Xuehe Wang, Edith Cheuk Han Ngai

PDF

TL;DR

This paper introduces SAVeR, a framework for LLM agents that verifies and repairs internal reasoning to ensure logical consistency and faithfulness, improving long-term decision accuracy.

Contribution

SAVeR is the first method to enforce internal belief verification and repair in LLM agents, enhancing reasoning faithfulness without sacrificing task performance.

Findings

01

Improves reasoning faithfulness across six benchmark datasets.

02

Maintains competitive performance on end tasks.

03

Effectively localizes and repairs reasoning violations.

Abstract

In large language model (LLM) agents, reasoning trajectories are treated as reliable internal beliefs for guiding actions and updating memory. However, coherent reasoning can still violate logical or evidential constraints, allowing unsupported beliefs repeatedly stored and propagated across decision steps, leading to systematic behavioral drift in long-horizon agentic systems. Most existing strategies rely on the consensus mechanism, conflating agreement with faithfulness. In this paper, inspired by the vulnerability of unfaithful intermediate reasoning trajectories, we propose \textbf{S}elf-\textbf{A}udited \textbf{Ve}rified \textbf{R}easoning (\textsc{SAVeR}), a novel framework that enforces verification over internal belief states within the agent before action commitment, achieving faithful reasoning. Concretely, we structurally generate persona-based diverse candidate beliefs for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.