Self-Correcting RAG: Enhancing Faithfulness via MMKP Context Selection and NLI-Guided MCTS

Shijia Xu; Zhou Wu; Xiaolong Jia; Yu Wang; Kai Liu; April Xiaowen Dong

arXiv:2604.10734·cs.CL·April 14, 2026

TL;DR

Self-Correcting RAG selects context by solving a multi-dimensional multiple-choice knapsack problem (MMKP) and validates answers with natural language inference (NLI)-guided Monte Carlo Tree Search (MCTS), improving reasoning accuracy and reducing hallucinations.

Contribution

It introduces a novel framework combining MMKP-based context selection and NLI-guided MCTS for more faithful and accurate retrieval-augmented generation.

Findings

01 Improves reasoning accuracy on multi-hop questions.

02 Reduces hallucinations in generated answers.

03 Outperforms existing baselines on multiple datasets.

Abstract

Retrieval-augmented generation (RAG) substantially extends the knowledge boundary of large language models. However, it still faces two major challenges when handling complex reasoning tasks: low context utilization and frequent hallucinations. To address these issues, we propose Self-Correcting RAG, a unified framework that reformulates retrieval and generation as constrained optimization and path planning. On the input side, we move beyond traditional greedy retrieval and, for the first time, formalize context selection as a multi-dimensional multiple-choice knapsack problem (MMKP), thereby maximizing information density and removing redundancy under a strict token budget. On the output side, we introduce a natural language inference (NLI)-guided Monte Carlo Tree Search (MCTS) mechanism, which leverages test-time compute to dynamically explore reasoning trajectories and validate the…
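
To make the MMKP framing of context selection concrete, here is a minimal Python sketch. It is not the paper's implementation: the `Passage` type, the `select_context` function, the grouping of passages into redundancy clusters, the relevance scores, and the second resource dimension (a cap on passage count) are all assumptions made for illustration. Only the idea of maximizing value under a strict token budget comes from the abstract. The sketch picks at most one passage per group and uses exhaustive search, which is only practical for toy inputs.

```python
from itertools import product
from typing import List, NamedTuple, Sequence, Tuple


class Passage(NamedTuple):
    """Hypothetical candidate passage; all fields are illustrative assumptions."""
    text: str
    relevance: float          # assumed value score for the passage
    costs: Tuple[int, ...]    # resource use per dimension, here (tokens, passage count)


def select_context(
    groups: Sequence[Sequence[Passage]],
    budgets: Sequence[int],
) -> Tuple[List[Passage], float]:
    """Choose at most one passage per group to maximize total relevance
    while keeping every resource dimension within its budget.

    Exhaustive search over all combinations: exponential in the number
    of groups, so suitable only as a small worked example.
    """
    best_choice: List[Passage] = []
    best_value = 0.0
    # Each group offers its passages plus the option of skipping it (None).
    options = [[None, *group] for group in groups]
    for combo in product(*options):
        chosen = [p for p in combo if p is not None]
        totals = [sum(p.costs[d] for p in chosen) for d in range(len(budgets))]
        if any(total > budget for total, budget in zip(totals, budgets)):
            continue  # violates at least one budget dimension
        value = sum(p.relevance for p in chosen)
        if value > best_value:
            best_choice, best_value = chosen, value
    return best_choice, best_value


# Toy data: each inner list is a cluster of near-duplicate passages.
groups = [
    [Passage("A", 0.9, (120, 1)), Passage("B", 0.7, (60, 1))],
    [Passage("C", 0.8, (100, 1)), Passage("D", 0.5, (40, 1))],
    [Passage("E", 0.6, (90, 1))],
]
budgets = (200, 2)  # assumed limits: 200 tokens, at most 2 passages

chosen, value = select_context(groups, budgets)
print([p.text for p in chosen], round(value, 2))
```

In this toy instance the top two passages by relevance (A and C) together exceed the token budget, so a greedy top-k pick would have to drop one of them, whereas the joint search can trade A for the cheaper B in the same group. A realistic system would replace the exhaustive loop with a dynamic-programming or heuristic MMKP solver.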

Code & Models

Repositories

xjiacs/Self-Correcting-RAG (GitHub)
