TL;DR
Self-Correcting RAG enhances large language models by optimizing context selection with MMKP and validating answers with NLI-guided MCTS, significantly improving reasoning accuracy and reducing hallucinations.
Contribution
It introduces a novel framework combining MMKP-based context selection and NLI-guided MCTS for more faithful and accurate retrieval-augmented generation.
Findings
Improves reasoning accuracy on multi-hop questions.
Reduces hallucinations in generated answers.
Outperforms existing baselines on multiple datasets.
Abstract
Retrieval-augmented generation (RAG) substantially extends the knowledge boundary of large language models. However, it still faces two major challenges when handling complex reasoning tasks: low context utilization and frequent hallucinations. To address these issues, we propose Self-Correcting RAG, a unified framework that reformulates retrieval and generation as constrained optimization and path planning. On the input side, we move beyond traditional greedy retrieval and, for the first time, formalize context selection as a multi-dimensional multiple-choice knapsack problem (MMKP), thereby maximizing information density and removing redundancy under a strict token budget. On the output side, we introduce a natural language inference (NLI)-guided Monte Carlo Tree Search (MCTS) mechanism, which leverages test-time compute to dynamically explore reasoning trajectories and validate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
