PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

Tianyi Huang; Caden Yang; Emily Yin; Eric Wang; Michael Zhang

arXiv:2603.20673·cs.CL·April 3, 2026

PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

Tianyi Huang, Caden Yang, Emily Yin, Eric Wang, Michael Zhang

PDF

TL;DR

PAVE introduces a validation and editing layer for retrieval-augmented LLMs, improving answer accuracy by explicitly checking and revising responses based on extracted premises.

Contribution

It presents a novel inference-time validation method that decomposes retrieved context into atomic facts, supports answer revision, and enhances evidence-grounded consistency.

Findings

01

PAVE outperforms simpler baselines in evidence-grounded QA.

02

Largest gain of 32.7 accuracy points on a span-grounded benchmark.

03

Explicit premise extraction improves answer support and reliability.

Abstract

Retrieval-augmented language models can retrieve relevant evidence yet still commit to answers before explicitly checking whether the retrieved context supports the conclusion. We present PAVE (Premise-Grounded Answer Validation and Editing), an inference-time validation layer for evidence-grounded question answering. PAVE decomposes retrieved context into question-conditioned atomic facts, drafts an answer, scores how well that draft is supported by the extracted premises, and revises low-support outputs before finalization. The resulting trace makes answer commitment auditable at the level of explicit premises, support scores, and revision decisions. In controlled ablations with a fixed retriever and backbone, PAVE outperforms simpler post-retrieval baselines in two evidence-grounded QA settings, with the largest gain reaching 32.7 accuracy points on a span-grounded benchmark. We view…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.